Machine Learning Trends

Our ML Trends dashboard offers curated key numbers, visualizations, and insights that showcase the significant growth and impact of artificial intelligence.

Last updated on Jun 07, 2024

Display growth values in:

Training compute

Likely

4.2 x/year

The training compute of notable ML models has been growing at since 2010.

90% confidence interval: to .

Training data

Plausible

2028

The median projected year in which most of the effective stock of publicly available human-generated text will be used in a training run is 2028.

90% confidence interval: 2026 to 2033.

Computational performance

Likely

1.35 x/year

The amount of FLOP/s for GPUs in FP32 precision grows by . A similar trend is observed for FP16.

90% confidence interval: to .

Algorithmic improvements

Plausible

5.1 %/year

The physical compute required to achieve a given performance in language models is declining at a rate of .

90% confidence interval: to .

Training costs

Likely

2.5 x/year

The cost in USD of training frontier ML models has grown by since 2016.

90% confidence interval: to .

Compute Trends

Deep Learning compute

Likely

4.2 x

The training compute of notable ML models has grown by since 2010.

90% confidence interval: to .

Pre-Deep Learning compute

Likely

1.5 x

The training compute of notable ML models grew by from 1956 to 2010.

90% confidence interval: to .

Report

Training compute of frontier AI models grows by 4-5x per year

Our expanded AI model database shows that the compute used to train recent models grew 4-5x yearly from 2010 to May 2024. We find similar growth in frontier models, recent large language models, and models from leading companies.

Most compute used in a training run

Plausible

5e25 FLOP

The total training compute for the final training run of Gemini Ultra, likely the most compute-intensive model to date, is estimated at 5e25 FLOP.

95% confidence interval: 1.1e25 to 2.1e26.

Data Trends

Language training dataset size

Likely

2.9 x

The training dataset size for language models has grown by since 2010.

90% confidence interval: to .

When will the largest training runs use all public human-generated text?

Plausible

2028

The median projected year in which most of the effective stock of publicly available human-generated text will be used in a training run is 2028.

90% confidence interval: 2026 to 2033.

Paper

Will we run out of data? Limits of LLM scaling based on human-generated data

We estimate the stock of human-generated public text at around 300 trillion tokens. If trends continue, language models will fully utilize this stock between 2026 and 2032, or even earlier if intensely overtrained.

Largest training dataset used to train an LLM

Uncertain

18 trillion tokens

Qwen2.5 models, including Qwen2.5-72B, were trained on 18 trillion tokens, making them the models with the largest publicly confirmed training datasets.

Stock of data on the internet

Plausible

510 trillion tokens

The amount of tokens in the indexed web, the portion of the web that is publicly accessible from search engines, is estimated at 510 trillion tokens.

95% confidence interval: 130 trillion tokens to 2100 trillion tokens.

Hardware Trends

Computational performance

Likely

1.35 x

The amount of FLOP/s for GPUs in FP32 precision is growing at . A similar trend is observed for FP16.

90% confidence interval: to .

Lower-precision number formats

Plausible

8 x

The average performance gain in FLOP/s from switching from FP32 to tensor-FP16 is 8x.

Memory capacity

Likely

1.2 x

DRAM capacity (Byte) is growing by .

90% confidence interval: to .

Memory bandwidth

Likely

1.18 x

DRAM bandwidth in Byte/s is growing by .

90% confidence interval: to .

Report

Trends in Machine Learning Hardware

FLOP/s performance in 47 ML hardware accelerators doubled every 2.3 years. Switching from FP32 to tensor-FP16 led to a further 10x performance increase. Memory capacity and bandwidth doubled every 4 years.

Highest performing GPU in Tensor-FP16

Likely

2.25e15 FLOP/s

The highest performance of a GPU in Tensor-FP16, from the NVIDIA B200 SXM, is 2.25e15 FLOP/s.

Highest performing GPU in INT8

Likely

4.5e15 OP/s

The highest performance of a GPU in INT8, from the NVIDIA B200 SXM, is 4.5e15 OP/s.

Algorithmic Progress

Compute-efficiency in language models

Plausible

5.1 %

The physical compute required to achieve a given performance in language models is declining at a rate of .

90% confidence interval: to .

Compute-efficiency in computer vision models

Plausible

331.1 %

The physical compute required to achieve a given performance in computer vision models is declining at a rate of .

95% confidence interval: to .

Contribution of algorithmic innovation

Plausible

35%

The improvements to compute efficiency explain roughly 35% of performance improvements in language modeling since 2014, vs 65% explained by increases in scale.

Paper

Algorithmic Progress in Language Models

Progress in language model performance surpasses what we’d expect from merely increasing computing resources, occurring at a pace equivalent to doubling computational power every 5 to 14 months.

Chinchilla scaling laws

Plausible

20 tokens per parameter

The ratio of data to parameters to achieve compute-optimal scaling for LLMs is 20 tokens per parameter.

Investment Trends

Training costs

Likely

2.5 x

The cost in USD of training frontier ML models has grown by since 2016.

90% confidence interval: to .

Hardware acquisition costs

Likely

297.6 x

The acquisition cost in USD of the hardware used to train frontier ML models has grown by since 2016.

90% confidence interval: to .

Paper

How Much Does It Cost to Train Frontier AI Models?

The cost of training frontier AI models has grown by a factor of 2 to 3x per year for the past eight years, suggesting that the largest models will cost over a billion dollars by 2027.

Most expensive AI model

Uncertain

$130 million

The total amortized cost of developing Gemini Ultra, including hardware, electricity, and staff compensation, is estimated at $130 million USD.

90% confidence interval: $70 million to $290 million.

Hardware acquisition cost for the most expensive AI model

Uncertain

$670 million

The acquisition cost of the hardware to train Gemini Ultra, including TPUs, other server components, and networking, is estimated at $670 million USD.

90% confidence interval: $280 million to $1.6 billion.

Biological Models

Training compute

Likely

8.7 x

The training compute of biological sequence models has been growing by since 2018.

Key DNA sequence database

Likely

8.3 x

The number of sequences stored in GenBank (INSDC) has been growing by between 2022 and 2023.

Report

Biological Sequence Models in the Context of the AI Directives

The expanded Epoch database now includes biological sequence models, revealing potential regulatory gaps in the White House’s Executive Order on AI and the growth of the compute used in their training.

Most compute-intensive biological sequence model

Likely

6.2e23 FLOP

The training compute for xTrimoPGLM, the most compute-intensive biological sequence model to date, is estimated at 6.2e23 FLOP.

Protein sequence data

Uncertain

~7 billion entries

The number of unique entries across protein sequence databases is estimated at ~7 billion entries.

Acknowledgements

We thank Tom Davidson, Lukas Finnveden, Charlie Giattino, Zach Stein-Perlman, Misha Yagudin, Jai Vipra, Patrick Levermore, Carl Shulman, Ben Bucknall and Daniel Kokotajlo for their feedback.

Several people have contributed to the design and maintenance of this dashboard, including Jaime Sevilla, Pablo Villalobos, Anson Ho, Tamay Besiroglu, Ege Erdil, Ben Cottier, Matthew Barnett, David Owen, Robi Rahman, Lennart Heim, Marius Hobbhahn, David Atkinson, Keith Wynroe, Christopher Phenicie, Nicole Maug, Aleksandar Kostovic, Alex Haase, Robert Sandler, Edu Roldan and Andrew Lucas.

Citation

Cite this work as

    Epoch AI (2023), "Key Trends and Figures in Machine Learning". Published online at epochai.org. Retrieved from: 'https://epochai.org/trends' [online resource]
  

BibTeX citation

@misc{epoch2023aitrends,
  title="Key Trends and Figures in Machine Learning",
  author={{Epoch AI}},
  year=2023,
  url={https://epochai.org/trends},
  note={Accessed: }
}

Cite this work as

              
                Epoch AI (2023), "Key Trends and Figures in Machine Learning". Published online at epochai.org. Retrieved from: 'https://epochai.org/trends' [online resource]

BibTeX citation

@misc{epoch2023aitrends,
  title={Key Trends and Figures in Machine Learning},
  author={{Epoch AI}},
  year={2023},
  url={https://epochai.org/trends},
  note={Accessed: }
}


            

If you spot an error or would like to provide feedback, please reach out at .

Research

Organization

@ 2024 Rethink Priorities

Epoch AI is fiscally sponsored by Rethink Priorities.