Notable AI Models

Our most comprehensive database, containing over 800 models that were state of the art, highly cited, or otherwise historically notable. It tracks key factors driving machine learning progress and includes over 400 training compute estimates.

Published June 19, 2024, last updated November 02, 2024

Data insights

The training compute of notable AI models is doubling roughly every six months.

Since 2010, the training compute used to create AI models has been growing at a rate of 4.2x per year. Most of this growth comes from increased spending, although improvements in hardware have also played a role.
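
This doubling time follows directly from the growth rate: an annual growth factor g implies a doubling time of 12 / log2(g) months. A minimal Python check of this conversion, also applied to the cost and dataset growth rates quoted later in this section:

    import math

    def doubling_time_months(annual_growth_factor: float) -> float:
        """Convert an annual growth factor into a doubling time in months."""
        return 12 / math.log2(annual_growth_factor)

    # Growth rates quoted in this section
    print(f"Compute, 4.2x/year:  {doubling_time_months(4.2):.1f} months")  # ~5.8
    print(f"Cost, 2.5x/year:     {doubling_time_months(2.5):.1f} months")  # ~9.1
    print(f"Datasets, 2.9x/year: {doubling_time_months(2.9):.1f} months")  # ~7.8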

Training compute costs are doubling every nine months for the largest AI models.

The cost of training large-scale ML models is growing at a rate of 2.5x per year. The most advanced models now cost hundreds of millions of dollars, with expenses measured by amortizing cluster costs over the training period. About half of this spending is on GPUs, with the remainder on other hardware and energy.
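
As a rough sketch of this amortization approach (all figures below are hypothetical, not estimates from the database), the cost attributed to a run charges the cluster's capital cost in proportion to the share of its useful life the run consumes, plus energy:

    def amortized_training_cost(hardware_cost_usd: float,
                                hardware_lifetime_years: float,
                                training_duration_years: float,
                                energy_cost_usd: float) -> float:
        """Charge hardware capital cost in proportion to the share of its
        useful life consumed by the training run, then add energy costs.
        A simplified sketch; real estimates involve more cost components."""
        amortized_hardware = hardware_cost_usd * (training_duration_years
                                                  / hardware_lifetime_years)
        return amortized_hardware + energy_cost_usd

    # Hypothetical example: a $500M cluster with a 4-year useful life,
    # used for a 3-month (0.25-year) run, plus $5M of energy.
    print(f"${amortized_training_cost(500e6, 4, 0.25, 5e6):,.0f}")  # $36,250,000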

Training compute has scaled up faster for language than vision.

Before 2020, the largest vision and language models had similar training compute. After that, language models rapidly scaled up their training compute, driven by the success of transformer-based architectures. Standalone vision models never caught up. Instead, the largest models have recently become multimodal, integrating vision and other modalities into systems such as GPT-4 and Gemini.

The size of datasets used to train language models doubles approximately every eight months.

Across all domains of ML, models are using more and more training data. In language modeling, datasets are growing at a rate of 2.9x per year. The largest models currently use datasets with tens of trillions of words. The largest public datasets are about ten times larger than this; for example, Common Crawl contains hundreds of trillions of words before filtering.

The length of time spent training notable models is growing.

Since 2010, the length of training runs has increased by 1.2x per year among notable models, excluding those that are fine-tuned from base models.

A continuation of this trend would ease hardware constraints by increasing training compute without requiring more chips or power.

However, longer training runs face a tradeoff: for very long runs, the gains from waiting for improved algorithms and hardware before starting can outweigh the benefits of extended training.

The power required to train frontier AI models is doubling annually.

Training frontier models requires a large and growing amount of power for GPUs, servers, cooling, and other equipment. This growth is driven primarily by increasing GPU counts; power draw per GPU is also rising, but only by a few percent per year.

Training compute has grown even faster, at around 4x/year. However, hardware efficiency (a 12x improvement in the last ten years), the adoption of lower-precision formats (an 8x improvement), and longer training runs (a 4x increase) account for power requirements growing roughly 2x/year more slowly than training compute.
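
These three factors multiply to a roughly 384x improvement over ten years, which annualizes to about 1.8x/year, consistent with the quoted ~2x/year gap between compute and power growth. A quick check:

    # Ten-year improvement factors cited above
    hardware_efficiency = 12  # more FLOP per watt from better chips
    lower_precision = 8       # e.g., moving from FP32 to lower-precision formats
    longer_runs = 4           # more compute per watt of installed capacity

    combined = hardware_efficiency * lower_precision * longer_runs  # 384
    annualized = combined ** (1 / 10)
    print(f"{annualized:.2f}x/year")  # ~1.81x/year, i.e. roughly 2x/year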

FAQ

What is a notable model?

A notable model meets any of the following criteria: (i) state-of-the-art improvement on a recognized benchmark; (ii) highly cited (over 1000 citations); (iii) historical relevance; (iv) significant use.

How was the Notable AI models database created?

The database was originally created for the report “Compute Trends Across Three Eras of Machine Learning” and has been continually expanded since then.

What is the difference between the Notable and Large-Scale AI Models databases?

The Notable AI Models database is our largest database, featuring over 800 machine learning models chosen for their significant technological advancements, wide citations, historical importance, extensive use, and/or high training costs. The Large-Scale AI Models database is a subset of the Notable AI Models database that highlights models with training compute over 10²³ floating point operations (FLOP).

Why are the number of models in the database and the results in the explorer different?

The explorer only shows models for which we have estimates to visualize, e.g., training compute, parameter count, or dataset size. While we do our best to collect as much information as possible about the models in our databases, this process is limited by the amount of publicly available information from companies, labs, researchers, and other organizations. Further details about coverage can be found in the Records section of the documentation.

How is the data licensed?

Epoch AI’s data is free to use, distribute, and reproduce provided the source and authors are credited under the Creative Commons Attribution license. A complete citation is provided in the “Use this work” section below.

How do you estimate details like training compute?

Where possible, we collect details such as training compute directly from publications. Otherwise, we estimate details from information such as model architecture and training data, or training hardware and duration. The documentation describes these approaches further. Per-entry notes on the estimation process can be found within the database.
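
As a sketch of these two approaches: a widely used rule of thumb puts dense-transformer training compute at roughly 6 x parameters x training tokens, while hardware-based estimates scale the cluster's peak throughput by a realized utilization rate and the training duration. The specific numbers below are hypothetical:

    def compute_from_architecture(parameters: float, training_tokens: float) -> float:
        """Approximate training compute (FLOP) via the common
        C ~= 6 * N * D rule of thumb for dense transformers."""
        return 6 * parameters * training_tokens

    def compute_from_hardware(num_chips: int, peak_flops_per_chip: float,
                              utilization: float, training_seconds: float) -> float:
        """Approximate training compute (FLOP) from hardware: peak
        throughput, scaled by realized utilization, times duration."""
        return num_chips * peak_flops_per_chip * utilization * training_seconds

    # Hypothetical: a 70B-parameter model trained on 2T tokens
    print(f"{compute_from_architecture(70e9, 2e12):.1e} FLOP")  # 8.4e+23
    # Hypothetical: 10,000 chips at 1e15 FLOP/s peak, 40% utilization, 60 days
    print(f"{compute_from_hardware(10_000, 1e15, 0.4, 60 * 86400):.1e} FLOP")  # 2.1e+25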

How accurate is the data?

Records are labeled based on the uncertainty of their training compute, parameter count, and dataset size estimates. “Confident” records are accurate to within a factor of 3x (larger or smaller), “Likely” records to within a factor of 10x, and “Speculative” records to within a factor of 30x. Further details are available in the documentation. If you spot a mistake, feel free to report it to data@epochai.org.
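
Concretely, a factor-of-k label means the true value is believed to lie between the estimate divided by k and the estimate multiplied by k. A small illustration with a hypothetical compute estimate:

    def uncertainty_band(estimate: float, factor: float) -> tuple[float, float]:
        """Return the (low, high) range implied by a multiplicative
        uncertainty factor, as used by the Confident/Likely/Speculative labels."""
        return estimate / factor, estimate * factor

    # Hypothetical 1e24 FLOP estimate labeled "Confident" (factor of 3x)
    low, high = uncertainty_band(1e24, 3)
    print(f"{low:.1e} to {high:.1e} FLOP")  # 3.3e+23 to 3.0e+24 FLOP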

How up-to-date is the data?

We strive to keep the database up to date, but machine learning is a fast-moving field with frequent new releases, so some models will inevitably not yet have been added. Generally, major models should be added within two weeks of their release; others are added periodically during literature reviews. If you notice a missing model, you can notify us at data@epochai.org.

How can I access this data?

Download the data in CSV format (see the loading sketch after this list).
Explore the data using our interactive tools.
View the data directly in a table format.
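
As a minimal sketch of working with the CSV download in pandas (the file name and column name below are hypothetical placeholders, so inspect the actual schema first):

    import pandas as pd

    # Load the downloaded CSV (file name is a placeholder)
    df = pd.read_csv("notable_ai_models.csv")

    # Inspect the available columns before relying on any names
    print(df.columns.tolist())

    # Example: if a training-compute column exists, list the top models by it
    # (column name is hypothetical; adjust to the actual schema)
    if "Training compute (FLOP)" in df.columns:
        print(df.sort_values("Training compute (FLOP)", ascending=False).head(10))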

Who can I contact with questions or comments about the data?

Feedback can be directed to the data team at data@epochai.org.

Documentation

The database is focused on notable machine learning models. A notable model meets any of the following criteria: (i) state-of-the-art improvement on a recognized benchmark; (ii) highly cited (over 1000 citations); (iii) historical relevance; (iv) significant use.

Models were initially selected from various sources, including literature reviews, Papers With Code, historical accounts, previous databases, highly-cited publications of top conferences, and suggestions from individuals. This is currently a non-exhaustive list of notable models. Additional information about our approach to measuring parameter counts, dataset size, and training compute can be found in the accompanying documentation.

Read the complete documentation

Use this work

Licensing

Epoch’s data is free to use, distribute, and reproduce provided the source and authors are credited under the Creative Commons Attribution license.

Citation

Epoch AI, ‘Data on Notable AI Models’. Published online at epochai.org. Retrieved from ‘https://epochai.org/data/notable-ai-models’ [online resource]. Accessed .

BibTeX Citation

@misc{EpochNotableModels2024,
  title = {Data on Notable AI Models},
  author = {{Epoch AI}},
  year = {2024},
  url = {https://epochai.org/data/notable-ai-models},
  note = {Accessed: }
}

Download this data

Notable AI Models

CSV, Updated November 02, 2024