{"id": 1015508, "name": "Training dataset size", "unit": "datapoints", "createdAt": "2025-03-15T08:53:18.000Z", "updatedAt": "2025-10-09T11:02:56.000Z", "coverage": "", "timespan": "", "datasetId": 7001, "columnOrder": 0, "shortName": "training_dataset_size__datapoints", "catalogPath": "grapher/artificial_intelligence/2025-03-12/epoch_regressions/epoch_regressions#training_dataset_size__datapoints", "descriptionShort": "The number of examples provided to train an AI model. Typically, more data results in a more comprehensive understanding by the model.", "type": "float", "dataChecksum": "11908039958973126151", "metadataChecksum": "4299610194470252213", "datasetName": "Parameter, Compute and Data Trends in Machine Learning - Regressions", "updatePeriodDays": 31, "datasetVersion": "2025-03-12", "nonRedistributable": false, "display": {"unit": "datapoints", "zeroDay": "1949-01-01", "yearIsDay": true, "numDecimalPlaces": 0}, "schemaVersion": 2, "processingLevel": "major", "presentation": {"topicTagsLinks": ["Artificial Intelligence"]}, "descriptionKey": ["Training data size refers to the volume of data employed to train an artificial intelligence (AI) model effectively. It's a representation of the number of examples that the model learns from during its training process. It is a fundamental measure of the scope of the data used in the model's learning phase.", "To grasp the concept of training data size, imagine teaching a friend the art of distinguishing different types of birds. In this analogy, each bird picture presented to your friend corresponds to an individual piece of training data. If you showed them 100 unique bird photos, then the training data size in this scenario would be quantified as 100.", "Training data size is an essential indicator in AI and machine learning. First and foremost, it directly impacts the depth of learning achieved by the model. The more extensive the dataset, the more profound and comprehensive the model's understanding of the subject matter becomes. Additionally, a large training data size contributes significantly to improved recognition capabilities. By exposing the model to a diverse array of examples, it becomes adept at identifying subtle nuances, much like how it becomes skilled at distinguishing various bird species through exposure to a large variety of bird images."], "dimensions": {"years": {"values": [{"id": 547}, {"id": 2250}, {"id": 2922}, {"id": 3833}, {"id": 4106}, {"id": 4198}, {"id": 4899}, {"id": 6513}, {"id": 7425}, {"id": 9070}, {"id": 9739}, {"id": 11413}, {"id": 12661}, {"id": 12874}, {"id": 13740}, {"id": 13787}, {"id": 14035}, {"id": 14044}, {"id": 14457}, {"id": 14610}, {"id": 14940}, {"id": 14944}, {"id": 15126}, {"id": 15142}, {"id": 15248}, {"id": 15279}, {"id": 15675}, {"id": 15826}, {"id": 15979}, {"id": 16039}, {"id": 16236}, {"id": 16283}, {"id": 16403}, {"id": 16442}, {"id": 16730}, {"id": 16771}, {"id": 17044}, {"id": 17131}, {"id": 17320}, {"id": 17335}, {"id": 17350}, {"id": 17562}, {"id": 17836}, {"id": 17850}, {"id": 18201}, {"id": 18263}, {"id": 18414}, {"id": 18444}, {"id": 18959}, {"id": 19334}, {"id": 19505}, {"id": 19796}, {"id": 20266}, {"id": 20423}, {"id": 20459}, {"id": 20629}, {"id": 20672}, {"id": 20986}, {"id": 21017}, {"id": 21335}, {"id": 21356}, {"id": 21735}, {"id": 21878}, {"id": 21891}, {"id": 21892}, {"id": 22012}, {"id": 22080}, {"id": 22127}, {"id": 22133}, {"id": 22158}, {"id": 22178}, {"id": 22240}, {"id": 22280}, {"id": 22443}, {"id": 22445}, {"id": 22537}, {"id": 22548}, {"id": 22763}, {"id": 22814}, {"id": 22823}, {"id": 22905}, {"id": 22920}, {"id": 22956}, {"id": 23164}, {"id": 23203}, {"id": 23262}, {"id": 23283}, {"id": 23347}, {"id": 23391}, {"id": 23521}, {"id": 23649}, {"id": 23664}, {"id": 23714}, {"id": 23720}, {"id": 23729}, {"id": 23741}, {"id": 23874}, {"id": 23892}, {"id": 23901}, {"id": 23909}, {"id": 23912}, {"id": 23913}, {"id": 23914}, {"id": 23922}, {"id": 23936}, {"id": 23958}, {"id": 23984}, {"id": 23987}, {"id": 23993}, {"id": 24000}, {"id": 24001}, {"id": 24051}, {"id": 24054}, {"id": 24073}, {"id": 24077}, {"id": 24096}, {"id": 24106}, {"id": 24142}, {"id": 24161}, {"id": 24181}, {"id": 24243}, {"id": 24263}, {"id": 24305}, {"id": 24312}, {"id": 24324}, {"id": 24348}, {"id": 24406}, {"id": 24432}, {"id": 24441}, {"id": 24447}, {"id": 24449}, {"id": 24455}, {"id": 24497}, {"id": 24534}, {"id": 24599}, {"id": 24639}, {"id": 24643}, {"id": 24649}, {"id": 24731}, {"id": 24740}, {"id": 24751}, {"id": 24752}, {"id": 24772}, {"id": 24779}, {"id": 24780}, {"id": 24781}, {"id": 24791}, {"id": 24792}, {"id": 24818}, {"id": 24820}, {"id": 24828}, {"id": 24842}, {"id": 24843}, {"id": 24859}, {"id": 24964}, {"id": 24999}, {"id": 25027}, {"id": 25042}, {"id": 25047}, {"id": 25055}, {"id": 25062}, {"id": 25077}, {"id": 25085}, {"id": 25094}, {"id": 25105}, {"id": 25127}, {"id": 25137}, {"id": 25138}, {"id": 25140}, {"id": 25150}, {"id": 25171}, {"id": 25175}, {"id": 25233}, {"id": 25237}, {"id": 25282}, {"id": 25299}, {"id": 25323}, {"id": 25324}, {"id": 25343}, {"id": 25353}, {"id": 25392}, {"id": 25441}, {"id": 25443}, {"id": 25468}, {"id": 25472}, {"id": 25485}, {"id": 25489}, {"id": 25510}, {"id": 25517}, {"id": 25520}, {"id": 25521}, {"id": 25547}, {"id": 25575}, {"id": 25611}, {"id": 25624}, {"id": 25651}, {"id": 25676}, {"id": 25681}, {"id": 25688}, {"id": 25727}, {"id": 25731}, {"id": 25734}, {"id": 25748}, {"id": 25800}, {"id": 25826}, {"id": 25835}, {"id": 25841}, {"id": 25862}, {"id": 25871}, {"id": 25875}, {"id": 25880}, {"id": 25881}, {"id": 25889}, {"id": 25897}, {"id": 25903}, {"id": 25905}, {"id": 25913}, {"id": 25946}, {"id": 25959}, {"id": 25970}, {"id": 25971}, {"id": 25975}, {"id": 25983}, {"id": 26002}, {"id": 26008}, {"id": 26014}, {"id": 26054}, {"id": 26058}, {"id": 26059}, {"id": 26078}, {"id": 26080}, {"id": 26113}, {"id": 26147}, {"id": 26150}, {"id": 26176}, {"id": 26207}, {"id": 26225}, {"id": 26226}, {"id": 26227}, {"id": 26259}, {"id": 26266}, {"id": 26267}, {"id": 26289}, {"id": 26291}, {"id": 26302}, {"id": 26307}, {"id": 26308}, {"id": 26312}, {"id": 26341}, {"id": 26352}, {"id": 26357}, {"id": 26361}, {"id": 26380}, {"id": 26421}, {"id": 26428}, {"id": 26443}, {"id": 26456}, {"id": 26457}, {"id": 26458}, {"id": 26459}, {"id": 26469}, {"id": 26471}, {"id": 26472}, {"id": 26483}, {"id": 26485}, {"id": 26505}, {"id": 26507}, {"id": 26515}, {"id": 26520}, {"id": 26524}, {"id": 26526}, {"id": 26543}, {"id": 26546}, {"id": 26550}, {"id": 26560}, {"id": 26561}, {"id": 26574}, {"id": 26581}, {"id": 26582}, {"id": 26601}, {"id": 26602}, {"id": 26612}, {"id": 26616}, {"id": 26620}, {"id": 26623}, {"id": 26639}, {"id": 26644}, {"id": 26646}, {"id": 26651}, {"id": 26654}, {"id": 26662}, {"id": 26669}, {"id": 26682}, {"id": 26684}, {"id": 26685}, {"id": 26689}, {"id": 26700}, {"id": 26702}, {"id": 26703}, {"id": 26710}, {"id": 26722}, {"id": 26723}, {"id": 26742}, {"id": 26750}, {"id": 26756}, {"id": 26758}, {"id": 26759}, {"id": 26765}, {"id": 26766}, {"id": 26784}, {"id": 26792}, {"id": 26794}, {"id": 26805}, {"id": 26809}, {"id": 26811}, {"id": 26819}, {"id": 26827}, {"id": 26835}, {"id": 26840}, {"id": 26842}, {"id": 26848}, {"id": 26849}, {"id": 26854}, {"id": 26864}, {"id": 26865}, {"id": 26876}, {"id": 26878}, {"id": 26884}, {"id": 26919}, {"id": 26926}, {"id": 26939}, {"id": 26940}, {"id": 26946}, {"id": 26968}, {"id": 26969}, {"id": 26976}, {"id": 26980}, {"id": 26982}, {"id": 27015}, {"id": 27024}, {"id": 27032}, {"id": 27037}, {"id": 27042}, {"id": 27043}, {"id": 27054}, {"id": 27067}, {"id": 27068}, {"id": 27082}, {"id": 27091}, {"id": 27101}, {"id": 27106}, {"id": 27113}, {"id": 27116}, {"id": 27122}, {"id": 27131}, {"id": 27156}, {"id": 27157}, {"id": 27164}, {"id": 27165}, {"id": 27176}, {"id": 27205}, {"id": 27214}, {"id": 27226}, {"id": 27234}, {"id": 27263}, {"id": 27267}, {"id": 27268}, {"id": 27276}, {"id": 27292}, {"id": 27298}, {"id": 27301}, {"id": 27309}, {"id": 27326}, {"id": 27327}, {"id": 27330}, {"id": 27333}, {"id": 27334}, {"id": 27335}, {"id": 27336}, {"id": 27337}, {"id": 27338}, {"id": 27346}, {"id": 27360}, {"id": 27361}, {"id": 27362}, {"id": 27368}, {"id": 27373}, {"id": 27375}, {"id": 27382}, {"id": 27427}, {"id": 27456}, {"id": 27466}, {"id": 27479}, {"id": 27481}, {"id": 27501}, {"id": 27516}, {"id": 27521}, {"id": 27526}, {"id": 27533}, {"id": 27534}, {"id": 27551}, {"id": 27557}, {"id": 27558}, {"id": 27561}, {"id": 27569}, {"id": 27597}, {"id": 27603}, {"id": 27611}, {"id": 27653}, {"id": 27655}, {"id": 27656}, {"id": 27660}, {"id": 27670}, {"id": 27688}, {"id": 27694}, {"id": 27703}, {"id": 27733}, {"id": 27736}, {"id": 27751}, {"id": 27758}, {"id": 27780}, {"id": 27792}, {"id": 27828}, {"id": 27833}, {"id": 27853}, {"id": 27858}, {"id": 27877}, {"id": 27889}, {"id": 27945}, {"id": 27950}, {"id": 27954}, {"id": 27961}, {"id": 27975}, {"id": 28002}, {"id": 28006}, {"id": 28023}]}, "entities": {"values": [{"id": 370903, "name": "1.2x/year between 1950\u20132010", "code": null}, {"id": 256993, "name": "Theseus", "code": null}, {"id": 305970, "name": "Self Organizing System", "code": null}, {"id": 257002, "name": "Perceptron Mark I", "code": null}, {"id": 256994, "name": "Samuel Neural Checkers", "code": null}, {"id": 369024, "name": "Perceptron (1960)", "code": null}, {"id": 256995, "name": "ADALINE", "code": null}, {"id": 369543, "name": "Linear Decision Functions", "code": null}, {"id": 369517, "name": "LTE speaker verification system", "code": null}, {"id": 369520, "name": "Decision tree adaline", "code": null}, {"id": 369537, "name": "Piecewise linear model", "code": null}, {"id": 305980, "name": "Cognitron", "code": null}, {"id": 256996, "name": "Neocognitron", "code": null}, {"id": 305984, "name": "ASE+ACE", "code": null}, {"id": 369510, "name": "Hierarchical Cognitron", "code": null}, {"id": 369560, "name": "Distributed representation NN", "code": null}, {"id": 257004, "name": "Back-propagation", "code": null}, {"id": 368075, "name": "NetTalk (dictionary)", "code": null}, {"id": 368083, "name": "NetTalk (transcription)", "code": null}, {"id": 371864, "name": "Translation-invariant MLP", "code": null}, {"id": 369973, "name": "MLN-ASR", "code": null}, {"id": 369527, "name": "MLP baggage detector", "code": null}, {"id": 369521, "name": "Handwritten Digit Recognition System", "code": null}, {"id": 369529, "name": "Speaker-independent vowel classification", "code": null}, {"id": 257006, "name": "Zip CNN", "code": null}, {"id": 371841, "name": "NETtalk reimplementation", "code": null}, {"id": 369990, "name": "Bankruptcy-NN", "code": null}, {"id": 369986, "name": "ISR network", "code": null}, {"id": 369968, "name": "SexNet classification", "code": null}, {"id": 369967, "name": "SexNet compression", "code": null}, {"id": 369526, "name": "RAAM", "code": null}, {"id": 369977, "name": "Weight Decay", "code": null}, {"id": 257007, "name": "TD-Gammon", "code": null}, {"id": 369984, "name": "Golem", "code": null}, {"id": 369995, "name": "Boosting", "code": null}, {"id": 305992, "name": "IBM-5", "code": null}, {"id": 369969, "name": "Siamese-TDNN", "code": null}, {"id": 369966, "name": "ANN Eye Tracker", "code": null}, {"id": 369972, "name": "Ceramic-MLP", "code": null}, {"id": 305993, "name": "GroupLens", "code": null}, {"id": 369964, "name": "JPMAX", "code": null}, {"id": 369991, "name": "Mixture of linear models", "code": null}, {"id": 369992, "name": "NeuroChess", "code": null}, {"id": 369996, "name": "Predictive Coding NN", "code": null}, {"id": 305994, "name": "Support Vector Machines", "code": null}, {"id": 369970, "name": "LISSOM", "code": null}, {"id": 369993, "name": "MUSIC perceptron", "code": null}, {"id": 256997, "name": "System 11", "code": null}, {"id": 370523, "name": "AdaBoost.M2 Digit Recognition", "code": null}, {"id": 369979, "name": "SOM-CNN", "code": null}, {"id": 367568, "name": "Bidirectional RNN", "code": null}, {"id": 256998, "name": "LSTM", "code": null}, {"id": 245542, "name": "LeNet-5", "code": null}, {"id": 354866, "name": "LSTM with forget gates", "code": null}, {"id": 371856, "name": "RECONTRA-categorized", "code": null}, {"id": 371849, "name": "RECONTRA-uncategorized", "code": null}, {"id": 305998, "name": "IBM Model 4", "code": null}, {"id": 369997, "name": "Neural LM", "code": null}, {"id": 369980, "name": "PoE MNIST", "code": null}, {"id": 257009, "name": "Decision tree (classification)", "code": null}, {"id": 306001, "name": "Thumbs Up?", "code": null}, {"id": 371812, "name": "NPLM (AP News)", "code": null}, {"id": 371816, "name": "NPLM (Brown)", "code": null}, {"id": 369971, "name": "Invariant CNN", "code": null}, {"id": 369988, "name": "LMICA", "code": null}, {"id": 369987, "name": "Hierarchical LM", "code": null}, {"id": 367524, "name": "Histograms of Oriented Gradients", "code": null}, {"id": 370522, "name": "RankNet", "code": null}, {"id": 371873, "name": "SVM-CNN", "code": null}, {"id": 306014, "name": "Deep Belief Nets", "code": null}, {"id": 370719, "name": "Dimensionality Reduction", "code": null}, {"id": 306017, "name": "\u03bb-WASP", "code": null}, {"id": 369975, "name": "KN-LM", "code": null}, {"id": 369976, "name": "SB-LM", "code": null}, {"id": 367308, "name": "Deep Multitask NLP Network", "code": null}, {"id": 306022, "name": "BigChaos 2008", "code": null}, {"id": 369564, "name": "HLBL", "code": null}, {"id": 371852, "name": "GNN", "code": null}, {"id": 368330, "name": "RBM Image Classifier", "code": null}, {"id": 257011, "name": "GPU DBNs", "code": null}, {"id": 306025, "name": "BellKor 2008", "code": null}, {"id": 367305, "name": "BellKor 2009", "code": null}, {"id": 368038, "name": "BigChaos OptiBlend", "code": null}, {"id": 368137, "name": "Pragmatic Theory solution (Netflix 2009)", "code": null}, {"id": 367462, "name": "MatrixFac for Recommenders", "code": null}, {"id": 371877, "name": "Two Stage Feature Extraction (MNIST)", "code": null}, {"id": 306016, "name": "BellKor 2007", "code": null}, {"id": 371881, "name": "LCNP LabelMe", "code": null}, {"id": 371878, "name": "LCNP MNIST", "code": null}, {"id": 371874, "name": "LCNP NORB", "code": null}, {"id": 371656, "name": "2.7x/year between 2010\u20132025", "code": null}, {"id": 369004, "name": "Super-vector coding", "code": null}, {"id": 371880, "name": "iCCCP", "code": null}, {"id": 306030, "name": "ReLU (NORB)", "code": null}, {"id": 371871, "name": "Pooling CNN (Caltech 101)", "code": null}, {"id": 371872, "name": "Pooling CNN (NORB)", "code": null}, {"id": 369528, "name": "RNN LM", "code": null}, {"id": 367473, "name": "YouTube Video Recommendation System", "code": null}, {"id": 370720, "name": "Deep Autoencoders", "code": null}, {"id": 369512, "name": "Vector Space Model", "code": null}, {"id": 369553, "name": "Recursive Neural Network", "code": null}, {"id": 371842, "name": "CNN Committee (MNIST)", "code": null}, {"id": 371862, "name": "CNN Committee (NIST)", "code": null}, {"id": 371863, "name": "CNN committee (traffic sign)", "code": null}, {"id": 306033, "name": "NLP from scratch", "code": null}, {"id": 306036, "name": "Dropout (CIFAR)", "code": null}, {"id": 306037, "name": "Dropout (ImageNet)", "code": null}, {"id": 257017, "name": "Dropout (MNIST)", "code": null}, {"id": 306035, "name": "Dropout (TIMIT)", "code": null}, {"id": 366988, "name": "Unsupervised High-level Feature Learner", "code": null}, {"id": 369534, "name": "LSTM LM", "code": null}, {"id": 240132, "name": "AlexNet", "code": null}, {"id": 371854, "name": "DNN EM segmentation", "code": null}, {"id": 369538, "name": "DistBelief Speech", "code": null}, {"id": 369541, "name": "DistBelief Vision", "code": null}, {"id": 369539, "name": "DistBelief NNLM", "code": null}, {"id": 369989, "name": "Multilingual DNN", "code": null}, {"id": 369513, "name": "RCTM", "code": null}, {"id": 369535, "name": "RNTN", "code": null}, {"id": 257018, "name": "Word2Vec (large)", "code": null}, {"id": 306040, "name": "Word2Vec (small)", "code": null}, {"id": 369963, "name": "DeViSE", "code": null}, {"id": 257106, "name": "TransE", "code": null}, {"id": 369344, "name": "RNN for 1B words", "code": null}, {"id": 306044, "name": "Image generation", "code": null}, {"id": 306049, "name": "GloVe (32B)", "code": null}, {"id": 306048, "name": "GloVe (6B)", "code": null}, {"id": 369565, "name": "Paragraph Vector", "code": null}, {"id": 369562, "name": "AdaRNN", "code": null}, {"id": 369556, "name": "Dropout: SVHN", "code": null}, {"id": 257021, "name": "GANs", "code": null}, {"id": 257097, "name": "SPPNet", "code": null}, {"id": 369994, "name": "Fragment embedding", "code": null}, {"id": 368752, "name": "RNN-WER", "code": null}, {"id": 306047, "name": "Multiresolution CNN", "code": null}, {"id": 306046, "name": "SmooCT", "code": null}, {"id": 371825, "name": "ACF-WIDER", "code": null}, {"id": 371884, "name": "NPD", "code": null}, {"id": 257022, "name": "RNNsearch-50*", "code": null}, {"id": 257023, "name": "VGG16", "code": null}, {"id": 306053, "name": "VGG19", "code": null}, {"id": 307046, "name": "Seq2Seq LSTM", "code": null}, {"id": 257026, "name": "GoogLeNet / InceptionV1", "code": null}, {"id": 367494, "name": "Deeply-supervised nets", "code": null}, {"id": 306054, "name": "LRCN", "code": null}, {"id": 369557, "name": "SC-NLM", "code": null}, {"id": 371867, "name": "TA-CNN", "code": null}, {"id": 370721, "name": "SNM-skip", "code": null}, {"id": 257024, "name": "ADAM (CIFAR-10)", "code": null}, {"id": 371883, "name": "VGG-Face", "code": null}, {"id": 257025, "name": "MSRA (C, PReLU)", "code": null}, {"id": 306056, "name": "DQN-2015", "code": null}, {"id": 368084, "name": "genCNN + dyn eval", "code": null}, {"id": 371876, "name": "U-Net", "code": null}, {"id": 371875, "name": "CFSS", "code": null}, {"id": 371894, "name": "CompACT-Deep", "code": null}, {"id": 371870, "name": "Deep CNN + COTS", "code": null}, {"id": 371859, "name": "DCNN", "code": null}, {"id": 306063, "name": "BPE", "code": null}, {"id": 371882, "name": "SAF R-CNN", "code": null}, {"id": 371824, "name": "3DDFA", "code": null}, {"id": 306064, "name": "Inception v3", "code": null}, {"id": 54159, "name": "SSD", "code": null}, {"id": 371899, "name": "ResNet-101 (ImageNet)", "code": null}, {"id": 257028, "name": "ResNet-152 (ImageNet)", "code": null}, {"id": 368031, "name": "Variational (untied weights, MC) LSTM (Large)", "code": null}, {"id": 257029, "name": "AlphaGo Lee", "code": null}, {"id": 257109, "name": "Named Entity Recognition model", "code": null}, {"id": 371902, "name": "LRR-4X", "code": null}, {"id": 371891, "name": "CMS-RCNN", "code": null}, {"id": 257105, "name": "R-FCN", "code": null}, {"id": 371844, "name": "CCL", "code": null}, {"id": 366658, "name": "ResNet-200", "code": null}, {"id": 257030, "name": "GNMT", "code": null}, {"id": 368076, "name": "Pointer Sentinel-LSTM (medium)", "code": null}, {"id": 240142, "name": "Xception", "code": null}, {"id": 371896, "name": "GAWWN", "code": null}, {"id": 369001, "name": "SPIDER2", "code": null}, {"id": 368057, "name": "VD-LSTM+REAL Large", "code": null}, {"id": 369179, "name": "BIDAF", "code": null}, {"id": 370527, "name": "NAS with base 8 and shared embeddings", "code": null}, {"id": 257031, "name": "NASv3 (CIFAR-10)", "code": null}, {"id": 371895, "name": "DLDL (PASCAL)", "code": null}, {"id": 371858, "name": "DAC-CSR", "code": null}, {"id": 306078, "name": "PolyNet", "code": null}, {"id": 371890, "name": "HR-ResNet101", "code": null}, {"id": 371809, "name": "3DMM-CNN", "code": null}, {"id": 371866, "name": "EnhanceNet", "code": null}, {"id": 257034, "name": "DeepStack", "code": null}, {"id": 368323, "name": "OR-WideResNet", "code": null}, {"id": 370178, "name": "MoE-Multi", "code": null}, {"id": 369153, "name": "Mnemonic Reader", "code": null}, {"id": 371241, "name": "Transformer (2017)", "code": null}, {"id": 257037, "name": "JFT", "code": null}, {"id": 369558, "name": "ConvS2S (ensemble of 8 models)", "code": null}, {"id": 369182, "name": "GSM", "code": null}, {"id": 368011, "name": "AWD-LSTM - 3-layer LSTM (tied) + continuous cache pointer (WT2)", "code": null}, {"id": 308274, "name": "RetinaNet-R101", "code": null}, {"id": 368056, "name": "EI-REHN-1000D", "code": null}, {"id": 368086, "name": "GL-LWGC-AWD-MoS-LSTM + dynamic evaluation (WT2)", "code": null}, {"id": 368342, "name": "PyramidNet", "code": null}, {"id": 368023, "name": "ISS", "code": null}, {"id": 368044, "name": "AWD-LSTM+WT+Cache+IOG (WT2)", "code": null}, {"id": 257039, "name": "AlphaGo Zero", "code": null}, {"id": 369175, "name": "PhraseCond", "code": null}, {"id": 368720, "name": "S-Norm", "code": null}, {"id": 369187, "name": "DCN+", "code": null}, {"id": 368045, "name": "Fraternal dropout + AWD-LSTM 3-layer (WT2)", "code": null}, {"id": 368041, "name": "AWD-LSTM-MoS + dynamic evaluation (WT2, 2017)", "code": null}, {"id": 371892, "name": "DL scaling Image", "code": null}, {"id": 371900, "name": "DL scaling LM", "code": null}, {"id": 371893, "name": "DL scaling speech", "code": null}, {"id": 240145, "name": "AlphaZero", "code": null}, {"id": 368074, "name": "QRNN", "code": null}, {"id": 257040, "name": "IMPALA", "code": null}, {"id": 368102, "name": "4 layer QRNN (h=2500)", "code": null}, {"id": 257041, "name": "YOLOv3", "code": null}, {"id": 306104, "name": "ResNeXt-101 32x48d", "code": null}, {"id": 368004, "name": "Dropout-LSTM+Noise(Bernoulli) (WT2)", "code": null}, {"id": 368064, "name": "aLSTM(depth-2)+RecurrentPolicy (WT2)", "code": null}, {"id": 370176, "name": "GPT-1", "code": null}, {"id": 368035, "name": "Big-Little Net", "code": null}, {"id": 369018, "name": "Big-Little Net (speech)", "code": null}, {"id": 369348, "name": "Big Transformer for Back-Translation", "code": null}, {"id": 368101, "name": "(ensemble): AWD-LSTM-DOC (fin) \u00d7 5 (WT2)", "code": null}, {"id": 368009, "name": "LSTM+NeuralCache", "code": null}, {"id": 369555, "name": "Transformer (Adaptive Input Embeddings) WT103", "code": null}, {"id": 257045, "name": "BERT-Large", "code": null}, {"id": 368087, "name": "TrellisNet", "code": null}, {"id": 369326, "name": "Mesh-TensorFlow Transformer 2.9B (translation)", "code": null}, {"id": 370525, "name": "Mesh-TensorFlow Transformer 4.9B (language)", "code": null}, {"id": 370722, "name": "Fine-tuned-AWD-LSTM-DOC (fin)", "code": null}, {"id": 368028, "name": "Multi-cell LSTM", "code": null}, {"id": 306110, "name": "GPipe (Transformer)", "code": null}, {"id": 371889, "name": "StyleGAN", "code": null}, {"id": 369511, "name": "Transformer-XL (257M)", "code": null}, {"id": 369043, "name": "GPT-2 (1.5B)", "code": null}, {"id": 366991, "name": "KataGo", "code": null}, {"id": 369176, "name": "SciBERT", "code": null}, {"id": 368077, "name": "BERT-Large-CAS (PTB+WT2+WT103)", "code": null}, {"id": 371848, "name": "MuseNet", "code": null}, {"id": 371964, "name": "RaptorX-Contact", "code": null}, {"id": 368026, "name": "AWD-LSTM + MoS + Partial Shuffled", "code": null}, {"id": 306122, "name": "FixRes ResNeXt-101 WSL", "code": null}, {"id": 368358, "name": "LaNet-L (CIFAR-10)", "code": null}, {"id": 365388, "name": "RoBERTa Large", "code": null}, {"id": 371970, "name": "trRosetta", "code": null}, {"id": 257055, "name": "Megatron-BERT", "code": null}, {"id": 371869, "name": "Megatron-LM (1.2B)", "code": null}, {"id": 368027, "name": "Megatron-LM (8.3B)", "code": null}, {"id": 306125, "name": "ALBERT", "code": null}, {"id": 257056, "name": "AlphaX-1", "code": null}, {"id": 257059, "name": "T5-11B", "code": null}, {"id": 257058, "name": "T5-3B", "code": null}, {"id": 368067, "name": "Base LM + kNN LM + Continuous Cache", "code": null}, {"id": 369168, "name": "XLM-RoBERTa", "code": null}, {"id": 369328, "name": "CamemBERT", "code": null}, {"id": 368005, "name": "Sandwich Transformer", "code": null}, {"id": 306128, "name": "Noisy Student (L2)", "code": null}, {"id": 306130, "name": "MuZero", "code": null}, {"id": 368010, "name": "Transformer-XL DeFINE (141M)", "code": null}, {"id": 371898, "name": "StyleGAN2", "code": null}, {"id": 371947, "name": "MMLSTM (PTB)", "code": null}, {"id": 371951, "name": "MMLSTM (WT-2)", "code": null}, {"id": 257061, "name": "OpenAI Five", "code": null}, {"id": 257062, "name": "OpenAI Five Rerun", "code": null}, {"id": 257063, "name": "AlphaFold", "code": null}, {"id": 257064, "name": "Meena", "code": null}, {"id": 368046, "name": "TaLK Convolution", "code": null}, {"id": 257107, "name": "ALBERT-xxlarge", "code": null}, {"id": 368060, "name": "Turing-NLG", "code": null}, {"id": 368047, "name": "Feedback Transformer", "code": null}, {"id": 368106, "name": "TransformerXL + spectrum control", "code": null}, {"id": 368107, "name": "Tensor-Transformer(1core)+PN (WT103)", "code": null}, {"id": 306136, "name": "ELECTRA", "code": null}, {"id": 365990, "name": "UnifiedQA", "code": null}, {"id": 371942, "name": "NAS+ESS (23M)", "code": null}, {"id": 368375, "name": "ContextNet", "code": null}, {"id": 368341, "name": "DETR", "code": null}, {"id": 354864, "name": "GPT-3 175B (davinci)", "code": null}, {"id": 257072, "name": "GShard (dense)", "code": null}, {"id": 370248, "name": "DeLighT", "code": null}, {"id": 306133, "name": "ERNIE-GEN (large)", "code": null}, {"id": 368338, "name": "ProBERTa", "code": null}, {"id": 369195, "name": "LUKE", "code": null}, {"id": 368329, "name": "mT5-XXL", "code": null}, {"id": 369146, "name": "German ELECTRA Large", "code": null}, {"id": 306146, "name": "ViT-Huge/14", "code": null}, {"id": 257073, "name": "wave2vec 2.0 LARGE", "code": null}, {"id": 268376, "name": "KEPLER", "code": null}, {"id": 368138, "name": "AlphaFold 2", "code": null}, {"id": 257074, "name": "CPM-Large", "code": null}, {"id": 369162, "name": "DensePhrases", "code": null}, {"id": 368078, "name": "CT-MoS (WT2)", "code": null}, {"id": 306154, "name": "CLIP (ResNet-50)", "code": null}, {"id": 257076, "name": "CLIP (ViT L/14@336px)", "code": null}, {"id": 257077, "name": "DALL-E", "code": null}, {"id": 306151, "name": "BigSSL", "code": null}, {"id": 257078, "name": "Switch", "code": null}, {"id": 366051, "name": "DeiT-B", "code": null}, {"id": 368753, "name": "MSA Transformer", "code": null}, {"id": 368098, "name": "SRU++ Large", "code": null}, {"id": 257104, "name": "Meta Pseudo Labels", "code": null}, {"id": 307047, "name": "Generative BST", "code": null}, {"id": 306163, "name": "M6-T", "code": null}, {"id": 371949, "name": "Unicorn", "code": null}, {"id": 369015, "name": "ProtBERT-BFD", "code": null}, {"id": 371533, "name": "ProtT5-XL-U50", "code": null}, {"id": 368328, "name": "ADM", "code": null}, {"id": 257084, "name": "CogView", "code": null}, {"id": 257098, "name": "Transformer local-attention (NesT-B)", "code": null}, {"id": 257085, "name": "ViT-G/14", "code": null}, {"id": 368362, "name": "CoAtNet", "code": null}, {"id": 368718, "name": "EMDR", "code": null}, {"id": 306150, "name": "DeBERTa", "code": null}, {"id": 257103, "name": "ALIGN", "code": null}, {"id": 306166, "name": "Denoising Diffusion Probabilistic Models (LSUN Bedroom)", "code": null}, {"id": 371868, "name": "StyleGAN3-R", "code": null}, {"id": 371879, "name": "StyleGAN3-T", "code": null}, {"id": 370177, "name": "EfficientNetV2-XL", "code": null}, {"id": 369017, "name": "Fold2Seq", "code": null}, {"id": 257087, "name": "ERNIE 3.0", "code": null}, {"id": 306167, "name": "Codex", "code": null}, {"id": 273165, "name": "GOAT", "code": null}, {"id": 257088, "name": "HuBERT", "code": null}, {"id": 257089, "name": "SEER", "code": null}, {"id": 369177, "name": "YOLOX-X", "code": null}, {"id": 257090, "name": "Jurassic-1-Jumbo", "code": null}, {"id": 367497, "name": "Zidong Taichu", "code": null}, {"id": 368748, "name": "DNABERT", "code": null}, {"id": 306169, "name": "XLMR-XXL", "code": null}, {"id": 368372, "name": "FLAN 137B", "code": null}, {"id": 368104, "name": "PermuteFormer", "code": null}, {"id": 369563, "name": "HyperCLOVA 204B", "code": null}, {"id": 366990, "name": "PLATO-XL", "code": null}, {"id": 369140, "name": "TrOCR", "code": null}, {"id": 368719, "name": "AlphaFold-Multimer", "code": null}, {"id": 257092, "name": "Megatron-Turing NLG 530B", "code": null}, {"id": 257093, "name": "Yuan 1.0", "code": null}, {"id": 368092, "name": "S4", "code": null}, {"id": 368339, "name": "Projected GAN", "code": null}, {"id": 370718, "name": "Masked Autoencoders ViT-H", "code": null}, {"id": 368334, "name": "ViT-G/14 (LiT)", "code": null}, {"id": 368364, "name": "BASIC-L", "code": null}, {"id": 35176, "name": "Florence", "code": null}, {"id": 367255, "name": "Gopher (280B)", "code": null}, {"id": 368065, "name": "GLaM", "code": null}, {"id": 369203, "name": "LongT5", "code": null}, {"id": 368722, "name": "LDM-1.45B", "code": null}, {"id": 369192, "name": "XGLM-7.5B", "code": null}, {"id": 368055, "name": "ERNIE 3.0 Titan", "code": null}, {"id": 368093, "name": "ERNIE-ViLG", "code": null}, {"id": 369202, "name": "Detic", "code": null}, {"id": 306179, "name": "data2vec (language)", "code": null}, {"id": 306178, "name": "data2vec (speech)", "code": null}, {"id": 306177, "name": "data2vec (vision)", "code": null}, {"id": 370245, "name": "AbLang (heavy sequences)", "code": null}, {"id": 369013, "name": "OntoProtein", "code": null}, {"id": 371237, "name": "InstructGPT 175B", "code": null}, {"id": 306180, "name": "RETRO-7B", "code": null}, {"id": 257096, "name": "GPT-NeoX-20B", "code": null}, {"id": 257095, "name": "LaMDA", "code": null}, {"id": 368350, "name": "ProteinBERT", "code": null}, {"id": 368348, "name": "ST-MoE", "code": null}, {"id": 306184, "name": "DeepNet", "code": null}, {"id": 306183, "name": "Statement Curriculum Learning", "code": null}, {"id": 368108, "name": "Segatron-XL large, M=384 + HCP", "code": null}, {"id": 273166, "name": "Chinchilla", "code": null}, {"id": 273167, "name": "PaLM (540B)", "code": null}, {"id": 306186, "name": "DALL\u00b7E 2", "code": null}, {"id": 368730, "name": "BERT-RBP", "code": null}, {"id": 343968, "name": "Stable Diffusion (LDM-KL-8-G)", "code": null}, {"id": 268378, "name": "Sparse all-MLP", "code": null}, {"id": 306188, "name": "OPT-175B", "code": null}, {"id": 306190, "name": "UL2", "code": null}, {"id": 306191, "name": "Gato", "code": null}, {"id": 306192, "name": "Imagen", "code": null}, {"id": 371857, "name": "GPT-2 Medium (FlashAttention)", "code": null}, {"id": 368110, "name": "Tranception", "code": null}, {"id": 366986, "name": "CogVideo", "code": null}, {"id": 368096, "name": "DITTO", "code": null}, {"id": 368373, "name": "CoCa", "code": null}, {"id": 306194, "name": "Parti", "code": null}, {"id": 369037, "name": "ProGen2-xlarge", "code": null}, {"id": 306195, "name": "Minerva (540B)", "code": null}, {"id": 368133, "name": "CodeT5-large", "code": null}, {"id": 306196, "name": "NLLB", "code": null}, {"id": 368746, "name": "BLOOM-176B", "code": null}, {"id": 369022, "name": "ESM2-15B", "code": null}, {"id": 369033, "name": "OmegaPLM", "code": null}, {"id": 354863, "name": "AlexaTM 20B", "code": null}, {"id": 365992, "name": "GLM-130B", "code": null}, {"id": 368716, "name": "BlenderBot 3", "code": null}, {"id": 368374, "name": "PaLI", "code": null}, {"id": 349174, "name": "Whisper", "code": null}, {"id": 369185, "name": "DiffDock", "code": null}, {"id": 371829, "name": "AlphaTensor", "code": null}, {"id": 369025, "name": "GenSLM", "code": null}, {"id": 369151, "name": "eDiff-I", "code": null}, {"id": 369155, "name": "mT0-13B", "code": null}, {"id": 369198, "name": "InternImage", "code": null}, {"id": 369545, "name": "EVA-01", "code": null}, {"id": 368109, "name": "Galactica", "code": null}, {"id": 368726, "name": "CaLM", "code": null}, {"id": 368002, "name": "Hybrid H3-2.7B", "code": null}, {"id": 369034, "name": "VALL-E", "code": null}, {"id": 371511, "name": "DreamerV3", "code": null}, {"id": 369009, "name": "Nucleotide Transformer", "code": null}, {"id": 368732, "name": "Ankh_large", "code": null}, {"id": 368354, "name": "DDPM-IP (CelebA)", "code": null}, {"id": 369036, "name": "ProteinDT", "code": null}, {"id": 369193, "name": "ViT-22B", "code": null}, {"id": 367499, "name": "LLaMA-65B", "code": null}, {"id": 369016, "name": "AudioGen", "code": null}, {"id": 367636, "name": "Falcon-40B", "code": null}, {"id": 363052, "name": "GPT-4", "code": null}, {"id": 368999, "name": "LEP-AD", "code": null}, {"id": 368069, "name": "PanGu-\u03a3", "code": null}, {"id": 371534, "name": "SigLIP 400M", "code": null}, {"id": 367081, "name": "BloombergGPT", "code": null}, {"id": 369040, "name": "Segment Anything Model", "code": null}, {"id": 369173, "name": "DINOv2", "code": null}, {"id": 368130, "name": "StarCoder", "code": null}, {"id": 365387, "name": "PaLM 2", "code": null}, {"id": 369165, "name": "CoEdiT-xxl", "code": null}, {"id": 366449, "name": "ONE-PEACE", "code": null}, {"id": 368325, "name": "PaLI-X", "code": null}, {"id": 369030, "name": "HyenaDNA", "code": null}, {"id": 367509, "name": "InternLM", "code": null}, {"id": 368743, "name": "Llama 2-70B", "code": null}, {"id": 368747, "name": "Llama 2-7B", "code": null}, {"id": 369002, "name": "AudioLM", "code": null}, {"id": 369167, "name": "Qwen-VL", "code": null}, {"id": 369042, "name": "PeptideBERT", "code": null}, {"id": 367533, "name": "Jais", "code": null}, {"id": 369508, "name": "Falcon-180B", "code": null}, {"id": 368737, "name": "AlphaMissense", "code": null}, {"id": 370131, "name": "Amazon Titan", "code": null}, {"id": 368728, "name": "CTM (CIFAR-10)", "code": null}, {"id": 371966, "name": "RoseTTAFold All-Atom (RFAA)", "code": null}, {"id": 368129, "name": "CODEFUSION (Python)", "code": null}, {"id": 369998, "name": "ChatGLM3-6B", "code": null}, {"id": 368349, "name": "Skywork-13B", "code": null}, {"id": 368740, "name": "Yi-34B", "code": null}, {"id": 368337, "name": "BLUUMI", "code": null}, {"id": 368376, "name": "Grok-1", "code": null}, {"id": 369331, "name": "LLaVA 1.5", "code": null}, {"id": 370174, "name": "CogVLM-17B", "code": null}, {"id": 369138, "name": "mPLUG-Owl2", "code": null}, {"id": 369019, "name": "Nemotron-3-8B", "code": null}, {"id": 370977, "name": "GNoME for crystal discovery", "code": null}, {"id": 368736, "name": "Qwen-72B", "code": null}, {"id": 369011, "name": "Mamba-24M (SC09)", "code": null}, {"id": 369174, "name": "Llama Guard", "code": null}, {"id": 371814, "name": "VILA-13B", "code": null}, {"id": 369518, "name": "CogAgent", "code": null}, {"id": 369532, "name": "FunSearch", "code": null}, {"id": 371483, "name": "nekomata-14b", "code": null}, {"id": 371236, "name": "Qwen1.5-72B", "code": null}, {"id": 371244, "name": "Aramco Metabrain AI", "code": null}, {"id": 369524, "name": "MM1-30B", "code": null}, {"id": 369883, "name": "DBRX", "code": null}, {"id": 369542, "name": "ReALM", "code": null}, {"id": 369516, "name": "Llama 3-70B", "code": null}, {"id": 371528, "name": "VILA1.5-13B", "code": null}, {"id": 371962, "name": "AlphaFold 3", "code": null}, {"id": 370141, "name": "Yi-Large", "code": null}, {"id": 370087, "name": "GLM-4 (0520)", "code": null}, {"id": 371471, "name": "ALLaM 7B", "code": null}, {"id": 371545, "name": "ALLaM\u00a0adapted 70B", "code": null}, {"id": 370106, "name": "Qwen2-72B", "code": null}, {"id": 370246, "name": "OpenVLA", "code": null}, {"id": 369982, "name": "Nemotron-4 340B", "code": null}, {"id": 370239, "name": "DeepSeek-Coder-V2 236B", "code": null}, {"id": 369981, "name": "ESM3 (98B)", "code": null}, {"id": 370155, "name": "Llama 3.1-405B", "code": null}, {"id": 370093, "name": "AFM-on-device", "code": null}, {"id": 370105, "name": "AFM-server", "code": null}, {"id": 371536, "name": "LLaVA-OV-72B", "code": null}, {"id": 371474, "name": "Qwen2.5-32B", "code": null}, {"id": 370534, "name": "Qwen2.5-72B", "code": null}, {"id": 371840, "name": "Telechat2-115B", "code": null}, {"id": 371540, "name": "Llama 3.2 11B", "code": null}, {"id": 371329, "name": "Movie Gen Video", "code": null}, {"id": 371240, "name": "NVLM-D 72B", "code": null}, {"id": 371234, "name": "NVLM-H 72B", "code": null}, {"id": 371232, "name": "NVLM-X 72B", "code": null}, {"id": 371248, "name": "Doubao-pro", "code": null}, {"id": 371242, "name": "Hunyuan-Large", "code": null}, {"id": 371535, "name": "Llama 3.3 70B", "code": null}, {"id": 371364, "name": "EXAONE 3.5 32B", "code": null}, {"id": 371328, "name": "DeepSeek-V3", "code": null}, {"id": 371974, "name": "STORM-B/8", "code": null}, {"id": 371331, "name": "Doubao-1.5-pro", "code": null}, {"id": 371539, "name": "Eurus-2-7B-PRIME", "code": null}, {"id": 371513, "name": "Hunyuan-TurboS", "code": null}, {"id": 371472, "name": "EXAONE Deep 32B", "code": null}, {"id": 371514, "name": "Llama 4 Behemoth (preview)", "code": null}, {"id": 371860, "name": "Llama 4 Maverick", "code": null}, {"id": 371843, "name": "Llama 4 Scout", "code": null}, {"id": 371529, "name": "Pangu Ultra", "code": null}, {"id": 371991, "name": "Qwen3-235B-A22B", "code": null}, {"id": 371939, "name": "Seed1.5-VL", "code": null}, {"id": 372168, "name": "dots.llm1", "code": null}, {"id": 371815, "name": "Kimi K2", "code": null}, {"id": 371817, "name": "EXAONE 4.0 (32B)", "code": null}, {"id": 371987, "name": "Qwen3-Coder-480B-A35B", "code": null}, {"id": 371946, "name": "GLM 4.5", "code": null}, {"id": 371847, "name": "gpt-oss-120b", "code": null}, {"id": 371855, "name": "gpt-oss-20b", "code": null}, {"id": 372167, "name": "LongCat-Flash", "code": null}, {"id": 371992, "name": "Qwen3-Max", "code": null}, {"id": 372169, "name": "Qwen3-Omni-30B-A3B", "code": null}]}}, "origins": [{"id": 8834, "title": "Parameter, Compute and Data Trends in Machine Learning", "descriptionSnapshot": "We update this chart with the latest available data from our source every month.\n\nThe authors selected the AI systems for inclusion based on the following necessary criteria:\n\u2014 Have an explicit learning component\n\u2014 Showcase experimental results\n\u2014 Advance the state of the art\n\nIn addition, the systems had to meet at least one of the following notability criteria:\n\u2014 Paper has more than 1000 citations\n\u2014 Historical importance\n\u2014 Important state-of-the-art advance\n\u2014 Deployed in a notable context\n\nThe authors note that: \"For new models (from 2020 onward) it is harder to assess these criteria, so we fall back to a subjective selection. We refer to models meeting our selection criteria as 'milestone models.\"\n", "producer": "Epoch AI", "citationFull": "Epoch AI, \u2018Parameter, Compute and Data Trends in Machine Learning\u2019. Published online at epochai.org. Retrieved from: \u2018https://epoch.ai/data/epochdb/visualization\u2019 [online resource]", "urlMain": "https://epoch.ai/mlinputs/visualization", "urlDownload": "https://epoch.ai/data/epochdb/notable_ai_models.csv", "dateAccessed": "2025-10-09", "datePublished": "2025", "license": {"url": "https://creativecommons.org/licenses/by/4.0/", "name": "CC BY 4.0"}}]}