{"id": 1015501, "name": "Training dataset size", "unit": "", "createdAt": "2025-03-15T08:53:17.000Z", "updatedAt": "2025-10-09T11:02:56.000Z", "coverage": "", "timespan": "", "datasetId": 6999, "shortUnit": "", "columnOrder": 0, "shortName": "training_dataset_size__datapoints", "catalogPath": "grapher/artificial_intelligence/2025-03-12/epoch/epoch#training_dataset_size__datapoints", "descriptionShort": "The number of examples provided to train an AI model. Typically, more data results in a more comprehensive understanding by the model.", "type": "int", "dataChecksum": "6804382938976339769", "metadataChecksum": "-3216778474537820797", "datasetName": "Parameter, Compute and Data Trends in Machine Learning", "updatePeriodDays": 31, "datasetVersion": "2025-03-12", "nonRedistributable": false, "display": {"zeroDay": "1949-01-01", "yearIsDay": true}, "schemaVersion": 2, "processingLevel": "major", "presentation": {"topicTagsLinks": ["Artificial Intelligence"]}, "descriptionKey": ["Training data size refers to the volume of data employed to train an artificial intelligence (AI) model effectively. It's a representation of the number of examples that the model learns from during its training process. It is a fundamental measure of the scope of the data used in the model's learning phase.", "To grasp the concept of training data size, imagine teaching a friend the art of distinguishing different types of birds. In this analogy, each bird picture presented to your friend corresponds to an individual piece of training data. If you showed them 100 unique bird photos, then the training data size in this scenario would be quantified as 100.", "Training data size is an essential indicator in AI and machine learning. First and foremost, it directly impacts the depth of learning achieved by the model. The more extensive the dataset, the more profound and comprehensive the model's understanding of the subject matter becomes. Additionally, a large training data size contributes significantly to improved recognition capabilities. By exposing the model to a diverse array of examples, it becomes adept at identifying subtle nuances, much like how it becomes skilled at distinguishing various bird species through exposure to a large variety of bird images."], "dimensions": {"years": {"values": [{"id": 25443}, {"id": 24432}, {"id": 24820}, {"id": 25282}, {"id": 23936}, {"id": 4198}, {"id": 24096}, {"id": 26428}, {"id": 27603}, {"id": 25835}, {"id": 25971}, {"id": 26459}, {"id": 27534}, {"id": 16403}, {"id": 12661}, {"id": 25727}, {"id": 25055}, {"id": 25105}, {"id": 25150}, {"id": 26684}, {"id": 17350}, {"id": 23892}, {"id": 23283}, {"id": 26876}, {"id": 25946}, {"id": 26266}, {"id": 27521}, {"id": 26574}, {"id": 24497}, {"id": 25127}, {"id": 27292}, {"id": 26940}, {"id": 25841}, {"id": 25175}, {"id": 27298}, {"id": 27043}, {"id": 27456}, {"id": 27091}, {"id": 27234}, {"id": 26620}, {"id": 25485}, {"id": 25676}, {"id": 26759}, {"id": 24780}, {"id": 26854}, {"id": 27334}, {"id": 24348}, {"id": 13787}, {"id": 15142}, {"id": 25871}, {"id": 22178}, {"id": 22127}, {"id": 17836}, {"id": 25441}, {"id": 25392}, {"id": 21878}, {"id": 26307}, {"id": 26884}, {"id": 27116}, {"id": 16039}, {"id": 24649}, {"id": 24263}, {"id": 26302}, {"id": 24639}, {"id": 22905}, {"id": 22920}, {"id": 27326}, {"id": 26267}, {"id": 26291}, {"id": 27301}, {"id": 27015}, {"id": 25880}, {"id": 16442}, {"id": 27327}, {"id": 26750}, {"id": 26457}, {"id": 26827}, {"id": 27164}, {"id": 26848}, {"id": 26485}, {"id": 27375}, {"id": 27337}, {"id": 26811}, {"id": 26443}, {"id": 9739}, {"id": 24305}, {"id": 26059}, {"id": 25042}, {"id": 24791}, {"id": 26758}, {"id": 27479}, {"id": 25140}, {"id": 24324}, {"id": 27054}, {"id": 26078}, {"id": 27131}, {"id": 26819}, {"id": 25171}, {"id": 24781}, {"id": 26524}, {"id": 23347}, {"id": 24161}, {"id": 26458}, {"id": 26147}, {"id": 23714}, {"id": 19334}, {"id": 7425}, {"id": 22763}, {"id": 21017}, {"id": 24312}, {"id": 21735}, {"id": 26722}, {"id": 27561}, {"id": 27751}, {"id": 24842}, {"id": 24001}, {"id": 26312}, {"id": 26289}, {"id": 26669}, {"id": 26939}, {"id": 23391}, {"id": 13740}, {"id": 27780}, {"id": 27694}, {"id": 27037}, {"id": 23164}, {"id": 25324}, {"id": 25062}, {"id": 26014}, {"id": 26483}, {"id": 26654}, {"id": 26150}, {"id": 26662}, {"id": 26864}, {"id": 27569}, {"id": 26980}, {"id": 27736}, {"id": 27954}, {"id": 27833}, {"id": 26471}, {"id": 24828}, {"id": 27792}, {"id": 26543}, {"id": 27276}, {"id": 27101}, {"id": 25983}, {"id": 25517}, {"id": 25731}, {"id": 26623}, {"id": 26472}, {"id": 23912}, {"id": 23901}, {"id": 24752}, {"id": 25077}, {"id": 27975}, {"id": 26878}, {"id": 27533}, {"id": 26644}, {"id": 24740}, {"id": 21892}, {"id": 27360}, {"id": 26505}, {"id": 25353}, {"id": 25611}, {"id": 26809}, {"id": 26080}, {"id": 26702}, {"id": 22080}, {"id": 25521}, {"id": 25047}, {"id": 26113}, {"id": 26982}, {"id": 26794}, {"id": 26946}, {"id": 26361}, {"id": 26226}, {"id": 23741}, {"id": 15979}, {"id": 24000}, {"id": 26639}, {"id": 27335}, {"id": 16730}, {"id": 21891}, {"id": 24818}, {"id": 14940}, {"id": 12874}, {"id": 20459}, {"id": 20629}, {"id": 27703}, {"id": 27828}, {"id": 27024}, {"id": 27205}, {"id": 26550}, {"id": 18444}, {"id": 16236}, {"id": 25237}, {"id": 15248}, {"id": 25094}, {"id": 23729}, {"id": 26805}, {"id": 24441}, {"id": 26689}, {"id": 26976}, {"id": 27214}, {"id": 20266}, {"id": 25027}, {"id": 16771}, {"id": 27268}, {"id": 26520}, {"id": 26259}, {"id": 21356}, {"id": 25624}, {"id": 27950}, {"id": 22240}, {"id": 26651}, {"id": 17131}, {"id": 27082}, {"id": 27336}, {"id": 27611}, {"id": 20423}, {"id": 24051}, {"id": 24599}, {"id": 17850}, {"id": 23262}, {"id": 18263}, {"id": 25468}, {"id": 6513}, {"id": 26207}, {"id": 26703}, {"id": 25734}, {"id": 18201}, {"id": 4899}, {"id": 27226}, {"id": 27501}, {"id": 27597}, {"id": 27660}, {"id": 27733}, {"id": 27853}, {"id": 27368}, {"id": 28002}, {"id": 26646}, {"id": 14457}, {"id": 14610}, {"id": 27466}, {"id": 25905}, {"id": 26341}, {"id": 24142}, {"id": 17320}, {"id": 27362}, {"id": 26612}, {"id": 22133}, {"id": 547}, {"id": 2922}, {"id": 4106}, {"id": 14035}, {"id": 14944}, {"id": 15826}, {"id": 18959}, {"id": 24077}, {"id": 25323}, {"id": 26842}, {"id": 2250}, {"id": 3833}, {"id": 27653}, {"id": 11413}, {"id": 17044}, {"id": 26308}, {"id": 25959}, {"id": 25826}, {"id": 26581}, {"id": 25510}, {"id": 26357}, {"id": 24964}, {"id": 24859}, {"id": 27670}, {"id": 25889}, {"id": 25520}, {"id": 23521}, {"id": 23914}, {"id": 25681}, {"id": 26058}, {"id": 15126}, {"id": 26849}, {"id": 22956}, {"id": 23958}, {"id": 19796}, {"id": 27688}, {"id": 24534}, {"id": 27346}, {"id": 27558}, {"id": 25881}, {"id": 27042}, {"id": 27165}, {"id": 26784}, {"id": 24843}, {"id": 26865}, {"id": 26685}, {"id": 25913}, {"id": 27557}, {"id": 26560}, {"id": 26919}, {"id": 27176}, {"id": 26756}, {"id": 27157}, {"id": 27106}, {"id": 27858}, {"id": 23874}, {"id": 26835}, {"id": 27267}, {"id": 26546}, {"id": 25137}, {"id": 9070}, {"id": 24792}, {"id": 22537}, {"id": 26176}, {"id": 26840}, {"id": 26602}, {"id": 26421}, {"id": 27067}, {"id": 25085}, {"id": 25233}, {"id": 27361}, {"id": 27263}, {"id": 27427}, {"id": 27551}, {"id": 27655}, {"id": 27877}, {"id": 27961}, {"id": 28006}, {"id": 28023}, {"id": 24643}, {"id": 15279}, {"id": 22012}, {"id": 23649}, {"id": 18414}, {"id": 26700}, {"id": 22548}, {"id": 23720}, {"id": 23913}, {"id": 23984}, {"id": 20672}, {"id": 25688}, {"id": 27481}, {"id": 22445}, {"id": 22823}, {"id": 24449}, {"id": 24731}, {"id": 25748}, {"id": 27309}, {"id": 25138}, {"id": 26601}, {"id": 24406}, {"id": 24054}, {"id": 26507}, {"id": 17562}, {"id": 24772}, {"id": 23909}, {"id": 26352}, {"id": 24447}, {"id": 26710}, {"id": 27758}, {"id": 20986}, {"id": 25651}, {"id": 27889}, {"id": 26742}, {"id": 27122}, {"id": 23993}, {"id": 16283}, {"id": 27113}, {"id": 27330}, {"id": 23922}, {"id": 26766}, {"id": 26765}, {"id": 27156}, {"id": 26723}, {"id": 25547}, {"id": 25903}, {"id": 26469}, {"id": 22280}, {"id": 17335}, {"id": 25862}, {"id": 24073}, {"id": 25970}, {"id": 27656}, {"id": 26008}, {"id": 19505}, {"id": 26561}, {"id": 24999}, {"id": 25472}, {"id": 25575}, {"id": 25897}, {"id": 26002}, {"id": 14044}, {"id": 25489}, {"id": 25975}, {"id": 22158}, {"id": 24243}, {"id": 26792}, {"id": 26380}, {"id": 26054}, {"id": 23203}, {"id": 27032}, {"id": 24779}, {"id": 24106}, {"id": 23987}, {"id": 27373}, {"id": 27516}, {"id": 24455}, {"id": 22814}, {"id": 27068}, {"id": 26456}, {"id": 26616}, {"id": 26227}, {"id": 15675}, {"id": 26926}, {"id": 23664}, {"id": 25875}, {"id": 26526}, {"id": 24751}, {"id": 26515}, {"id": 25299}, {"id": 27333}, {"id": 27526}, {"id": 26582}, {"id": 25343}, {"id": 26682}, {"id": 27945}, {"id": 26968}, {"id": 24181}, {"id": 22443}, {"id": 27338}, {"id": 26969}, {"id": 26225}, {"id": 27382}, {"id": 25800}, {"id": 21335}]}, "entities": {"values": [{"id": 368101, "name": "(ensemble): AWD-LSTM-DOC (fin) \u00d7 5 (WT2)", "code": null}, {"id": 371824, "name": "3DDFA", "code": null}, {"id": 371809, "name": "3DMM-CNN", "code": null}, {"id": 368102, "name": "4 layer QRNN (h=2500)", "code": null}, {"id": 371825, "name": "ACF-WIDER", "code": null}, {"id": 256995, "name": "ADALINE", "code": null}, {"id": 257024, "name": "ADAM (CIFAR-10)", "code": null}, {"id": 368328, "name": "ADM", "code": null}, {"id": 370093, "name": "AFM-on-device", "code": null}, {"id": 370105, "name": "AFM-server", "code": null}, {"id": 306125, "name": "ALBERT", "code": null}, {"id": 257107, "name": "ALBERT-xxlarge", "code": null}, {"id": 257103, "name": "ALIGN", "code": null}, {"id": 371471, "name": "ALLaM 7B", "code": null}, {"id": 371545, "name": "ALLaM\u00a0adapted 70B", "code": null}, {"id": 369966, "name": "ANN Eye Tracker", "code": null}, {"id": 305984, "name": "ASE+ACE", "code": null}, {"id": 368026, "name": "AWD-LSTM + MoS + Partial Shuffled", "code": null}, {"id": 368011, "name": "AWD-LSTM - 3-layer LSTM (tied) + continuous cache pointer (WT2)", "code": null}, {"id": 368044, "name": "AWD-LSTM+WT+Cache+IOG (WT2)", "code": null}, {"id": 368041, "name": "AWD-LSTM-MoS + dynamic evaluation (WT2, 2017)", "code": null}, {"id": 370245, "name": "AbLang (heavy sequences)", "code": null}, {"id": 370523, "name": "AdaBoost.M2 Digit Recognition", "code": null}, {"id": 369562, "name": "AdaRNN", "code": null}, {"id": 240132, "name": "AlexNet", "code": null}, {"id": 354863, "name": "AlexaTM 20B", "code": null}, {"id": 257063, "name": "AlphaFold", "code": null}, {"id": 368138, "name": "AlphaFold 2", "code": null}, {"id": 371962, "name": "AlphaFold 3", "code": null}, {"id": 368719, "name": "AlphaFold-Multimer", "code": null}, {"id": 257029, "name": "AlphaGo Lee", "code": null}, {"id": 257039, "name": "AlphaGo Zero", "code": null}, {"id": 368737, "name": "AlphaMissense", "code": null}, {"id": 371829, "name": "AlphaTensor", "code": null}, {"id": 257056, "name": "AlphaX-1", "code": null}, {"id": 240145, "name": "AlphaZero", "code": null}, {"id": 370131, "name": "Amazon Titan", "code": null}, {"id": 368732, "name": "Ankh_large", "code": null}, {"id": 371244, "name": "Aramco Metabrain AI", "code": null}, {"id": 369016, "name": "AudioGen", "code": null}, {"id": 369002, "name": "AudioLM", "code": null}, {"id": 368364, "name": "BASIC-L", "code": null}, {"id": 257045, "name": "BERT-Large", "code": null}, {"id": 368077, "name": "BERT-Large-CAS (PTB+WT2+WT103)", "code": null}, {"id": 368730, "name": "BERT-RBP", "code": null}, {"id": 369179, "name": "BIDAF", "code": null}, {"id": 368746, "name": "BLOOM-176B", "code": null}, {"id": 368337, "name": "BLUUMI", "code": null}, {"id": 306063, "name": "BPE", "code": null}, {"id": 257004, "name": "Back-propagation", "code": null}, {"id": 369990, "name": "Bankruptcy-NN", "code": null}, {"id": 368067, "name": "Base LM + kNN LM + Continuous Cache", "code": null}, {"id": 306016, "name": "BellKor 2007", "code": null}, {"id": 306025, "name": "BellKor 2008", "code": null}, {"id": 367305, "name": "BellKor 2009", "code": null}, {"id": 367568, "name": "Bidirectional RNN", "code": null}, {"id": 369348, "name": "Big Transformer for Back-Translation", "code": null}, {"id": 368035, "name": "Big-Little Net", "code": null}, {"id": 369018, "name": "Big-Little Net (speech)", "code": null}, {"id": 306022, "name": "BigChaos 2008", "code": null}, {"id": 368038, "name": "BigChaos OptiBlend", "code": null}, {"id": 306151, "name": "BigSSL", "code": null}, {"id": 368716, "name": "BlenderBot 3", "code": null}, {"id": 367081, "name": "BloombergGPT", "code": null}, {"id": 369995, "name": "Boosting", "code": null}, {"id": 371844, "name": "CCL", "code": null}, {"id": 371875, "name": "CFSS", "code": null}, {"id": 306154, "name": "CLIP (ResNet-50)", "code": null}, {"id": 257076, "name": "CLIP (ViT L/14@336px)", "code": null}, {"id": 371891, "name": "CMS-RCNN", "code": null}, {"id": 371842, "name": "CNN Committee (MNIST)", "code": null}, {"id": 371862, "name": "CNN Committee (NIST)", "code": null}, {"id": 371863, "name": "CNN committee (traffic sign)", "code": null}, {"id": 368129, "name": "CODEFUSION (Python)", "code": null}, {"id": 257074, "name": "CPM-Large", "code": null}, {"id": 368078, "name": "CT-MoS (WT2)", "code": null}, {"id": 368728, "name": "CTM (CIFAR-10)", "code": null}, {"id": 368726, "name": "CaLM", "code": null}, {"id": 369328, "name": "CamemBERT", "code": null}, {"id": 369972, "name": "Ceramic-MLP", "code": null}, {"id": 369998, "name": "ChatGLM3-6B", "code": null}, {"id": 273166, "name": "Chinchilla", "code": null}, {"id": 368362, "name": "CoAtNet", "code": null}, {"id": 368373, "name": "CoCa", "code": null}, {"id": 369165, "name": "CoEdiT-xxl", "code": null}, {"id": 368133, "name": "CodeT5-large", "code": null}, {"id": 306167, "name": "Codex", "code": null}, {"id": 369518, "name": "CogAgent", "code": null}, {"id": 370174, "name": "CogVLM-17B", "code": null}, {"id": 366986, "name": "CogVideo", "code": null}, {"id": 257084, "name": "CogView", "code": null}, {"id": 305980, "name": "Cognitron", "code": null}, {"id": 371894, "name": "CompACT-Deep", "code": null}, {"id": 368375, "name": "ContextNet", "code": null}, {"id": 369558, "name": "ConvS2S (ensemble of 8 models)", "code": null}, {"id": 371858, "name": "DAC-CSR", "code": null}, {"id": 257077, "name": "DALL-E", "code": null}, {"id": 306186, "name": "DALL\u00b7E 2", "code": null}, {"id": 369883, "name": "DBRX", "code": null}, {"id": 369187, "name": "DCN+", "code": null}, {"id": 371859, "name": "DCNN", "code": null}, {"id": 368354, "name": "DDPM-IP (CelebA)", "code": null}, {"id": 368341, "name": "DETR", "code": null}, {"id": 369173, "name": "DINOv2", "code": null}, {"id": 368096, "name": "DITTO", "code": null}, {"id": 371892, "name": "DL scaling Image", "code": null}, {"id": 371900, "name": "DL scaling LM", "code": null}, {"id": 371893, "name": "DL scaling speech", "code": null}, {"id": 371895, "name": "DLDL (PASCAL)", "code": null}, {"id": 368748, "name": "DNABERT", "code": null}, {"id": 371854, "name": "DNN EM segmentation", "code": null}, {"id": 306056, "name": "DQN-2015", "code": null}, {"id": 306150, "name": "DeBERTa", "code": null}, {"id": 370248, "name": "DeLighT", "code": null}, {"id": 369963, "name": "DeViSE", "code": null}, {"id": 257009, "name": "Decision tree (classification)", "code": null}, {"id": 369520, "name": "Decision tree adaline", "code": null}, {"id": 370720, "name": "Deep Autoencoders", "code": null}, {"id": 306014, "name": "Deep Belief Nets", "code": null}, {"id": 371870, "name": "Deep CNN + COTS", "code": null}, {"id": 367308, "name": "Deep Multitask NLP Network", "code": null}, {"id": 306184, "name": "DeepNet", "code": null}, {"id": 370239, "name": "DeepSeek-Coder-V2 236B", "code": null}, {"id": 371328, "name": "DeepSeek-V3", "code": null}, {"id": 257034, "name": "DeepStack", "code": null}, {"id": 367494, "name": "Deeply-supervised nets", "code": null}, {"id": 366051, "name": "DeiT-B", "code": null}, {"id": 306166, "name": "Denoising Diffusion Probabilistic Models (LSUN Bedroom)", "code": null}, {"id": 369162, "name": "DensePhrases", "code": null}, {"id": 369202, "name": "Detic", "code": null}, {"id": 369185, "name": "DiffDock", "code": null}, {"id": 370719, "name": "Dimensionality Reduction", "code": null}, {"id": 369539, "name": "DistBelief NNLM", "code": null}, {"id": 369538, "name": "DistBelief Speech", "code": null}, {"id": 369541, "name": "DistBelief Vision", "code": null}, {"id": 369560, "name": "Distributed representation NN", "code": null}, {"id": 371331, "name": "Doubao-1.5-pro", "code": null}, {"id": 371248, "name": "Doubao-pro", "code": null}, {"id": 371511, "name": "DreamerV3", "code": null}, {"id": 306036, "name": "Dropout (CIFAR)", "code": null}, {"id": 306037, "name": "Dropout (ImageNet)", "code": null}, {"id": 257017, "name": "Dropout (MNIST)", "code": null}, {"id": 306035, "name": "Dropout (TIMIT)", "code": null}, {"id": 368004, "name": "Dropout-LSTM+Noise(Bernoulli) (WT2)", "code": null}, {"id": 369556, "name": "Dropout: SVHN", "code": null}, {"id": 368056, "name": "EI-REHN-1000D", "code": null}, {"id": 306136, "name": "ELECTRA", "code": null}, {"id": 368718, "name": "EMDR", "code": null}, {"id": 257087, "name": "ERNIE 3.0", "code": null}, {"id": 368055, "name": "ERNIE 3.0 Titan", "code": null}, {"id": 306133, "name": "ERNIE-GEN (large)", "code": null}, {"id": 368093, "name": "ERNIE-ViLG", "code": null}, {"id": 369022, "name": "ESM2-15B", "code": null}, {"id": 369981, "name": "ESM3 (98B)", "code": null}, {"id": 369545, "name": "EVA-01", "code": null}, {"id": 371364, "name": "EXAONE 3.5 32B", "code": null}, {"id": 371817, "name": "EXAONE 4.0 (32B)", "code": null}, {"id": 371472, "name": "EXAONE Deep 32B", "code": null}, {"id": 370177, "name": "EfficientNetV2-XL", "code": null}, {"id": 371866, "name": "EnhanceNet", "code": null}, {"id": 371539, "name": "Eurus-2-7B-PRIME", "code": null}, {"id": 368372, "name": "FLAN 137B", "code": null}, {"id": 369508, "name": "Falcon-180B", "code": null}, {"id": 367636, "name": "Falcon-40B", "code": null}, {"id": 368047, "name": "Feedback Transformer", "code": null}, {"id": 370722, "name": "Fine-tuned-AWD-LSTM-DOC (fin)", "code": null}, {"id": 306122, "name": "FixRes ResNeXt-101 WSL", "code": null}, {"id": 35176, "name": "Florence", "code": null}, {"id": 369017, "name": "Fold2Seq", "code": null}, {"id": 369994, "name": "Fragment embedding", "code": null}, {"id": 368045, "name": "Fraternal dropout + AWD-LSTM 3-layer (WT2)", "code": null}, {"id": 369532, "name": "FunSearch", "code": null}, {"id": 257021, "name": "GANs", "code": null}, {"id": 371896, "name": "GAWWN", "code": null}, {"id": 368086, "name": "GL-LWGC-AWD-MoS-LSTM + dynamic evaluation (WT2)", "code": null}, {"id": 371946, "name": "GLM 4.5", "code": null}, {"id": 365992, "name": "GLM-130B", "code": null}, {"id": 370087, "name": "GLM-4 (0520)", "code": null}, {"id": 368065, "name": "GLaM", "code": null}, {"id": 257030, "name": "GNMT", "code": null}, {"id": 371852, "name": "GNN", "code": null}, {"id": 370977, "name": "GNoME for crystal discovery", "code": null}, {"id": 273165, "name": "GOAT", "code": null}, {"id": 370176, "name": "GPT-1", "code": null}, {"id": 369043, "name": "GPT-2 (1.5B)", "code": null}, {"id": 371857, "name": "GPT-2 Medium (FlashAttention)", "code": null}, {"id": 354864, "name": "GPT-3 175B (davinci)", "code": null}, {"id": 363052, "name": "GPT-4", "code": null}, {"id": 257096, "name": "GPT-NeoX-20B", "code": null}, {"id": 257011, "name": "GPU DBNs", "code": null}, {"id": 306110, "name": "GPipe (Transformer)", "code": null}, {"id": 369182, "name": "GSM", "code": null}, {"id": 257072, "name": "GShard (dense)", "code": null}, {"id": 368109, "name": "Galactica", "code": null}, {"id": 306191, "name": "Gato", "code": null}, {"id": 369025, "name": "GenSLM", "code": null}, {"id": 307047, "name": "Generative BST", "code": null}, {"id": 369146, "name": "German ELECTRA Large", "code": null}, {"id": 306049, "name": "GloVe (32B)", "code": null}, {"id": 306048, "name": "GloVe (6B)", "code": null}, {"id": 369984, "name": "Golem", "code": null}, {"id": 257026, "name": "GoogLeNet / InceptionV1", "code": null}, {"id": 367255, "name": "Gopher (280B)", "code": null}, {"id": 368376, "name": "Grok-1", "code": null}, {"id": 305993, "name": "GroupLens", "code": null}, {"id": 369564, "name": "HLBL", "code": null}, {"id": 371890, "name": "HR-ResNet101", "code": null}, {"id": 369521, "name": "Handwritten Digit Recognition System", "code": null}, {"id": 369510, "name": "Hierarchical Cognitron", "code": null}, {"id": 369987, "name": "Hierarchical LM", "code": null}, {"id": 367524, "name": "Histograms of Oriented Gradients", "code": null}, {"id": 257088, "name": "HuBERT", "code": null}, {"id": 371242, "name": "Hunyuan-Large", "code": null}, {"id": 371513, "name": "Hunyuan-TurboS", "code": null}, {"id": 368002, "name": "Hybrid H3-2.7B", "code": null}, {"id": 369030, "name": "HyenaDNA", "code": null}, {"id": 369563, "name": "HyperCLOVA 204B", "code": null}, {"id": 305998, "name": "IBM Model 4", "code": null}, {"id": 305992, "name": "IBM-5", "code": null}, {"id": 257040, "name": "IMPALA", "code": null}, {"id": 369986, "name": "ISR network", "code": null}, {"id": 368023, "name": "ISS", "code": null}, {"id": 306044, "name": "Image generation", "code": null}, {"id": 306192, "name": "Imagen", "code": null}, {"id": 306064, "name": "Inception v3", "code": null}, {"id": 371237, "name": "InstructGPT 175B", "code": null}, {"id": 369198, "name": "InternImage", "code": null}, {"id": 367509, "name": "InternLM", "code": null}, {"id": 369971, "name": "Invariant CNN", "code": null}, {"id": 257037, "name": "JFT", "code": null}, {"id": 369964, "name": "JPMAX", "code": null}, {"id": 367533, "name": "Jais", "code": null}, {"id": 257090, "name": "Jurassic-1-Jumbo", "code": null}, {"id": 268376, "name": "KEPLER", "code": null}, {"id": 369975, "name": "KN-LM", "code": null}, {"id": 366991, "name": "KataGo", "code": null}, {"id": 371815, "name": "Kimi K2", "code": null}, {"id": 371881, "name": "LCNP LabelMe", "code": null}, {"id": 371878, "name": "LCNP MNIST", "code": null}, {"id": 371874, "name": "LCNP NORB", "code": null}, {"id": 368722, "name": "LDM-1.45B", "code": null}, {"id": 368999, "name": "LEP-AD", "code": null}, {"id": 369970, "name": "LISSOM", "code": null}, {"id": 367499, "name": "LLaMA-65B", "code": null}, {"id": 369331, "name": "LLaVA 1.5", "code": null}, {"id": 371536, "name": "LLaVA-OV-72B", "code": null}, {"id": 369988, "name": "LMICA", "code": null}, {"id": 306054, "name": "LRCN", "code": null}, {"id": 371902, "name": "LRR-4X", "code": null}, {"id": 256998, "name": "LSTM", "code": null}, {"id": 369534, "name": "LSTM LM", "code": null}, {"id": 354866, "name": "LSTM with forget gates", "code": null}, {"id": 368009, "name": "LSTM+NeuralCache", "code": null}, {"id": 369517, "name": "LTE speaker verification system", "code": null}, {"id": 369195, "name": "LUKE", "code": null}, {"id": 257095, "name": "LaMDA", "code": null}, {"id": 368358, "name": "LaNet-L (CIFAR-10)", "code": null}, {"id": 245542, "name": "LeNet-5", "code": null}, {"id": 369543, "name": "Linear Decision Functions", "code": null}, {"id": 368743, "name": "Llama 2-70B", "code": null}, {"id": 368747, "name": "Llama 2-7B", "code": null}, {"id": 369516, "name": "Llama 3-70B", "code": null}, {"id": 370155, "name": "Llama 3.1-405B", "code": null}, {"id": 371540, "name": "Llama 3.2 11B", "code": null}, {"id": 371535, "name": "Llama 3.3 70B", "code": null}, {"id": 371514, "name": "Llama 4 Behemoth (preview)", "code": null}, {"id": 371860, "name": "Llama 4 Maverick", "code": null}, {"id": 371843, "name": "Llama 4 Scout", "code": null}, {"id": 369174, "name": "Llama Guard", "code": null}, {"id": 372167, "name": "LongCat-Flash", "code": null}, {"id": 369203, "name": "LongT5", "code": null}, {"id": 306163, "name": "M6-T", "code": null}, {"id": 369973, "name": "MLN-ASR", "code": null}, {"id": 369527, "name": "MLP baggage detector", "code": null}, {"id": 369524, "name": "MM1-30B", "code": null}, {"id": 371947, "name": "MMLSTM (PTB)", "code": null}, {"id": 371951, "name": "MMLSTM (WT-2)", "code": null}, {"id": 368753, "name": "MSA Transformer", "code": null}, {"id": 257025, "name": "MSRA (C, PReLU)", "code": null}, {"id": 369993, "name": "MUSIC perceptron", "code": null}, {"id": 369011, "name": "Mamba-24M (SC09)", "code": null}, {"id": 370718, "name": "Masked Autoencoders ViT-H", "code": null}, {"id": 367462, "name": "MatrixFac for Recommenders", "code": null}, {"id": 370172, "name": "Maximum compute", "code": null}, {"id": 370173, "name": "Maximum data", "code": null}, {"id": 370175, "name": "Maximum parameters", "code": null}, {"id": 257064, "name": "Meena", "code": null}, {"id": 257055, "name": "Megatron-BERT", "code": null}, {"id": 371869, "name": "Megatron-LM (1.2B)", "code": null}, {"id": 368027, "name": "Megatron-LM (8.3B)", "code": null}, {"id": 257092, "name": "Megatron-Turing NLG 530B", "code": null}, {"id": 369326, "name": "Mesh-TensorFlow Transformer 2.9B (translation)", "code": null}, {"id": 370525, "name": "Mesh-TensorFlow Transformer 4.9B (language)", "code": null}, {"id": 257104, "name": "Meta Pseudo Labels", "code": null}, {"id": 306195, "name": "Minerva (540B)", "code": null}, {"id": 369991, "name": "Mixture of linear models", "code": null}, {"id": 369153, "name": "Mnemonic Reader", "code": null}, {"id": 370178, "name": "MoE-Multi", "code": null}, {"id": 371329, "name": "Movie Gen Video", "code": null}, {"id": 306130, "name": "MuZero", "code": null}, {"id": 368028, "name": "Multi-cell LSTM", "code": null}, {"id": 369989, "name": "Multilingual DNN", "code": null}, {"id": 306047, "name": "Multiresolution CNN", "code": null}, {"id": 371848, "name": "MuseNet", "code": null}, {"id": 370527, "name": "NAS with base 8 and shared embeddings", "code": null}, {"id": 371942, "name": "NAS+ESS (23M)", "code": null}, {"id": 257031, "name": "NASv3 (CIFAR-10)", "code": null}, {"id": 371841, "name": "NETtalk reimplementation", "code": null}, {"id": 306196, "name": "NLLB", "code": null}, {"id": 306033, "name": "NLP from scratch", "code": null}, {"id": 371884, "name": "NPD", "code": null}, {"id": 371812, "name": "NPLM (AP News)", "code": null}, {"id": 371816, "name": "NPLM (Brown)", "code": null}, {"id": 371240, "name": "NVLM-D 72B", "code": null}, {"id": 371234, "name": "NVLM-H 72B", "code": null}, {"id": 371232, "name": "NVLM-X 72B", "code": null}, {"id": 257109, "name": "Named Entity Recognition model", "code": null}, {"id": 369019, "name": "Nemotron-3-8B", "code": null}, {"id": 369982, "name": "Nemotron-4 340B", "code": null}, {"id": 256996, "name": "Neocognitron", "code": null}, {"id": 368075, "name": "NetTalk (dictionary)", "code": null}, {"id": 368083, "name": "NetTalk (transcription)", "code": null}, {"id": 369997, "name": "Neural LM", "code": null}, {"id": 369992, "name": "NeuroChess", "code": null}, {"id": 306128, "name": "Noisy Student (L2)", "code": null}, {"id": 369009, "name": "Nucleotide Transformer", "code": null}, {"id": 366449, "name": "ONE-PEACE", "code": null}, {"id": 306188, "name": "OPT-175B", "code": null}, {"id": 368323, "name": "OR-WideResNet", "code": null}, {"id": 369033, "name": "OmegaPLM", "code": null}, {"id": 369013, "name": "OntoProtein", "code": null}, {"id": 257061, "name": "OpenAI Five", "code": null}, {"id": 257062, "name": "OpenAI Five Rerun", "code": null}, {"id": 370246, "name": "OpenVLA", "code": null}, {"id": 366990, "name": "PLATO-XL", "code": null}, {"id": 368374, "name": "PaLI", "code": null}, {"id": 368325, "name": "PaLI-X", "code": null}, {"id": 273167, "name": "PaLM (540B)", "code": null}, {"id": 365387, "name": "PaLM 2", "code": null}, {"id": 368069, "name": "PanGu-\u03a3", "code": null}, {"id": 371529, "name": "Pangu Ultra", "code": null}, {"id": 369565, "name": "Paragraph Vector", "code": null}, {"id": 306194, "name": "Parti", "code": null}, {"id": 369042, "name": "PeptideBERT", "code": null}, {"id": 369024, "name": "Perceptron (1960)", "code": null}, {"id": 257002, "name": "Perceptron Mark I", "code": null}, {"id": 368104, "name": "PermuteFormer", "code": null}, {"id": 369175, "name": "PhraseCond", "code": null}, {"id": 369537, "name": "Piecewise linear model", "code": null}, {"id": 369980, "name": "PoE MNIST", "code": null}, {"id": 368076, "name": "Pointer Sentinel-LSTM (medium)", "code": null}, {"id": 306078, "name": "PolyNet", "code": null}, {"id": 371871, "name": "Pooling CNN (Caltech 101)", "code": null}, {"id": 371872, "name": "Pooling CNN (NORB)", "code": null}, {"id": 368137, "name": "Pragmatic Theory solution (Netflix 2009)", "code": null}, {"id": 369996, "name": "Predictive Coding NN", "code": null}, {"id": 368338, "name": "ProBERTa", "code": null}, {"id": 369037, "name": "ProGen2-xlarge", "code": null}, {"id": 368339, "name": "Projected GAN", "code": null}, {"id": 369015, "name": "ProtBERT-BFD", "code": null}, {"id": 371533, "name": "ProtT5-XL-U50", "code": null}, {"id": 368350, "name": "ProteinBERT", "code": null}, {"id": 369036, "name": "ProteinDT", "code": null}, {"id": 368342, "name": "PyramidNet", "code": null}, {"id": 368074, "name": "QRNN", "code": null}, {"id": 368736, "name": "Qwen-72B", "code": null}, {"id": 369167, "name": "Qwen-VL", "code": null}, {"id": 371236, "name": "Qwen1.5-72B", "code": null}, {"id": 370106, "name": "Qwen2-72B", "code": null}, {"id": 371474, "name": "Qwen2.5-32B", "code": null}, {"id": 370534, "name": "Qwen2.5-72B", "code": null}, {"id": 371991, "name": "Qwen3-235B-A22B", "code": null}, {"id": 371987, "name": "Qwen3-Coder-480B-A35B", "code": null}, {"id": 371992, "name": "Qwen3-Max", "code": null}, {"id": 372169, "name": "Qwen3-Omni-30B-A3B", "code": null}, {"id": 257105, "name": "R-FCN", "code": null}, {"id": 369526, "name": "RAAM", "code": null}, {"id": 368330, "name": "RBM Image Classifier", "code": null}, {"id": 369513, "name": "RCTM", "code": null}, {"id": 371856, "name": "RECONTRA-categorized", "code": null}, {"id": 371849, "name": "RECONTRA-uncategorized", "code": null}, {"id": 306180, "name": "RETRO-7B", "code": null}, {"id": 369528, "name": "RNN LM", "code": null}, {"id": 369344, "name": "RNN for 1B words", "code": null}, {"id": 368752, "name": "RNN-WER", "code": null}, {"id": 257022, "name": "RNNsearch-50*", "code": null}, {"id": 369535, "name": "RNTN", "code": null}, {"id": 370522, "name": "RankNet", "code": null}, {"id": 371964, "name": "RaptorX-Contact", "code": null}, {"id": 369542, "name": "ReALM", "code": null}, {"id": 306030, "name": "ReLU (NORB)", "code": null}, {"id": 369553, "name": "Recursive Neural Network", "code": null}, {"id": 306104, "name": "ResNeXt-101 32x48d", "code": null}, {"id": 371899, "name": "ResNet-101 (ImageNet)", "code": null}, {"id": 257028, "name": "ResNet-152 (ImageNet)", "code": null}, {"id": 366658, "name": "ResNet-200", "code": null}, {"id": 308274, "name": "RetinaNet-R101", "code": null}, {"id": 365388, "name": "RoBERTa Large", "code": null}, {"id": 371966, "name": "RoseTTAFold All-Atom (RFAA)", "code": null}, {"id": 368720, "name": "S-Norm", "code": null}, {"id": 368092, "name": "S4", "code": null}, {"id": 371882, "name": "SAF R-CNN", "code": null}, {"id": 369976, "name": "SB-LM", "code": null}, {"id": 369557, "name": "SC-NLM", "code": null}, {"id": 257089, "name": "SEER", "code": null}, {"id": 370721, "name": "SNM-skip", "code": null}, {"id": 369979, "name": "SOM-CNN", "code": null}, {"id": 369001, "name": "SPIDER2", "code": null}, {"id": 257097, "name": "SPPNet", "code": null}, {"id": 368098, "name": "SRU++ Large", "code": null}, {"id": 54159, "name": "SSD", "code": null}, {"id": 368348, "name": "ST-MoE", "code": null}, {"id": 371974, "name": "STORM-B/8", "code": null}, {"id": 371873, "name": "SVM-CNN", "code": null}, {"id": 256994, "name": "Samuel Neural Checkers", "code": null}, {"id": 368005, "name": "Sandwich Transformer", "code": null}, {"id": 369176, "name": "SciBERT", "code": null}, {"id": 371939, "name": "Seed1.5-VL", "code": null}, {"id": 368108, "name": "Segatron-XL large, M=384 + HCP", "code": null}, {"id": 369040, "name": "Segment Anything Model", "code": null}, {"id": 305970, "name": "Self Organizing System", "code": null}, {"id": 307046, "name": "Seq2Seq LSTM", "code": null}, {"id": 369968, "name": "SexNet classification", "code": null}, {"id": 369967, "name": "SexNet compression", "code": null}, {"id": 369969, "name": "Siamese-TDNN", "code": null}, {"id": 371534, "name": "SigLIP 400M", "code": null}, {"id": 368349, "name": "Skywork-13B", "code": null}, {"id": 306046, "name": "SmooCT", "code": null}, {"id": 268378, "name": "Sparse all-MLP", "code": null}, {"id": 369529, "name": "Speaker-independent vowel classification", "code": null}, {"id": 343968, "name": "Stable Diffusion (LDM-KL-8-G)", "code": null}, {"id": 368130, "name": "StarCoder", "code": null}, {"id": 306183, "name": "Statement Curriculum Learning", "code": null}, {"id": 371889, "name": "StyleGAN", "code": null}, {"id": 371898, "name": "StyleGAN2", "code": null}, {"id": 371868, "name": "StyleGAN3-R", "code": null}, {"id": 371879, "name": "StyleGAN3-T", "code": null}, {"id": 369004, "name": "Super-vector coding", "code": null}, {"id": 305994, "name": "Support Vector Machines", "code": null}, {"id": 257078, "name": "Switch", "code": null}, {"id": 256997, "name": "System 11", "code": null}, {"id": 257059, "name": "T5-11B", "code": null}, {"id": 257058, "name": "T5-3B", "code": null}, {"id": 371867, "name": "TA-CNN", "code": null}, {"id": 257007, "name": "TD-Gammon", "code": null}, {"id": 368046, "name": "TaLK Convolution", "code": null}, {"id": 371840, "name": "Telechat2-115B", "code": null}, {"id": 368107, "name": "Tensor-Transformer(1core)+PN (WT103)", "code": null}, {"id": 256993, "name": "Theseus", "code": null}, {"id": 306001, "name": "Thumbs Up?", "code": null}, {"id": 369140, "name": "TrOCR", "code": null}, {"id": 368110, "name": "Tranception", "code": null}, {"id": 257106, "name": "TransE", "code": null}, {"id": 371241, "name": "Transformer (2017)", "code": null}, {"id": 369555, "name": "Transformer (Adaptive Input Embeddings) WT103", "code": null}, {"id": 257098, "name": "Transformer local-attention (NesT-B)", "code": null}, {"id": 369511, "name": "Transformer-XL (257M)", "code": null}, {"id": 368010, "name": "Transformer-XL DeFINE (141M)", "code": null}, {"id": 368106, "name": "TransformerXL + spectrum control", "code": null}, {"id": 371864, "name": "Translation-invariant MLP", "code": null}, {"id": 368087, "name": "TrellisNet", "code": null}, {"id": 368060, "name": "Turing-NLG", "code": null}, {"id": 371877, "name": "Two Stage Feature Extraction (MNIST)", "code": null}, {"id": 371876, "name": "U-Net", "code": null}, {"id": 306190, "name": "UL2", "code": null}, {"id": 371949, "name": "Unicorn", "code": null}, {"id": 365990, "name": "UnifiedQA", "code": null}, {"id": 366988, "name": "Unsupervised High-level Feature Learner", "code": null}, {"id": 369034, "name": "VALL-E", "code": null}, {"id": 368057, "name": "VD-LSTM+REAL Large", "code": null}, {"id": 371883, "name": "VGG-Face", "code": null}, {"id": 257023, "name": "VGG16", "code": null}, {"id": 306053, "name": "VGG19", "code": null}, {"id": 371814, "name": "VILA-13B", "code": null}, {"id": 371528, "name": "VILA1.5-13B", "code": null}, {"id": 368031, "name": "Variational (untied weights, MC) LSTM (Large)", "code": null}, {"id": 369512, "name": "Vector Space Model", "code": null}, {"id": 369193, "name": "ViT-22B", "code": null}, {"id": 257085, "name": "ViT-G/14", "code": null}, {"id": 368334, "name": "ViT-G/14 (LiT)", "code": null}, {"id": 306146, "name": "ViT-Huge/14", "code": null}, {"id": 369977, "name": "Weight Decay", "code": null}, {"id": 349174, "name": "Whisper", "code": null}, {"id": 257018, "name": "Word2Vec (large)", "code": null}, {"id": 306040, "name": "Word2Vec (small)", "code": null}, {"id": 369192, "name": "XGLM-7.5B", "code": null}, {"id": 369168, "name": "XLM-RoBERTa", "code": null}, {"id": 306169, "name": "XLMR-XXL", "code": null}, {"id": 240142, "name": "Xception", "code": null}, {"id": 369177, "name": "YOLOX-X", "code": null}, {"id": 257041, "name": "YOLOv3", "code": null}, {"id": 368740, "name": "Yi-34B", "code": null}, {"id": 370141, "name": "Yi-Large", "code": null}, {"id": 367473, "name": "YouTube Video Recommendation System", "code": null}, {"id": 257093, "name": "Yuan 1.0", "code": null}, {"id": 367497, "name": "Zidong Taichu", "code": null}, {"id": 257006, "name": "Zip CNN", "code": null}, {"id": 368064, "name": "aLSTM(depth-2)+RecurrentPolicy (WT2)", "code": null}, {"id": 306179, "name": "data2vec (language)", "code": null}, {"id": 306178, "name": "data2vec (speech)", "code": null}, {"id": 306177, "name": "data2vec (vision)", "code": null}, {"id": 372168, "name": "dots.llm1", "code": null}, {"id": 369151, "name": "eDiff-I", "code": null}, {"id": 368084, "name": "genCNN + dyn eval", "code": null}, {"id": 371847, "name": "gpt-oss-120b", "code": null}, {"id": 371855, "name": "gpt-oss-20b", "code": null}, {"id": 371880, "name": "iCCCP", "code": null}, {"id": 369138, "name": "mPLUG-Owl2", "code": null}, {"id": 369155, "name": "mT0-13B", "code": null}, {"id": 368329, "name": "mT5-XXL", "code": null}, {"id": 371483, "name": "nekomata-14b", "code": null}, {"id": 371970, "name": "trRosetta", "code": null}, {"id": 257073, "name": "wave2vec 2.0 LARGE", "code": null}, {"id": 306017, "name": "\u03bb-WASP", "code": null}]}}, "origins": [{"id": 8834, "title": "Parameter, Compute and Data Trends in Machine Learning", "descriptionSnapshot": "We update this chart with the latest available data from our source every month.\n\nThe authors selected the AI systems for inclusion based on the following necessary criteria:\n\u2014 Have an explicit learning component\n\u2014 Showcase experimental results\n\u2014 Advance the state of the art\n\nIn addition, the systems had to meet at least one of the following notability criteria:\n\u2014 Paper has more than 1000 citations\n\u2014 Historical importance\n\u2014 Important state-of-the-art advance\n\u2014 Deployed in a notable context\n\nThe authors note that: \"For new models (from 2020 onward) it is harder to assess these criteria, so we fall back to a subjective selection. We refer to models meeting our selection criteria as 'milestone models.\"\n", "producer": "Epoch AI", "citationFull": "Epoch AI, \u2018Parameter, Compute and Data Trends in Machine Learning\u2019. Published online at epochai.org. Retrieved from: \u2018https://epoch.ai/data/epochdb/visualization\u2019 [online resource]", "urlMain": "https://epoch.ai/mlinputs/visualization", "urlDownload": "https://epoch.ai/data/epochdb/notable_ai_models.csv", "dateAccessed": "2025-10-09", "datePublished": "2025", "license": {"url": "https://creativecommons.org/licenses/by/4.0/", "name": "CC BY 4.0"}}]}