{"id": 1204041, "name": "Training dataset size", "unit": "unique datapoints", "createdAt": "2026-02-27T14:42:59.000Z", "updatedAt": "2026-03-08T06:32:17.000Z", "coverage": "", "timespan": "", "datasetId": 7001, "columnOrder": 0, "shortName": "training_dataset_size__total", "catalogPath": "grapher/artificial_intelligence/2025-03-12/epoch_regressions/epoch_regressions#training_dataset_size__total", "descriptionShort": "The number of unique data points used to train the model. Each domain has a specific data point unit; for example, for vision it is images, for language it is words, and for games it is timesteps. This means systems can only be compared directly within the same domain.", "type": "float", "dataChecksum": "9891595264868458499", "metadataChecksum": "161374728122921696", "datasetName": "Parameter, Compute and Data Trends in Machine Learning - Regressions", "updatePeriodDays": 31, "datasetVersion": "2025-03-12", "nonRedistributable": false, "display": {"unit": "unique datapoints", "zeroDay": "1949-01-01", "yearIsDay": true, "numDecimalPlaces": 0}, "schemaVersion": 2, "processingLevel": "major", "presentation": {"topicTagsLinks": ["Artificial Intelligence"]}, "descriptionKey": ["Training data size measures the volume of unique examples used to train an AI model during its learning phase. It represents the total number of distinct data points the model learns from, counted only once regardless of how many times they're seen during training.", "To understand this concept, imagine teaching someone to identify different bird species. Each unique bird photo you show them is one piece of training data. If you show 100 different photos, your training data size is 100, even if you review those same photos multiple times.", "Since datasets vary by domain, there's no universal unit for measuring size. Text models might count tokens, image models count pictures, and video models count clips. Epoch AI typically uses the smallest unit that triggers a model update during training. For language models that predict the next word, this would be individual tokens.", "Training data size directly impacts model performance. Larger datasets enable deeper learning and more nuanced pattern recognition, allowing models to identify subtle distinctions and handle diverse real-world scenarios more effectively."], "dimensions": {"years": {"values": [{"id": 547}, {"id": 2250}, {"id": 2922}, {"id": 3986}, {"id": 4106}, {"id": 4198}, {"id": 4899}, {"id": 4929}, {"id": 6513}, {"id": 7121}, {"id": 9070}, {"id": 9739}, {"id": 11413}, {"id": 11893}, {"id": 12661}, {"id": 12874}, {"id": 13516}, {"id": 13740}, {"id": 13787}, {"id": 14035}, {"id": 14044}, {"id": 14457}, {"id": 14610}, {"id": 14940}, {"id": 14944}, {"id": 15126}, {"id": 15142}, {"id": 15248}, {"id": 15279}, {"id": 15675}, {"id": 15826}, {"id": 15979}, {"id": 15994}, {"id": 16039}, {"id": 16236}, {"id": 16283}, {"id": 16403}, {"id": 16442}, {"id": 16771}, {"id": 17044}, {"id": 17131}, {"id": 17320}, {"id": 17335}, {"id": 17350}, {"id": 17562}, {"id": 17836}, {"id": 17850}, {"id": 18201}, {"id": 18263}, {"id": 18414}, {"id": 18444}, {"id": 18959}, {"id": 19266}, {"id": 19334}, {"id": 19505}, {"id": 19796}, {"id": 20266}, {"id": 20423}, {"id": 20459}, {"id": 20629}, {"id": 20672}, {"id": 20851}, {"id": 20986}, {"id": 21017}, {"id": 21153}, {"id": 21156}, {"id": 21356}, {"id": 21449}, {"id": 21484}, {"id": 21520}, {"id": 21735}, {"id": 21891}, {"id": 21892}, {"id": 22012}, {"id": 22080}, {"id": 22133}, {"id": 22158}, {"id": 22240}, {"id": 22282}, {"id": 22412}, {"id": 22443}, {"id": 22445}, {"id": 22537}, {"id": 22548}, {"id": 22747}, {"id": 22763}, {"id": 22814}, {"id": 22823}, {"id": 22841}, {"id": 22905}, {"id": 22920}, {"id": 22956}, {"id": 23164}, {"id": 23203}, {"id": 23218}, {"id": 23262}, {"id": 23283}, {"id": 23345}, {"id": 23346}, {"id": 23347}, {"id": 23391}, {"id": 23521}, {"id": 23588}, {"id": 23649}, {"id": 23664}, {"id": 23691}, {"id": 23714}, {"id": 23720}, {"id": 23725}, {"id": 23728}, {"id": 23729}, {"id": 23741}, {"id": 23804}, {"id": 23874}, {"id": 23892}, {"id": 23900}, {"id": 23901}, {"id": 23909}, {"id": 23912}, {"id": 23914}, {"id": 23936}, {"id": 23958}, {"id": 23984}, {"id": 23987}, {"id": 23993}, {"id": 23997}, {"id": 24000}, {"id": 24001}, {"id": 24006}, {"id": 24051}, {"id": 24054}, {"id": 24072}, {"id": 24073}, {"id": 24077}, {"id": 24092}, {"id": 24096}, {"id": 24106}, {"id": 24142}, {"id": 24161}, {"id": 24181}, {"id": 24201}, {"id": 24225}, {"id": 24243}, {"id": 24260}, {"id": 24263}, {"id": 24271}, {"id": 24312}, {"id": 24324}, {"id": 24348}, {"id": 24379}, {"id": 24406}, {"id": 24432}, {"id": 24441}, {"id": 24447}, {"id": 24449}, {"id": 24454}, {"id": 24455}, {"id": 24497}, {"id": 24505}, {"id": 24524}, {"id": 24525}, {"id": 24534}, {"id": 24542}, {"id": 24591}, {"id": 24599}, {"id": 24638}, {"id": 24643}, {"id": 24649}, {"id": 24705}, {"id": 24722}, {"id": 24726}, {"id": 24731}, {"id": 24733}, {"id": 24740}, {"id": 24751}, {"id": 24752}, {"id": 24772}, {"id": 24779}, {"id": 24780}, {"id": 24781}, {"id": 24782}, {"id": 24791}, {"id": 24792}, {"id": 24796}, {"id": 24807}, {"id": 24818}, {"id": 24820}, {"id": 24828}, {"id": 24830}, {"id": 24842}, {"id": 24843}, {"id": 24859}, {"id": 24868}, {"id": 24910}, {"id": 24925}, {"id": 24943}, {"id": 24953}, {"id": 24964}, {"id": 24981}, {"id": 24988}, {"id": 24994}, {"id": 24995}, {"id": 24999}, {"id": 25000}, {"id": 25004}, {"id": 25017}, {"id": 25020}, {"id": 25027}, {"id": 25035}, {"id": 25038}, {"id": 25042}, {"id": 25047}, {"id": 25055}, {"id": 25062}, {"id": 25064}, {"id": 25077}, {"id": 25084}, {"id": 25085}, {"id": 25094}, {"id": 25100}, {"id": 25105}, {"id": 25127}, {"id": 25137}, {"id": 25138}, {"id": 25140}, {"id": 25142}, {"id": 25150}, {"id": 25161}, {"id": 25171}, {"id": 25175}, {"id": 25231}, {"id": 25233}, {"id": 25237}, {"id": 25239}, {"id": 25247}, {"id": 25282}, {"id": 25299}, {"id": 25323}, {"id": 25324}, {"id": 25343}, {"id": 25353}, {"id": 25357}, {"id": 25370}, {"id": 25385}, {"id": 25392}, {"id": 25427}, {"id": 25441}, {"id": 25443}, {"id": 25461}, {"id": 25462}, {"id": 25468}, {"id": 25472}, {"id": 25485}, {"id": 25489}, {"id": 25505}, {"id": 25510}, {"id": 25517}, {"id": 25520}, {"id": 25521}, {"id": 25539}, {"id": 25547}, {"id": 25567}, {"id": 25575}, {"id": 25597}, {"id": 25598}, {"id": 25611}, {"id": 25624}, {"id": 25651}, {"id": 25664}, {"id": 25673}, {"id": 25676}, {"id": 25682}, {"id": 25688}, {"id": 25700}, {"id": 25714}, {"id": 25717}, {"id": 25718}, {"id": 25721}, {"id": 25727}, {"id": 25730}, {"id": 25731}, {"id": 25734}, {"id": 25748}, {"id": 25751}, {"id": 25813}, {"id": 25826}, {"id": 25834}, {"id": 25835}, {"id": 25841}, {"id": 25862}, {"id": 25868}, {"id": 25871}, {"id": 25875}, {"id": 25880}, {"id": 25881}, {"id": 25883}, {"id": 25889}, {"id": 25895}, {"id": 25897}, {"id": 25903}, {"id": 25904}, {"id": 25905}, {"id": 25913}, {"id": 25919}, {"id": 25924}, {"id": 25946}, {"id": 25959}, {"id": 25969}, {"id": 25970}, {"id": 25971}, {"id": 25975}, {"id": 25976}, {"id": 25983}, {"id": 25990}, {"id": 26002}, {"id": 26003}, {"id": 26008}, {"id": 26014}, {"id": 26015}, {"id": 26049}, {"id": 26051}, {"id": 26059}, {"id": 26074}, {"id": 26078}, {"id": 26080}, {"id": 26113}, {"id": 26147}, {"id": 26150}, {"id": 26176}, {"id": 26207}, {"id": 26225}, {"id": 26226}, {"id": 26227}, {"id": 26259}, {"id": 26266}, {"id": 26267}, {"id": 26281}, {"id": 26283}, {"id": 26289}, {"id": 26291}, {"id": 26297}, {"id": 26302}, {"id": 26307}, {"id": 26308}, {"id": 26312}, {"id": 26337}, {"id": 26341}, {"id": 26352}, {"id": 26357}, {"id": 26361}, {"id": 26406}, {"id": 26421}, {"id": 26428}, {"id": 26437}, {"id": 26443}, {"id": 26445}, {"id": 26456}, {"id": 26457}, {"id": 26458}, {"id": 26459}, {"id": 26469}, {"id": 26472}, {"id": 26476}, {"id": 26483}, {"id": 26485}, {"id": 26505}, {"id": 26507}, {"id": 26512}, {"id": 26515}, {"id": 26520}, {"id": 26524}, {"id": 26526}, {"id": 26543}, {"id": 26544}, {"id": 26546}, {"id": 26550}, {"id": 26560}, {"id": 26574}, {"id": 26581}, {"id": 26582}, {"id": 26587}, {"id": 26597}, {"id": 26600}, {"id": 26601}, {"id": 26602}, {"id": 26616}, {"id": 26620}, {"id": 26623}, {"id": 26625}, {"id": 26637}, {"id": 26639}, {"id": 26644}, {"id": 26646}, {"id": 26647}, {"id": 26651}, {"id": 26654}, {"id": 26682}, {"id": 26685}, {"id": 26689}, {"id": 26695}, {"id": 26700}, {"id": 26702}, {"id": 26703}, {"id": 26710}, {"id": 26719}, {"id": 26722}, {"id": 26723}, {"id": 26731}, {"id": 26742}, {"id": 26745}, {"id": 26750}, {"id": 26756}, {"id": 26758}, {"id": 26766}, {"id": 26781}, {"id": 26784}, {"id": 26792}, {"id": 26794}, {"id": 26800}, {"id": 26809}, {"id": 26811}, {"id": 26819}, {"id": 26826}, {"id": 26827}, {"id": 26835}, {"id": 26840}, {"id": 26842}, {"id": 26848}, {"id": 26849}, {"id": 26854}, {"id": 26864}, {"id": 26865}, {"id": 26876}, {"id": 26878}, {"id": 26884}, {"id": 26919}, {"id": 26926}, {"id": 26939}, {"id": 26946}, {"id": 26955}, {"id": 26968}, {"id": 26969}, {"id": 26976}, {"id": 26980}, {"id": 26982}, {"id": 26984}, {"id": 26994}, {"id": 26997}, {"id": 27000}, {"id": 27015}, {"id": 27024}, {"id": 27032}, {"id": 27037}, {"id": 27042}, {"id": 27043}, {"id": 27054}, {"id": 27057}, {"id": 27067}, {"id": 27068}, {"id": 27082}, {"id": 27091}, {"id": 27101}, {"id": 27106}, {"id": 27113}, {"id": 27115}, {"id": 27116}, {"id": 27122}, {"id": 27126}, {"id": 27131}, {"id": 27143}, {"id": 27156}, {"id": 27157}, {"id": 27163}, {"id": 27164}, {"id": 27165}, {"id": 27167}, {"id": 27170}, {"id": 27186}, {"id": 27205}, {"id": 27213}, {"id": 27214}, {"id": 27226}, {"id": 27234}, {"id": 27263}, {"id": 27267}, {"id": 27268}, {"id": 27269}, {"id": 27276}, {"id": 27292}, {"id": 27297}, {"id": 27298}, {"id": 27307}, {"id": 27309}, {"id": 27311}, {"id": 27326}, {"id": 27327}, {"id": 27330}, {"id": 27333}, {"id": 27334}, {"id": 27335}, {"id": 27338}, {"id": 27339}, {"id": 27346}, {"id": 27360}, {"id": 27361}, {"id": 27362}, {"id": 27368}, {"id": 27373}, {"id": 27382}, {"id": 27427}, {"id": 27435}, {"id": 27456}, {"id": 27479}, {"id": 27481}, {"id": 27501}, {"id": 27516}, {"id": 27521}, {"id": 27526}, {"id": 27533}, {"id": 27534}, {"id": 27551}, {"id": 27558}, {"id": 27561}, {"id": 27569}, {"id": 27597}, {"id": 27603}, {"id": 27611}, {"id": 27612}, {"id": 27653}, {"id": 27655}, {"id": 27656}, {"id": 27660}, {"id": 27670}, {"id": 27688}, {"id": 27694}, {"id": 27703}, {"id": 27733}, {"id": 27736}, {"id": 27751}, {"id": 27778}, {"id": 27780}, {"id": 27792}, {"id": 27828}, {"id": 27833}, {"id": 27841}, {"id": 27853}, {"id": 27858}, {"id": 27877}, {"id": 27889}, {"id": 27906}, {"id": 27948}, {"id": 27950}, {"id": 27954}, {"id": 27961}, {"id": 27964}, {"id": 27975}, {"id": 28002}, {"id": 28006}, {"id": 28017}, {"id": 28023}, {"id": 28031}, {"id": 28041}, {"id": 28082}, {"id": 28123}]}, "entities": {"values": [{"id": 370905, "name": "1.3x/year between 1950\u20132010", "code": null}, {"id": 256993, "name": "Theseus", "code": null}, {"id": 305970, "name": "Self Organizing System", "code": null}, {"id": 257002, "name": "Perceptron Mark I", "code": null}, {"id": 354868, "name": "Pattern recognition and reading by machine", "code": null}, {"id": 369024, "name": "Perceptron (1960)", "code": null}, {"id": 256995, "name": "ADALINE", "code": null}, {"id": 369543, "name": "Linear Decision Functions", "code": null}, {"id": 305972, "name": "MADALINE I", "code": null}, {"id": 369517, "name": "LTE speaker verification system", "code": null}, {"id": 305976, "name": "GLEE", "code": null}, {"id": 369537, "name": "Piecewise linear model", "code": null}, {"id": 305980, "name": "Cognitron", "code": null}, {"id": 256996, "name": "Neocognitron", "code": null}, {"id": 305982, "name": "Kohonen network", "code": null}, {"id": 305984, "name": "ASE+ACE", "code": null}, {"id": 369510, "name": "Hierarchical Cognitron", "code": null}, {"id": 367563, "name": "Error Propagation", "code": null}, {"id": 369560, "name": "Distributed representation NN", "code": null}, {"id": 372328, "name": "MLP with back-propagation", "code": null}, {"id": 368075, "name": "NetTalk (dictionary)", "code": null}, {"id": 368083, "name": "NetTalk (transcription)", "code": null}, {"id": 371864, "name": "Translation-invariant MLP", "code": null}, {"id": 369973, "name": "MLN-ASR", "code": null}, {"id": 369527, "name": "MLP baggage detector", "code": null}, {"id": 305988, "name": "Q-learning", "code": null}, {"id": 372312, "name": "Handwritten digit recognition network", "code": null}, {"id": 369529, "name": "Speaker-independent vowel classification", "code": null}, {"id": 257006, "name": "Zip CNN", "code": null}, {"id": 371841, "name": "NETtalk reimplementation", "code": null}, {"id": 369990, "name": "Bankruptcy-NN", "code": null}, {"id": 369986, "name": "ISR network", "code": null}, {"id": 369968, "name": "SexNet classification", "code": null}, {"id": 369967, "name": "SexNet compression", "code": null}, {"id": 369526, "name": "RAAM", "code": null}, {"id": 369977, "name": "Weight Decay", "code": null}, {"id": 257007, "name": "TD-Gammon", "code": null}, {"id": 369984, "name": "Golem", "code": null}, {"id": 369525, "name": "Cancer drug mechanism prediction", "code": null}, {"id": 369995, "name": "Boosting", "code": null}, {"id": 305992, "name": "IBM-5", "code": null}, {"id": 369969, "name": "Siamese-TDNN", "code": null}, {"id": 369966, "name": "ANN Eye Tracker", "code": null}, {"id": 369972, "name": "Ceramic-MLP", "code": null}, {"id": 369964, "name": "JPMAX", "code": null}, {"id": 369991, "name": "Mixture of linear models", "code": null}, {"id": 369992, "name": "NeuroChess", "code": null}, {"id": 369996, "name": "Predictive Coding NN", "code": null}, {"id": 305994, "name": "Support Vector Machines", "code": null}, {"id": 369970, "name": "LISSOM", "code": null}, {"id": 369993, "name": "MUSIC perceptron", "code": null}, {"id": 256997, "name": "System 11", "code": null}, {"id": 370523, "name": "AdaBoost.M2 Digit Recognition", "code": null}, {"id": 369979, "name": "SOM-CNN", "code": null}, {"id": 367568, "name": "Bidirectional RNN", "code": null}, {"id": 256998, "name": "LSTM", "code": null}, {"id": 245542, "name": "LeNet-5", "code": null}, {"id": 354866, "name": "LSTM with forget gates", "code": null}, {"id": 371856, "name": "RECONTRA-categorized", "code": null}, {"id": 371849, "name": "RECONTRA-uncategorized", "code": null}, {"id": 305998, "name": "IBM Model 4", "code": null}, {"id": 369997, "name": "Neural LM", "code": null}, {"id": 369980, "name": "PoE MNIST", "code": null}, {"id": 367484, "name": "Gradient Boosting Machine", "code": null}, {"id": 257009, "name": "Decision tree (classification)", "code": null}, {"id": 306001, "name": "Thumbs Up?", "code": null}, {"id": 371812, "name": "NPLM (AP News)", "code": null}, {"id": 371816, "name": "NPLM (Brown)", "code": null}, {"id": 369971, "name": "Invariant CNN", "code": null}, {"id": 369988, "name": "LMICA", "code": null}, {"id": 369987, "name": "Hierarchical LM", "code": null}, {"id": 367524, "name": "Histograms of Oriented Gradients", "code": null}, {"id": 370522, "name": "RankNet", "code": null}, {"id": 368126, "name": "TFE SVM", "code": null}, {"id": 371873, "name": "SVM-CNN", "code": null}, {"id": 367496, "name": "Spatial Pyramid Matching", "code": null}, {"id": 306014, "name": "Deep Belief Nets", "code": null}, {"id": 370719, "name": "Dimensionality Reduction", "code": null}, {"id": 367564, "name": "Local Binary Patterns for facial recognition", "code": null}, {"id": 367555, "name": "Greedy layer-wise DNN training", "code": null}, {"id": 369975, "name": "KN-LM", "code": null}, {"id": 369976, "name": "SB-LM", "code": null}, {"id": 367301, "name": "BLSTM for handwriting (1)", "code": null}, {"id": 368131, "name": "Enhanced Neighborhood-Based Filtering", "code": null}, {"id": 368136, "name": "BLSTM for handwriting (2)", "code": null}, {"id": 367308, "name": "Deep Multitask NLP Network", "code": null}, {"id": 306020, "name": "Denoising Autoencoders", "code": null}, {"id": 369564, "name": "HLBL", "code": null}, {"id": 371852, "name": "GNN", "code": null}, {"id": 368330, "name": "RBM Image Classifier", "code": null}, {"id": 257011, "name": "GPU DBNs", "code": null}, {"id": 367462, "name": "MatrixFac for Recommenders", "code": null}, {"id": 371877, "name": "Two Stage Feature Extraction (MNIST)", "code": null}, {"id": 371881, "name": "LCNP LabelMe", "code": null}, {"id": 371878, "name": "LCNP MNIST", "code": null}, {"id": 371874, "name": "LCNP NORB", "code": null}, {"id": 371887, "name": "2.8x/year between 2010\u20132025", "code": null}, {"id": 367469, "name": "Stacked Denoising Autoencoders", "code": null}, {"id": 257013, "name": "Feedforward NN", "code": null}, {"id": 371880, "name": "iCCCP", "code": null}, {"id": 306031, "name": "ReLU (LFW)", "code": null}, {"id": 306030, "name": "ReLU (NORB)", "code": null}, {"id": 371871, "name": "Pooling CNN (Caltech 101)", "code": null}, {"id": 371872, "name": "Pooling CNN (NORB)", "code": null}, {"id": 369528, "name": "RNN LM", "code": null}, {"id": 367483, "name": "Deep rectifier networks", "code": null}, {"id": 370720, "name": "Deep Autoencoders", "code": null}, {"id": 369512, "name": "Vector Space Model", "code": null}, {"id": 369553, "name": "Recursive Neural Network", "code": null}, {"id": 371885, "name": "High Performance CNN (NORB)", "code": null}, {"id": 371842, "name": "CNN Committee (MNIST)", "code": null}, {"id": 371862, "name": "CNN Committee (NIST)", "code": null}, {"id": 367475, "name": "Adaptive Subgrad", "code": null}, {"id": 371863, "name": "CNN committee (traffic sign)", "code": null}, {"id": 306033, "name": "NLP from scratch", "code": null}, {"id": 306036, "name": "Dropout (CIFAR)", "code": null}, {"id": 306037, "name": "Dropout (ImageNet)", "code": null}, {"id": 257017, "name": "Dropout (MNIST)", "code": null}, {"id": 366988, "name": "Unsupervised High-level Feature Learner", "code": null}, {"id": 367307, "name": "Context-dependent RNN", "code": null}, {"id": 369534, "name": "LSTM LM", "code": null}, {"id": 240132, "name": "AlexNet", "code": null}, {"id": 368007, "name": "RNN+LDA+KN5+cache", "code": null}, {"id": 367506, "name": "Bayesian automated hyperparameter tuning", "code": null}, {"id": 371854, "name": "DNN EM segmentation", "code": null}, {"id": 369538, "name": "DistBelief Speech", "code": null}, {"id": 369541, "name": "DistBelief Vision", "code": null}, {"id": 369539, "name": "DistBelief NNLM", "code": null}, {"id": 369989, "name": "Multilingual DNN", "code": null}, {"id": 371980, "name": "Hierarchical Scene Labeling (Stanford Background)", "code": null}, {"id": 369513, "name": "RCTM", "code": null}, {"id": 369535, "name": "RNTN", "code": null}, {"id": 257018, "name": "Word2Vec (large)", "code": null}, {"id": 306040, "name": "Word2Vec (small)", "code": null}, {"id": 257019, "name": "Visualizing CNNs", "code": null}, {"id": 369963, "name": "DeViSE", "code": null}, {"id": 257106, "name": "TransE", "code": null}, {"id": 369344, "name": "RNN for 1B words", "code": null}, {"id": 306043, "name": "Network in Network", "code": null}, {"id": 240135, "name": "DQN", "code": null}, {"id": 306044, "name": "Image generation", "code": null}, {"id": 306049, "name": "GloVe (32B)", "code": null}, {"id": 306048, "name": "GloVe (6B)", "code": null}, {"id": 306051, "name": "HyperNEAT", "code": null}, {"id": 369565, "name": "Paragraph Vector", "code": null}, {"id": 369562, "name": "AdaRNN", "code": null}, {"id": 369556, "name": "Dropout: SVHN", "code": null}, {"id": 367502, "name": "Two-stream ConvNets for action recognition", "code": null}, {"id": 257021, "name": "GANs", "code": null}, {"id": 257097, "name": "SPPNet", "code": null}, {"id": 369994, "name": "Fragment embedding", "code": null}, {"id": 367495, "name": "DeepFace", "code": null}, {"id": 306047, "name": "Multiresolution CNN", "code": null}, {"id": 371825, "name": "ACF-WIDER", "code": null}, {"id": 371884, "name": "NPD", "code": null}, {"id": 257022, "name": "RNNsearch-50*", "code": null}, {"id": 257023, "name": "VGG16", "code": null}, {"id": 306053, "name": "VGG19", "code": null}, {"id": 307046, "name": "Seq2Seq LSTM", "code": null}, {"id": 368042, "name": "SPN-4+KN5", "code": null}, {"id": 257026, "name": "GoogLeNet / InceptionV1", "code": null}, {"id": 367494, "name": "Deeply-supervised nets", "code": null}, {"id": 368335, "name": "Spatially-Sparse CNN", "code": null}, {"id": 306054, "name": "LRCN", "code": null}, {"id": 369557, "name": "SC-NLM", "code": null}, {"id": 367488, "name": "Cascaded LNet-ANet", "code": null}, {"id": 371867, "name": "TA-CNN", "code": null}, {"id": 370721, "name": "SNM-skip", "code": null}, {"id": 368346, "name": "Fractional Max-Pooling", "code": null}, {"id": 257024, "name": "ADAM (CIFAR-10)", "code": null}, {"id": 371883, "name": "VGG-Face", "code": null}, {"id": 257025, "name": "MSRA (C, PReLU)", "code": null}, {"id": 306056, "name": "DQN-2015", "code": null}, {"id": 368084, "name": "genCNN + dyn eval", "code": null}, {"id": 371810, "name": "TC-DNN-BLSTM-DNN", "code": null}, {"id": 306058, "name": "Fast R-CNN", "code": null}, {"id": 371876, "name": "U-Net", "code": null}, {"id": 306060, "name": "Faster R-CNN", "code": null}, {"id": 371875, "name": "CFSS", "code": null}, {"id": 306062, "name": "BatchNorm", "code": null}, {"id": 371870, "name": "Deep CNN + COTS", "code": null}, {"id": 371859, "name": "DCNN", "code": null}, {"id": 306063, "name": "BPE", "code": null}, {"id": 257027, "name": "AlphaGo Fan", "code": null}, {"id": 371882, "name": "SAF R-CNN", "code": null}, {"id": 371824, "name": "3DDFA", "code": null}, {"id": 306064, "name": "Inception v3", "code": null}, {"id": 54159, "name": "SSD", "code": null}, {"id": 371899, "name": "ResNet-101 (ImageNet)", "code": null}, {"id": 306065, "name": "ResNet-110 (CIFAR-10)", "code": null}, {"id": 257028, "name": "ResNet-152 (ImageNet)", "code": null}, {"id": 306067, "name": "Advantage Learning", "code": null}, {"id": 368031, "name": "Variational (untied weights, MC) LSTM (Large)", "code": null}, {"id": 257029, "name": "AlphaGo Lee", "code": null}, {"id": 306068, "name": "A3C FF hs", "code": null}, {"id": 306070, "name": "Inception-ResNet-V2", "code": null}, {"id": 306069, "name": "Inceptionv4", "code": null}, {"id": 306071, "name": "SqueezeNet", "code": null}, {"id": 257109, "name": "Named Entity Recognition model", "code": null}, {"id": 371903, "name": "Template Adaptation\n", "code": null}, {"id": 368134, "name": "Gated HORNN (3rd order)", "code": null}, {"id": 371902, "name": "LRR-4X", "code": null}, {"id": 371518, "name": "PixelCNN", "code": null}, {"id": 257105, "name": "R-FCN", "code": null}, {"id": 371844, "name": "CCL", "code": null}, {"id": 368352, "name": "SimpleNet", "code": null}, {"id": 371897, "name": "LF-MMI", "code": null}, {"id": 371904, "name": "MS-ensemble-speech-recognition", "code": null}, {"id": 367554, "name": "WaveNet", "code": null}, {"id": 367493, "name": "ResNet-1001", "code": null}, {"id": 366658, "name": "ResNet-200", "code": null}, {"id": 306076, "name": "Wide Residual Network", "code": null}, {"id": 257030, "name": "GNMT", "code": null}, {"id": 368076, "name": "Pointer Sentinel-LSTM (medium)", "code": null}, {"id": 240142, "name": "Xception", "code": null}, {"id": 371896, "name": "GAWWN", "code": null}, {"id": 369001, "name": "SPIDER2", "code": null}, {"id": 368057, "name": "VD-LSTM+REAL Large", "code": null}, {"id": 369179, "name": "BIDAF", "code": null}, {"id": 370527, "name": "NAS with base 8 and shared embeddings", "code": null}, {"id": 257031, "name": "NASv3 (CIFAR-10)", "code": null}, {"id": 371895, "name": "DLDL (PASCAL)", "code": null}, {"id": 371851, "name": "DTN (Domain Transfer Network)", "code": null}, {"id": 371858, "name": "DAC-CSR", "code": null}, {"id": 372332, "name": "ResNeXt-101 (64\u00d74d)", "code": null}, {"id": 306077, "name": "ResNeXt-50", "code": null}, {"id": 306078, "name": "PolyNet", "code": null}, {"id": 367501, "name": "Image-to-image cGAN", "code": null}, {"id": 306080, "name": "PointNet", "code": null}, {"id": 371890, "name": "HR-ResNet101", "code": null}, {"id": 371809, "name": "3DMM-CNN", "code": null}, {"id": 371866, "name": "EnhanceNet", "code": null}, {"id": 306081, "name": "YOLOv2", "code": null}, {"id": 257034, "name": "DeepStack", "code": null}, {"id": 368323, "name": "OR-WideResNet", "code": null}, {"id": 370178, "name": "MoE-Multi", "code": null}, {"id": 367503, "name": "DnCNN", "code": null}, {"id": 367536, "name": "Prototypical networks", "code": null}, {"id": 306082, "name": "Mask R-CNN", "code": null}, {"id": 306083, "name": "MobileNet", "code": null}, {"id": 367492, "name": "DeepLab (2017)", "code": null}, {"id": 369153, "name": "Mnemonic Reader", "code": null}, {"id": 367562, "name": "SRGAN", "code": null}, {"id": 367520, "name": "Inflated 3D ConvNet", "code": null}, {"id": 306084, "name": "PointNet++", "code": null}, {"id": 369152, "name": "Reading Twice for NLU", "code": null}, {"id": 371241, "name": "Transformer (2017)", "code": null}, {"id": 306085, "name": "HRA", "code": null}, {"id": 306086, "name": "DeepLabV3", "code": null}, {"id": 306087, "name": "NoisyNet-Dueling", "code": null}, {"id": 306088, "name": "ShuffleNet v1", "code": null}, {"id": 257037, "name": "JFT", "code": null}, {"id": 368032, "name": "AWD-LSTM", "code": null}, {"id": 306089, "name": "NASNet-A", "code": null}, {"id": 369558, "name": "ConvS2S (ensemble of 8 models)", "code": null}, {"id": 369182, "name": "GSM", "code": null}, {"id": 368011, "name": "AWD-LSTM - 3-layer LSTM (tied) + continuous cache pointer (WT2)", "code": null}, {"id": 308274, "name": "RetinaNet-R101", "code": null}, {"id": 306090, "name": "RetinaNet-R50", "code": null}, {"id": 368056, "name": "EI-REHN-1000D", "code": null}, {"id": 306092, "name": "NeuMF (Pinterest)", "code": null}, {"id": 368086, "name": "GL-LWGC-AWD-MoS-LSTM + dynamic evaluation (WT2)", "code": null}, {"id": 306093, "name": "SENet (ImageNet)", "code": null}, {"id": 368342, "name": "PyramidNet", "code": null}, {"id": 368023, "name": "ISS", "code": null}, {"id": 368048, "name": "LSTM + dynamic eval", "code": null}, {"id": 368044, "name": "AWD-LSTM+WT+Cache+IOG (WT2)", "code": null}, {"id": 257039, "name": "AlphaGo Zero", "code": null}, {"id": 369175, "name": "PhraseCond", "code": null}, {"id": 368720, "name": "S-Norm", "code": null}, {"id": 369187, "name": "DCN+", "code": null}, {"id": 368045, "name": "Fraternal dropout + AWD-LSTM 3-layer (WT2)", "code": null}, {"id": 371823, "name": "VQ-VAE", "code": null}, {"id": 368041, "name": "AWD-LSTM-MoS + dynamic evaluation (WT2, 2017)", "code": null}, {"id": 367482, "name": "TriNet", "code": null}, {"id": 371900, "name": "DL scaling LM", "code": null}, {"id": 371893, "name": "DL scaling speech", "code": null}, {"id": 240145, "name": "AlphaZero", "code": null}, {"id": 372319, "name": "T-DMCA", "code": null}, {"id": 306099, "name": "ELMo", "code": null}, {"id": 368074, "name": "QRNN", "code": null}, {"id": 257040, "name": "IMPALA", "code": null}, {"id": 306101, "name": "DeepLabV3+", "code": null}, {"id": 368357, "name": "TCN (P-MNIST)", "code": null}, {"id": 368102, "name": "4 layer QRNN (h=2500)", "code": null}, {"id": 257041, "name": "YOLOv3", "code": null}, {"id": 306104, "name": "ResNeXt-101 32x48d", "code": null}, {"id": 368004, "name": "Dropout-LSTM+Noise(Bernoulli) (WT2)", "code": null}, {"id": 368064, "name": "aLSTM(depth-2)+RecurrentPolicy (WT2)", "code": null}, {"id": 370176, "name": "GPT-1", "code": null}, {"id": 368025, "name": "Relational Memory Core", "code": null}, {"id": 306105, "name": "MobileNetV2", "code": null}, {"id": 371983, "name": "FTW (For The Win)", "code": null}, {"id": 368035, "name": "Big-Little Net", "code": null}, {"id": 369018, "name": "Big-Little Net (speech)", "code": null}, {"id": 368066, "name": "AWD-LSTM-MoS+PDR + dynamic evaluation (WT2)", "code": null}, {"id": 369348, "name": "Big Transformer for Back-Translation", "code": null}, {"id": 368101, "name": "(ensemble): AWD-LSTM-DOC (fin) \u00d7 5 (WT2)", "code": null}, {"id": 369007, "name": "Transformer + Simple Recurrent Unit", "code": null}, {"id": 368059, "name": "AWD-LSTM-MoS + dynamic evaluation (WT2, 2018)", "code": null}, {"id": 368009, "name": "LSTM+NeuralCache", "code": null}, {"id": 369555, "name": "Transformer (Adaptive Input Embeddings) WT103", "code": null}, {"id": 257045, "name": "BERT-Large", "code": null}, {"id": 368087, "name": "TrellisNet", "code": null}, {"id": 368734, "name": "MemoReader", "code": null}, {"id": 369326, "name": "Mesh-TensorFlow Transformer 2.9B (translation)", "code": null}, {"id": 370525, "name": "Mesh-TensorFlow Transformer 4.9B (language)", "code": null}, {"id": 370722, "name": "Fine-tuned-AWD-LSTM-DOC (fin)", "code": null}, {"id": 368028, "name": "Multi-cell LSTM", "code": null}, {"id": 306110, "name": "GPipe (Transformer)", "code": null}, {"id": 371826, "name": "SPN (ImageNet 128)", "code": null}, {"id": 371889, "name": "StyleGAN", "code": null}, {"id": 306111, "name": "Transformer ELMo", "code": null}, {"id": 369511, "name": "Transformer-XL (257M)", "code": null}, {"id": 306112, "name": "MT-DNN", "code": null}, {"id": 257046, "name": "Hanabi 4 player", "code": null}, {"id": 369043, "name": "GPT-2 (1.5B)", "code": null}, {"id": 366991, "name": "KataGo", "code": null}, {"id": 369176, "name": "SciBERT", "code": null}, {"id": 368073, "name": "True-Regularization+Finetune+Dynamic-Eval", "code": null}, {"id": 368351, "name": "WeNet (Penn Treebank)", "code": null}, {"id": 368052, "name": "Transformer-XL + RMS dynamic eval", "code": null}, {"id": 368077, "name": "BERT-Large-CAS (PTB+WT2+WT103)", "code": null}, {"id": 369178, "name": "Neuro-Symbolic Concept Learner", "code": null}, {"id": 306114, "name": "ResNeXt-101 Billion-scale", "code": null}, {"id": 368063, "name": "AWD-LSTM-DRILL + dynamic evaluation\u2020 (WT2)", "code": null}, {"id": 306116, "name": "EfficientNet-L2", "code": null}, {"id": 257051, "name": "DLRM-2020", "code": null}, {"id": 306120, "name": "XLNet", "code": null}, {"id": 368013, "name": "Transformer-XL Large + Phrase Induction", "code": null}, {"id": 368026, "name": "AWD-LSTM + MoS + Partial Shuffled", "code": null}, {"id": 368022, "name": "Char-CNN-BiLSTM", "code": null}, {"id": 306122, "name": "FixRes ResNeXt-101 WSL", "code": null}, {"id": 368358, "name": "LaNet-L (CIFAR-10)", "code": null}, {"id": 365388, "name": "RoBERTa Large", "code": null}, {"id": 306124, "name": "BigBiGAN", "code": null}, {"id": 368081, "name": "Mogrifier (d2, MoS2, MC) + dynamic eval", "code": null}, {"id": 369023, "name": "UDSMProt", "code": null}, {"id": 257055, "name": "Megatron-BERT", "code": null}, {"id": 371869, "name": "Megatron-LM (1.2B)", "code": null}, {"id": 368027, "name": "Megatron-LM (8.3B)", "code": null}, {"id": 368091, "name": "Adaptive Inputs + LayerDrop", "code": null}, {"id": 306125, "name": "ALBERT", "code": null}, {"id": 257056, "name": "AlphaX-1", "code": null}, {"id": 306126, "name": "DistilBERT", "code": null}, {"id": 257059, "name": "T5-11B", "code": null}, {"id": 257058, "name": "T5-3B", "code": null}, {"id": 306127, "name": "BART-large", "code": null}, {"id": 368067, "name": "Base LM + kNN LM + Continuous Cache", "code": null}, {"id": 369168, "name": "XLM-RoBERTa", "code": null}, {"id": 369328, "name": "CamemBERT", "code": null}, {"id": 368005, "name": "Sandwich Transformer", "code": null}, {"id": 306128, "name": "Noisy Student (L2)", "code": null}, {"id": 306129, "name": "MoCo", "code": null}, {"id": 306130, "name": "MuZero", "code": null}, {"id": 368333, "name": "Transformer - LibriVox + Decoding/Rescoring", "code": null}, {"id": 368015, "name": "Photo-Geometric Autoencoder", "code": null}, {"id": 368010, "name": "Transformer-XL DeFINE (141M)", "code": null}, {"id": 371898, "name": "StyleGAN2", "code": null}, {"id": 306131, "name": "StarGAN v2", "code": null}, {"id": 371947, "name": "MMLSTM (PTB)", "code": null}, {"id": 371951, "name": "MMLSTM (WT-2)", "code": null}, {"id": 257061, "name": "OpenAI Five", "code": null}, {"id": 257062, "name": "OpenAI Five Rerun", "code": null}, {"id": 368324, "name": "DD-PPO", "code": null}, {"id": 306132, "name": "Big Transfer (BiT-L)", "code": null}, {"id": 257063, "name": "AlphaFold", "code": null}, {"id": 257064, "name": "Meena", "code": null}, {"id": 306134, "name": "Theseus 6/768", "code": null}, {"id": 369514, "name": "Perceiver IO (optical flow)", "code": null}, {"id": 368046, "name": "TaLK Convolution", "code": null}, {"id": 257107, "name": "ALBERT-xxlarge", "code": null}, {"id": 306135, "name": "SimCLR", "code": null}, {"id": 368060, "name": "Turing-NLG", "code": null}, {"id": 372320, "name": "FFN SwiGLU", "code": null}, {"id": 368047, "name": "Feedback Transformer", "code": null}, {"id": 370531, "name": "TCAN (WT2)", "code": null}, {"id": 368106, "name": "TransformerXL + spectrum control", "code": null}, {"id": 370529, "name": "Routing Transformer (WT-103)", "code": null}, {"id": 368107, "name": "Tensor-Transformer(1core)+PN (WT103)", "code": null}, {"id": 306136, "name": "ELECTRA", "code": null}, {"id": 306137, "name": "MetNet", "code": null}, {"id": 306141, "name": "Go-explore", "code": null}, {"id": 257068, "name": "Once for All", "code": null}, {"id": 368375, "name": "ContextNet", "code": null}, {"id": 368733, "name": "Retrieval-Augmented Generator", "code": null}, {"id": 368341, "name": "DETR", "code": null}, {"id": 354864, "name": "GPT-3 175B (davinci)", "code": null}, {"id": 257072, "name": "GShard (dense)", "code": null}, {"id": 370248, "name": "DeLighT", "code": null}, {"id": 306133, "name": "ERNIE-GEN (large)", "code": null}, {"id": 368338, "name": "ProBERTa", "code": null}, {"id": 369195, "name": "LUKE", "code": null}, {"id": 368329, "name": "mT5-XXL", "code": null}, {"id": 369146, "name": "German ELECTRA Large", "code": null}, {"id": 306145, "name": "ViT-Base/32", "code": null}, {"id": 306146, "name": "ViT-Huge/14", "code": null}, {"id": 257073, "name": "wave2vec 2.0 LARGE", "code": null}, {"id": 268376, "name": "KEPLER", "code": null}, {"id": 368138, "name": "AlphaFold 2", "code": null}, {"id": 257074, "name": "CPM-Large", "code": null}, {"id": 369335, "name": "ESM1b", "code": null}, {"id": 306149, "name": "VQGAN + CLIP", "code": null}, {"id": 369162, "name": "DensePhrases", "code": null}, {"id": 368078, "name": "CT-MoS (WT2)", "code": null}, {"id": 368012, "name": "ERNIE-Doc (247M)", "code": null}, {"id": 306154, "name": "CLIP (ResNet-50)", "code": null}, {"id": 257076, "name": "CLIP (ViT L/14@336px)", "code": null}, {"id": 257077, "name": "DALL-E", "code": null}, {"id": 306151, "name": "BigSSL", "code": null}, {"id": 257078, "name": "Switch", "code": null}, {"id": 366051, "name": "DeiT-B", "code": null}, {"id": 368088, "name": "top-down frozen classifier", "code": null}, {"id": 368753, "name": "MSA Transformer", "code": null}, {"id": 368098, "name": "SRU++ Large", "code": null}, {"id": 257104, "name": "Meta Pseudo Labels", "code": null}, {"id": 307047, "name": "Generative BST", "code": null}, {"id": 306163, "name": "M6-T", "code": null}, {"id": 355353, "name": "PLUG", "code": null}, {"id": 369015, "name": "ProtBERT-BFD", "code": null}, {"id": 371533, "name": "ProtT5-XL-U50", "code": null}, {"id": 368328, "name": "ADM", "code": null}, {"id": 369184, "name": "MedBERT", "code": null}, {"id": 257084, "name": "CogView", "code": null}, {"id": 257098, "name": "Transformer local-attention (NesT-B)", "code": null}, {"id": 368326, "name": "ByT5-XXL", "code": null}, {"id": 257085, "name": "ViT-G/14", "code": null}, {"id": 368362, "name": "CoAtNet", "code": null}, {"id": 368718, "name": "EMDR", "code": null}, {"id": 306150, "name": "DeBERTa", "code": null}, {"id": 257103, "name": "ALIGN", "code": null}, {"id": 306166, "name": "Denoising Diffusion Probabilistic Models (LSUN Bedroom)", "code": null}, {"id": 371868, "name": "StyleGAN3-R", "code": null}, {"id": 371879, "name": "StyleGAN3-T", "code": null}, {"id": 369017, "name": "Fold2Seq", "code": null}, {"id": 368080, "name": "Adaptive Input Transformer + RD", "code": null}, {"id": 257087, "name": "ERNIE 3.0", "code": null}, {"id": 306167, "name": "Codex", "code": null}, {"id": 273165, "name": "GOAT", "code": null}, {"id": 257088, "name": "HuBERT", "code": null}, {"id": 257089, "name": "SEER", "code": null}, {"id": 368355, "name": "6-Act Tether", "code": null}, {"id": 369177, "name": "YOLOX-X", "code": null}, {"id": 257090, "name": "Jurassic-1-Jumbo", "code": null}, {"id": 368748, "name": "DNABERT", "code": null}, {"id": 306169, "name": "XLMR-XXL", "code": null}, {"id": 368372, "name": "FLAN 137B", "code": null}, {"id": 306170, "name": "MEB", "code": null}, {"id": 368104, "name": "PermuteFormer", "code": null}, {"id": 369563, "name": "HyperCLOVA 204B", "code": null}, {"id": 366990, "name": "PLATO-XL", "code": null}, {"id": 368719, "name": "AlphaFold-Multimer", "code": null}, {"id": 257092, "name": "Megatron-Turing NLG 530B", "code": null}, {"id": 257093, "name": "Yuan 1.0", "code": null}, {"id": 368036, "name": "base LM+GNN+kNN", "code": null}, {"id": 369012, "name": "Eve", "code": null}, {"id": 306173, "name": "EfficientZero", "code": null}, {"id": 368092, "name": "S4", "code": null}, {"id": 368339, "name": "Projected GAN", "code": null}, {"id": 368334, "name": "ViT-G/14 (LiT)", "code": null}, {"id": 368364, "name": "BASIC-L", "code": null}, {"id": 35176, "name": "Florence", "code": null}, {"id": 306174, "name": "N\u00dcWA", "code": null}, {"id": 369536, "name": "Student of Games", "code": null}, {"id": 367255, "name": "Gopher (280B)", "code": null}, {"id": 368065, "name": "GLaM", "code": null}, {"id": 369203, "name": "LongT5", "code": null}, {"id": 368725, "name": "Contriever", "code": null}, {"id": 368722, "name": "LDM-1.45B", "code": null}, {"id": 369192, "name": "XGLM-7.5B", "code": null}, {"id": 368055, "name": "ERNIE 3.0 Titan", "code": null}, {"id": 306179, "name": "data2vec (language)", "code": null}, {"id": 306178, "name": "data2vec (speech)", "code": null}, {"id": 306177, "name": "data2vec (vision)", "code": null}, {"id": 369013, "name": "OntoProtein", "code": null}, {"id": 371237, "name": "InstructGPT 175B", "code": null}, {"id": 306182, "name": "AlphaCode", "code": null}, {"id": 306180, "name": "RETRO-7B", "code": null}, {"id": 257096, "name": "GPT-NeoX-20B", "code": null}, {"id": 257095, "name": "LaMDA", "code": null}, {"id": 368350, "name": "ProteinBERT", "code": null}, {"id": 368348, "name": "ST-MoE", "code": null}, {"id": 368367, "name": "PolyCoder", "code": null}, {"id": 306184, "name": "DeepNet", "code": null}, {"id": 306183, "name": "Statement Curriculum Learning", "code": null}, {"id": 368361, "name": "ViT-G (model soup)", "code": null}, {"id": 368108, "name": "Segatron-XL large, M=384 + HCP", "code": null}, {"id": 371963, "name": "Make-A-Scene", "code": null}, {"id": 273166, "name": "Chinchilla", "code": null}, {"id": 273167, "name": "PaLM (540B)", "code": null}, {"id": 306186, "name": "DALL\u00b7E 2", "code": null}, {"id": 268378, "name": "Sparse all-MLP", "code": null}, {"id": 306187, "name": "Flamingo", "code": null}, {"id": 306188, "name": "OPT-175B", "code": null}, {"id": 306190, "name": "UL2", "code": null}, {"id": 306191, "name": "Gato", "code": null}, {"id": 368050, "name": "SimCSE", "code": null}, {"id": 371857, "name": "GPT-2 Medium (FlashAttention)", "code": null}, {"id": 368110, "name": "Tranception", "code": null}, {"id": 366986, "name": "CogVideo", "code": null}, {"id": 368096, "name": "DITTO", "code": null}, {"id": 306193, "name": "MetaLM", "code": null}, {"id": 368373, "name": "CoCa", "code": null}, {"id": 306194, "name": "Parti", "code": null}, {"id": 369037, "name": "ProGen2-xlarge", "code": null}, {"id": 306195, "name": "Minerva (540B)", "code": null}, {"id": 368133, "name": "CodeT5-large", "code": null}, {"id": 306196, "name": "NLLB", "code": null}, {"id": 368746, "name": "BLOOM-176B", "code": null}, {"id": 369022, "name": "ESM2-15B", "code": null}, {"id": 369033, "name": "OmegaPLM", "code": null}, {"id": 354863, "name": "AlexaTM 20B", "code": null}, {"id": 365992, "name": "GLM-130B", "code": null}, {"id": 368716, "name": "BlenderBot 3", "code": null}, {"id": 368374, "name": "PaLI", "code": null}, {"id": 349174, "name": "Whisper", "code": null}, {"id": 369185, "name": "DiffDock", "code": null}, {"id": 369025, "name": "GenSLM", "code": null}, {"id": 369171, "name": "Flan-PaLM 540B", "code": null}, {"id": 369006, "name": "LMSI-Palm", "code": null}, {"id": 368739, "name": "U-PaLM (540B)", "code": null}, {"id": 369151, "name": "eDiff-I", "code": null}, {"id": 368006, "name": "Mogrifier RLSTM (WT2)", "code": null}, {"id": 369155, "name": "mT0-13B", "code": null}, {"id": 369198, "name": "InternImage", "code": null}, {"id": 369545, "name": "EVA-01", "code": null}, {"id": 368109, "name": "Galactica", "code": null}, {"id": 368735, "name": "Fusion in Encoder", "code": null}, {"id": 367574, "name": "ALM 1.0", "code": null}, {"id": 368360, "name": "DiT-XL/2 + Discriminator Guidance", "code": null}, {"id": 368749, "name": "Discriminator Guidance", "code": null}, {"id": 369000, "name": "DeepNash", "code": null}, {"id": 371482, "name": "Vega v2", "code": null}, {"id": 368726, "name": "CaLM", "code": null}, {"id": 368002, "name": "Hybrid H3-2.7B", "code": null}, {"id": 369034, "name": "VALL-E", "code": null}, {"id": 371511, "name": "DreamerV3", "code": null}, {"id": 369009, "name": "Nucleotide Transformer", "code": null}, {"id": 368732, "name": "Ankh_large", "code": null}, {"id": 368354, "name": "DDPM-IP (CelebA)", "code": null}, {"id": 369196, "name": "BLIP-2 (Q-Former)", "code": null}, {"id": 369036, "name": "ProteinDT", "code": null}, {"id": 369193, "name": "ViT-22B", "code": null}, {"id": 367499, "name": "LLaMA-65B", "code": null}, {"id": 369016, "name": "AudioGen", "code": null}, {"id": 367636, "name": "Falcon-40B", "code": null}, {"id": 372333, "name": "GPT-4 (Jun 2023)", "code": null}, {"id": 372308, "name": "GPT-4 (Mar 2023)", "code": null}, {"id": 368999, "name": "LEP-AD", "code": null}, {"id": 368069, "name": "PanGu-\u03a3", "code": null}, {"id": 371534, "name": "SigLIP 400M", "code": null}, {"id": 369142, "name": "VideoMAE V2", "code": null}, {"id": 367081, "name": "BloombergGPT", "code": null}, {"id": 369040, "name": "Segment Anything Model", "code": null}, {"id": 368359, "name": "Incoder-6.7B", "code": null}, {"id": 369173, "name": "DINOv2", "code": null}, {"id": 368724, "name": "Agile Soccer Robot", "code": null}, {"id": 368130, "name": "StarCoder", "code": null}, {"id": 365387, "name": "PaLM 2", "code": null}, {"id": 371486, "name": "Med-PaLM 2", "code": null}, {"id": 369165, "name": "CoEdiT-xxl", "code": null}, {"id": 366449, "name": "ONE-PEACE", "code": null}, {"id": 368125, "name": "CodeT5+", "code": null}, {"id": 368721, "name": "Goat-7B", "code": null}, {"id": 368037, "name": "MusicGen", "code": null}, {"id": 369030, "name": "HyenaDNA", "code": null}, {"id": 369204, "name": "Pangu-Weather", "code": null}, {"id": 367509, "name": "InternLM", "code": null}, {"id": 369010, "name": "xTrimoPGLM -100B", "code": null}, {"id": 368743, "name": "Llama 2-70B", "code": null}, {"id": 368747, "name": "Llama 2-7B", "code": null}, {"id": 369002, "name": "AudioLM", "code": null}, {"id": 369167, "name": "Qwen-VL", "code": null}, {"id": 369042, "name": "PeptideBERT", "code": null}, {"id": 367533, "name": "Jais", "code": null}, {"id": 367528, "name": "Swift", "code": null}, {"id": 369508, "name": "Falcon-180B", "code": null}, {"id": 368737, "name": "AlphaMissense", "code": null}, {"id": 368030, "name": "Show-1", "code": null}, {"id": 370131, "name": "Amazon Titan", "code": null}, {"id": 368729, "name": "FinGPT-13B", "code": null}, {"id": 371966, "name": "RoseTTAFold All-Atom (RFAA)", "code": null}, {"id": 368043, "name": "Ferret (13B)", "code": null}, {"id": 368129, "name": "CODEFUSION (Python)", "code": null}, {"id": 369998, "name": "ChatGLM3-6B", "code": null}, {"id": 368349, "name": "Skywork-13B", "code": null}, {"id": 368740, "name": "Yi-34B", "code": null}, {"id": 368337, "name": "BLUUMI", "code": null}, {"id": 368376, "name": "Grok-1", "code": null}, {"id": 369138, "name": "mPLUG-Owl2", "code": null}, {"id": 372317, "name": "RoFormer", "code": null}, {"id": 369019, "name": "Nemotron-3-8B", "code": null}, {"id": 370977, "name": "GNoME for crystal discovery", "code": null}, {"id": 368736, "name": "Qwen-72B", "code": null}, {"id": 369011, "name": "Mamba-24M (SC09)", "code": null}, {"id": 369174, "name": "Llama Guard", "code": null}, {"id": 371814, "name": "VILA-13B", "code": null}, {"id": 371483, "name": "nekomata-14b", "code": null}, {"id": 371236, "name": "Qwen1.5-72B", "code": null}, {"id": 369170, "name": "Aya", "code": null}, {"id": 371244, "name": "Aramco Metabrain AI", "code": null}, {"id": 369883, "name": "DBRX", "code": null}, {"id": 369542, "name": "ReALM", "code": null}, {"id": 369516, "name": "Llama 3-70B", "code": null}, {"id": 371528, "name": "VILA1.5-13B", "code": null}, {"id": 371962, "name": "AlphaFold 3", "code": null}, {"id": 370141, "name": "Yi-Large", "code": null}, {"id": 370087, "name": "GLM-4 (0520)", "code": null}, {"id": 371545, "name": "ALLaM\u00a0adapted 70B", "code": null}, {"id": 370106, "name": "Qwen2-72B", "code": null}, {"id": 369982, "name": "Nemotron-4 340B", "code": null}, {"id": 370239, "name": "DeepSeek-Coder-V2 236B", "code": null}, {"id": 369981, "name": "ESM3 (98B)", "code": null}, {"id": 370155, "name": "Llama 3.1-405B", "code": null}, {"id": 370093, "name": "AFM-on-device", "code": null}, {"id": 370105, "name": "AFM-server", "code": null}, {"id": 371536, "name": "LLaVA-OV-72B", "code": null}, {"id": 370521, "name": "Table Tennis Agent", "code": null}, {"id": 371474, "name": "Qwen2.5-32B", "code": null}, {"id": 370534, "name": "Qwen2.5-72B", "code": null}, {"id": 371840, "name": "Telechat2-115B", "code": null}, {"id": 371981, "name": "PixelDance", "code": null}, {"id": 371329, "name": "Movie Gen Video", "code": null}, {"id": 371240, "name": "NVLM-D 72B", "code": null}, {"id": 371234, "name": "NVLM-H 72B", "code": null}, {"id": 371232, "name": "NVLM-X 72B", "code": null}, {"id": 371248, "name": "Doubao-pro", "code": null}, {"id": 371242, "name": "Hunyuan-Large", "code": null}, {"id": 371535, "name": "Llama 3.3 70B", "code": null}, {"id": 371364, "name": "EXAONE 3.5 32B", "code": null}, {"id": 371328, "name": "DeepSeek-V3", "code": null}, {"id": 371370, "name": "DeepSeek-R1", "code": null}, {"id": 371331, "name": "Doubao-1.5-pro", "code": null}, {"id": 371539, "name": "Eurus-2-7B-PRIME", "code": null}, {"id": 371513, "name": "Hunyuan-TurboS", "code": null}, {"id": 371472, "name": "EXAONE Deep 32B", "code": null}, {"id": 372634, "name": "DeepSeek-V3 (Mar 2025)", "code": null}, {"id": 371514, "name": "Llama 4 Behemoth (preview)", "code": null}, {"id": 371860, "name": "Llama 4 Maverick", "code": null}, {"id": 371843, "name": "Llama 4 Scout", "code": null}, {"id": 371529, "name": "Pangu Ultra", "code": null}, {"id": 371991, "name": "Qwen3-235B-A22B", "code": null}, {"id": 371939, "name": "Seed1.5-VL", "code": null}, {"id": 372346, "name": "DeepSeek-R1 (May 2025)", "code": null}, {"id": 371886, "name": "EXAONE Path 2.0", "code": null}, {"id": 371815, "name": "Kimi K2", "code": null}, {"id": 371817, "name": "EXAONE 4.0 (32B)", "code": null}, {"id": 371987, "name": "Qwen3-Coder-480B-A35B", "code": null}, {"id": 372632, "name": "Qwen3-235B-A22B (Jul 2025)", "code": null}, {"id": 372358, "name": "Qwen3-235B-A22B-Thinking (Jul 2025)", "code": null}, {"id": 372366, "name": "GLM-4.5", "code": null}, {"id": 372167, "name": "LongCat-Flash", "code": null}, {"id": 371992, "name": "Qwen3-Max", "code": null}, {"id": 372330, "name": "AgentFounder-30B", "code": null}, {"id": 372169, "name": "Qwen3-Omni-30B-A3B", "code": null}, {"id": 372365, "name": "GLM-4.6", "code": null}, {"id": 372318, "name": "Ling-1T", "code": null}, {"id": 372433, "name": "Olmo 3", "code": null}, {"id": 372363, "name": "K-EXAONE", "code": null}, {"id": 372367, "name": "Solar Open 100B\n", "code": null}]}}, "origins": [{"id": 14136, "title": "Parameter, Compute and Data Trends in Machine Learning", "descriptionSnapshot": "We update this chart with the latest available data from our source every month.\n\nThe authors selected the AI systems for inclusion based on the following necessary criteria:\n\u2014 Have an explicit learning component\n\u2014 Showcase experimental results\n\u2014 Advance the state of the art\n\nIn addition, the systems had to meet at least one of the following notability criteria:\n\u2014 Paper has more than 1000 citations\n\u2014 Historical importance\n\u2014 Important state-of-the-art advance\n\u2014 Deployed in a notable context\n\nThe authors note that: \"For new models (from 2020 onward) it is harder to assess these criteria, so we fall back to a subjective selection. We refer to models meeting our selection criteria as 'milestone models.\"\n", "producer": "Epoch AI", "citationFull": "Epoch AI, \u2018Parameter, Compute and Data Trends in Machine Learning\u2019. Published online at epochai.org. Retrieved from: \u2018https://epoch.ai/data/epochdb/visualization\u2019 [online resource]", "urlMain": "https://epoch.ai/mlinputs/visualization", "urlDownload": "https://epoch.ai/data/epochdb/notable_ai_models.csv", "dateAccessed": "2026-03-07", "datePublished": "2025", "license": {"url": "https://creativecommons.org/licenses/by/4.0/", "name": "CC BY 4.0"}}]}