{"id": 1015509, "name": "Number of parameters", "unit": "", "createdAt": "2025-03-15T08:53:18.000Z", "updatedAt": "2026-03-08T06:32:17.000Z", "coverage": "", "timespan": "", "datasetId": 7001, "columnOrder": 0, "shortName": "parameters", "catalogPath": "grapher/artificial_intelligence/2025-03-12/epoch_regressions/epoch_regressions#parameters", "descriptionShort": "Total number of learnable variables or weights that the model contains. Parameters are adjusted during the training process to optimize the model's performance.", "type": "float", "dataChecksum": "11517187035408571574", "metadataChecksum": "3605410569845637512", "datasetName": "Parameter, Compute and Data Trends in Machine Learning - Regressions", "updatePeriodDays": 31, "datasetVersion": "2025-03-12", "nonRedistributable": false, "display": {"zeroDay": "1949-01-01", "yearIsDay": true, "numDecimalPlaces": 0}, "schemaVersion": 2, "processingLevel": "major", "presentation": {"topicTagsLinks": ["Artificial Intelligence"]}, "descriptionKey": ["Parameters are internal variables that machine learning models adjust during their training process to improve their ability to make accurate predictions. They act as the model's \"knobs\" that are fine-tuned based on the provided data. In deep learning, a subset of artificial intelligence (AI), parameters primarily consist of the weights assigned to the connections between the small processing units called neurons. Picture a vast network of interconnected neurons where the strength of each connection represents a parameter.", "The total number of parameters in a model is influenced by various factors. The model's structure and the number of \u201clayers\u201d of neurons play a significant role. Generally, more complex models with additional layers tend to have a higher number of parameters. Special components of specific deep learning architectures can further contribute to the overall parameter count.", "Understanding the number of parameters in a model is crucial to design effective models. More parameters can help the model understand complex data patterns, potentially leading to higher accuracy. However, there's a fine balance to strike. If a model has too many parameters, it risks memorizing the specific examples in its training data rather than learning their underlying patterns. Consequently, it may perform poorly when presented with new, unseen data. Achieving the right balance of parameters is a critical consideration in model development.", "In recent times, the AI community has witnessed the emergence of what are often referred to as \"giant models.\" These models boast an astounding number of parameters, reaching into the billions or even trillions. While these huge models have achieved remarkable performance, they have a significant computational cost. Effectively managing and training such large-scale models has become a prominent and active area of research and discussion within the AI field."], "dimensions": {"years": {"values": [{"id": 547}, {"id": 1102}, {"id": 2250}, {"id": 2922}, {"id": 3833}, {"id": 3986}, {"id": 4106}, {"id": 4198}, {"id": 6513}, {"id": 7425}, {"id": 9070}, {"id": 9739}, {"id": 11413}, {"id": 11893}, {"id": 12143}, {"id": 12661}, {"id": 12874}, {"id": 13740}, {"id": 13787}, {"id": 14035}, {"id": 14044}, {"id": 14457}, {"id": 14778}, {"id": 14940}, {"id": 14944}, {"id": 15126}, {"id": 15142}, {"id": 15248}, {"id": 15279}, {"id": 15675}, {"id": 15826}, {"id": 15994}, {"id": 16039}, {"id": 16236}, {"id": 16283}, {"id": 16403}, {"id": 16442}, {"id": 16771}, {"id": 17044}, {"id": 17131}, {"id": 17320}, {"id": 17335}, {"id": 17562}, {"id": 17652}, {"id": 17836}, {"id": 17850}, {"id": 18201}, {"id": 18263}, {"id": 18414}, {"id": 18959}, {"id": 19334}, {"id": 19796}, {"id": 20266}, {"id": 20423}, {"id": 20672}, {"id": 20986}, {"id": 21017}, {"id": 21356}, {"id": 21520}, {"id": 21735}, {"id": 21891}, {"id": 21892}, {"id": 21915}, {"id": 22012}, {"id": 22080}, {"id": 22158}, {"id": 22240}, {"id": 22280}, {"id": 22412}, {"id": 22445}, {"id": 22537}, {"id": 22548}, {"id": 22763}, {"id": 22814}, {"id": 22841}, {"id": 22905}, {"id": 22920}, {"id": 22956}, {"id": 23164}, {"id": 23203}, {"id": 23262}, {"id": 23283}, {"id": 23345}, {"id": 23347}, {"id": 23456}, {"id": 23521}, {"id": 23588}, {"id": 23664}, {"id": 23690}, {"id": 23714}, {"id": 23720}, {"id": 23728}, {"id": 23729}, {"id": 23730}, {"id": 23741}, {"id": 23804}, {"id": 23874}, {"id": 23892}, {"id": 23912}, {"id": 23913}, {"id": 23914}, {"id": 23936}, {"id": 23958}, {"id": 23987}, {"id": 23993}, {"id": 23997}, {"id": 24000}, {"id": 24051}, {"id": 24073}, {"id": 24077}, {"id": 24092}, {"id": 24096}, {"id": 24106}, {"id": 24142}, {"id": 24155}, {"id": 24161}, {"id": 24181}, {"id": 24201}, {"id": 24243}, {"id": 24263}, {"id": 24264}, {"id": 24271}, {"id": 24312}, {"id": 24324}, {"id": 24379}, {"id": 24406}, {"id": 24432}, {"id": 24441}, {"id": 24449}, {"id": 24455}, {"id": 24524}, {"id": 24525}, {"id": 24532}, {"id": 24542}, {"id": 24566}, {"id": 24591}, {"id": 24599}, {"id": 24639}, {"id": 24705}, {"id": 24708}, {"id": 24722}, {"id": 24726}, {"id": 24731}, {"id": 24740}, {"id": 24751}, {"id": 24772}, {"id": 24779}, {"id": 24780}, {"id": 24781}, {"id": 24791}, {"id": 24792}, {"id": 24818}, {"id": 24820}, {"id": 24828}, {"id": 24830}, {"id": 24842}, {"id": 24843}, {"id": 24859}, {"id": 24943}, {"id": 24999}, {"id": 25020}, {"id": 25027}, {"id": 25035}, {"id": 25038}, {"id": 25055}, {"id": 25062}, {"id": 25077}, {"id": 25084}, {"id": 25085}, {"id": 25094}, {"id": 25100}, {"id": 25105}, {"id": 25127}, {"id": 25140}, {"id": 25150}, {"id": 25171}, {"id": 25233}, {"id": 25237}, {"id": 25247}, {"id": 25282}, {"id": 25299}, {"id": 25323}, {"id": 25324}, {"id": 25343}, {"id": 25353}, {"id": 25370}, {"id": 25385}, {"id": 25392}, {"id": 25427}, {"id": 25443}, {"id": 25461}, {"id": 25462}, {"id": 25468}, {"id": 25472}, {"id": 25485}, {"id": 25489}, {"id": 25510}, {"id": 25517}, {"id": 25520}, {"id": 25521}, {"id": 25539}, {"id": 25547}, {"id": 25567}, {"id": 25575}, {"id": 25597}, {"id": 25598}, {"id": 25611}, {"id": 25624}, {"id": 25625}, {"id": 25651}, {"id": 25664}, {"id": 25673}, {"id": 25676}, {"id": 25681}, {"id": 25688}, {"id": 25700}, {"id": 25708}, {"id": 25714}, {"id": 25717}, {"id": 25718}, {"id": 25721}, {"id": 25727}, {"id": 25731}, {"id": 25734}, {"id": 25748}, {"id": 25751}, {"id": 25769}, {"id": 25813}, {"id": 25826}, {"id": 25834}, {"id": 25835}, {"id": 25841}, {"id": 25850}, {"id": 25862}, {"id": 25868}, {"id": 25869}, {"id": 25871}, {"id": 25875}, {"id": 25880}, {"id": 25881}, {"id": 25883}, {"id": 25889}, {"id": 25897}, {"id": 25903}, {"id": 25905}, {"id": 25913}, {"id": 25924}, {"id": 25946}, {"id": 25959}, {"id": 25969}, {"id": 25970}, {"id": 25971}, {"id": 25975}, {"id": 25976}, {"id": 25983}, {"id": 25990}, {"id": 26002}, {"id": 26003}, {"id": 26008}, {"id": 26014}, {"id": 26015}, {"id": 26030}, {"id": 26051}, {"id": 26054}, {"id": 26058}, {"id": 26059}, {"id": 26068}, {"id": 26074}, {"id": 26078}, {"id": 26080}, {"id": 26113}, {"id": 26140}, {"id": 26147}, {"id": 26150}, {"id": 26176}, {"id": 26207}, {"id": 26225}, {"id": 26226}, {"id": 26227}, {"id": 26259}, {"id": 26266}, {"id": 26267}, {"id": 26281}, {"id": 26291}, {"id": 26297}, {"id": 26302}, {"id": 26307}, {"id": 26308}, {"id": 26312}, {"id": 26337}, {"id": 26341}, {"id": 26346}, {"id": 26352}, {"id": 26357}, {"id": 26361}, {"id": 26380}, {"id": 26406}, {"id": 26421}, {"id": 26428}, {"id": 26437}, {"id": 26443}, {"id": 26445}, {"id": 26456}, {"id": 26457}, {"id": 26458}, {"id": 26459}, {"id": 26469}, {"id": 26471}, {"id": 26472}, {"id": 26476}, {"id": 26483}, {"id": 26485}, {"id": 26505}, {"id": 26507}, {"id": 26512}, {"id": 26515}, {"id": 26516}, {"id": 26520}, {"id": 26524}, {"id": 26526}, {"id": 26543}, {"id": 26544}, {"id": 26546}, {"id": 26550}, {"id": 26560}, {"id": 26561}, {"id": 26568}, {"id": 26581}, {"id": 26582}, {"id": 26587}, {"id": 26597}, {"id": 26601}, {"id": 26602}, {"id": 26612}, {"id": 26616}, {"id": 26619}, {"id": 26620}, {"id": 26623}, {"id": 26625}, {"id": 26634}, {"id": 26639}, {"id": 26644}, {"id": 26646}, {"id": 26647}, {"id": 26651}, {"id": 26654}, {"id": 26662}, {"id": 26669}, {"id": 26682}, {"id": 26684}, {"id": 26685}, {"id": 26689}, {"id": 26695}, {"id": 26700}, {"id": 26701}, {"id": 26702}, {"id": 26703}, {"id": 26710}, {"id": 26719}, {"id": 26722}, {"id": 26723}, {"id": 26731}, {"id": 26742}, {"id": 26745}, {"id": 26750}, {"id": 26756}, {"id": 26758}, {"id": 26759}, {"id": 26765}, {"id": 26766}, {"id": 26781}, {"id": 26784}, {"id": 26786}, {"id": 26792}, {"id": 26794}, {"id": 26805}, {"id": 26809}, {"id": 26811}, {"id": 26819}, {"id": 26827}, {"id": 26835}, {"id": 26840}, {"id": 26842}, {"id": 26848}, {"id": 26849}, {"id": 26854}, {"id": 26864}, {"id": 26865}, {"id": 26876}, {"id": 26878}, {"id": 26884}, {"id": 26896}, {"id": 26919}, {"id": 26926}, {"id": 26939}, {"id": 26940}, {"id": 26946}, {"id": 26955}, {"id": 26968}, {"id": 26969}, {"id": 26976}, {"id": 26980}, {"id": 26982}, {"id": 26984}, {"id": 26986}, {"id": 26994}, {"id": 27000}, {"id": 27009}, {"id": 27015}, {"id": 27024}, {"id": 27032}, {"id": 27037}, {"id": 27042}, {"id": 27043}, {"id": 27053}, {"id": 27054}, {"id": 27057}, {"id": 27068}, {"id": 27071}, {"id": 27082}, {"id": 27088}, {"id": 27091}, {"id": 27092}, {"id": 27101}, {"id": 27106}, {"id": 27113}, {"id": 27115}, {"id": 27116}, {"id": 27122}, {"id": 27126}, {"id": 27131}, {"id": 27134}, {"id": 27156}, {"id": 27157}, {"id": 27158}, {"id": 27163}, {"id": 27164}, {"id": 27165}, {"id": 27167}, {"id": 27170}, {"id": 27176}, {"id": 27186}, {"id": 27191}, {"id": 27205}, {"id": 27212}, {"id": 27213}, {"id": 27214}, {"id": 27226}, {"id": 27234}, {"id": 27236}, {"id": 27263}, {"id": 27268}, {"id": 27269}, {"id": 27276}, {"id": 27282}, {"id": 27292}, {"id": 27298}, {"id": 27307}, {"id": 27311}, {"id": 27313}, {"id": 27317}, {"id": 27326}, {"id": 27327}, {"id": 27330}, {"id": 27333}, {"id": 27334}, {"id": 27335}, {"id": 27336}, {"id": 27337}, {"id": 27338}, {"id": 27339}, {"id": 27344}, {"id": 27345}, {"id": 27346}, {"id": 27360}, {"id": 27361}, {"id": 27362}, {"id": 27368}, {"id": 27369}, {"id": 27372}, {"id": 27373}, {"id": 27375}, {"id": 27380}, {"id": 27382}, {"id": 27384}, {"id": 27390}, {"id": 27393}, {"id": 27409}, {"id": 27417}, {"id": 27427}, {"id": 27435}, {"id": 27445}, {"id": 27446}, {"id": 27456}, {"id": 27466}, {"id": 27479}, {"id": 27481}, {"id": 27498}, {"id": 27501}, {"id": 27516}, {"id": 27526}, {"id": 27533}, {"id": 27534}, {"id": 27551}, {"id": 27557}, {"id": 27558}, {"id": 27561}, {"id": 27568}, {"id": 27569}, {"id": 27580}, {"id": 27590}, {"id": 27597}, {"id": 27598}, {"id": 27603}, {"id": 27611}, {"id": 27612}, {"id": 27627}, {"id": 27642}, {"id": 27653}, {"id": 27655}, {"id": 27656}, {"id": 27660}, {"id": 27670}, {"id": 27674}, {"id": 27675}, {"id": 27676}, {"id": 27688}, {"id": 27694}, {"id": 27703}, {"id": 27715}, {"id": 27722}, {"id": 27732}, {"id": 27733}, {"id": 27736}, {"id": 27740}, {"id": 27751}, {"id": 27758}, {"id": 27775}, {"id": 27778}, {"id": 27792}, {"id": 27806}, {"id": 27823}, {"id": 27828}, {"id": 27833}, {"id": 27839}, {"id": 27841}, {"id": 27853}, {"id": 27858}, {"id": 27877}, {"id": 27906}, {"id": 27914}, {"id": 27920}, {"id": 27921}, {"id": 27948}, {"id": 27950}, {"id": 27954}, {"id": 27961}, {"id": 27964}, {"id": 27971}, {"id": 27974}, {"id": 27975}, {"id": 28002}, {"id": 28006}, {"id": 28017}, {"id": 28023}, {"id": 28031}, {"id": 28041}, {"id": 28058}, {"id": 28059}, {"id": 28068}, {"id": 28082}, {"id": 28089}, {"id": 28114}, {"id": 28115}, {"id": 28121}, {"id": 28122}, {"id": 28123}]}, "entities": {"values": [{"id": 370903, "name": "1.2x/year between 1950\u20132010", "code": null}, {"id": 256993, "name": "Theseus", "code": null}, {"id": 305969, "name": "SNARC", "code": null}, {"id": 305970, "name": "Self Organizing System", "code": null}, {"id": 257002, "name": "Perceptron Mark I", "code": null}, {"id": 256994, "name": "Samuel Neural Checkers", "code": null}, {"id": 354868, "name": "Pattern recognition and reading by machine", "code": null}, {"id": 369024, "name": "Perceptron (1960)", "code": null}, {"id": 256995, "name": "ADALINE", "code": null}, {"id": 369517, "name": "LTE speaker verification system", "code": null}, {"id": 369520, "name": "Decision tree adaline", "code": null}, {"id": 369537, "name": "Piecewise linear model", "code": null}, {"id": 305980, "name": "Cognitron", "code": null}, {"id": 256996, "name": "Neocognitron", "code": null}, {"id": 305982, "name": "Kohonen network", "code": null}, {"id": 305983, "name": "Hopfield network", "code": null}, {"id": 305984, "name": "ASE+ACE", "code": null}, {"id": 369510, "name": "Hierarchical Cognitron", "code": null}, {"id": 369560, "name": "Distributed representation NN", "code": null}, {"id": 372328, "name": "MLP with back-propagation", "code": null}, {"id": 368075, "name": "NetTalk (dictionary)", "code": null}, {"id": 368083, "name": "NetTalk (transcription)", "code": null}, {"id": 371864, "name": "Translation-invariant MLP", "code": null}, {"id": 369973, "name": "MLN-ASR", "code": null}, {"id": 369546, "name": "Truck backer-upper", "code": null}, {"id": 372312, "name": "Handwritten digit recognition network", "code": null}, {"id": 369529, "name": "Speaker-independent vowel classification", "code": null}, {"id": 257006, "name": "Zip CNN", "code": null}, {"id": 371841, "name": "NETtalk reimplementation", "code": null}, {"id": 369990, "name": "Bankruptcy-NN", "code": null}, {"id": 369968, "name": "SexNet classification", "code": null}, {"id": 369967, "name": "SexNet compression", "code": null}, {"id": 369526, "name": "RAAM", "code": null}, {"id": 369977, "name": "Weight Decay", "code": null}, {"id": 257007, "name": "TD-Gammon", "code": null}, {"id": 369525, "name": "Cancer drug mechanism prediction", "code": null}, {"id": 369995, "name": "Boosting", "code": null}, {"id": 305992, "name": "IBM-5", "code": null}, {"id": 369969, "name": "Siamese-TDNN", "code": null}, {"id": 369966, "name": "ANN Eye Tracker", "code": null}, {"id": 369972, "name": "Ceramic-MLP", "code": null}, {"id": 369964, "name": "JPMAX", "code": null}, {"id": 369991, "name": "Mixture of linear models", "code": null}, {"id": 369992, "name": "NeuroChess", "code": null}, {"id": 369996, "name": "Predictive Coding NN", "code": null}, {"id": 305994, "name": "Support Vector Machines", "code": null}, {"id": 369970, "name": "LISSOM", "code": null}, {"id": 369993, "name": "MUSIC perceptron", "code": null}, {"id": 256997, "name": "System 11", "code": null}, {"id": 369979, "name": "SOM-CNN", "code": null}, {"id": 336936, "name": "Deep Blue", "code": null}, {"id": 367568, "name": "Bidirectional RNN", "code": null}, {"id": 256998, "name": "LSTM", "code": null}, {"id": 245542, "name": "LeNet-5", "code": null}, {"id": 354866, "name": "LSTM with forget gates", "code": null}, {"id": 371856, "name": "RECONTRA-categorized", "code": null}, {"id": 371849, "name": "RECONTRA-uncategorized", "code": null}, {"id": 369997, "name": "Neural LM", "code": null}, {"id": 369980, "name": "PoE MNIST", "code": null}, {"id": 257009, "name": "Decision tree (classification)", "code": null}, {"id": 371812, "name": "NPLM (AP News)", "code": null}, {"id": 371816, "name": "NPLM (Brown)", "code": null}, {"id": 369971, "name": "Invariant CNN", "code": null}, {"id": 369988, "name": "LMICA", "code": null}, {"id": 370522, "name": "RankNet", "code": null}, {"id": 371873, "name": "SVM-CNN", "code": null}, {"id": 306014, "name": "Deep Belief Nets", "code": null}, {"id": 370719, "name": "Dimensionality Reduction", "code": null}, {"id": 369975, "name": "KN-LM", "code": null}, {"id": 369976, "name": "SB-LM", "code": null}, {"id": 368136, "name": "BLSTM for handwriting (2)", "code": null}, {"id": 367308, "name": "Deep Multitask NLP Network", "code": null}, {"id": 369564, "name": "HLBL", "code": null}, {"id": 371852, "name": "GNN", "code": null}, {"id": 371830, "name": "BP-DBN", "code": null}, {"id": 368330, "name": "RBM Image Classifier", "code": null}, {"id": 257011, "name": "GPU DBNs", "code": null}, {"id": 371877, "name": "Two Stage Feature Extraction (MNIST)", "code": null}, {"id": 371881, "name": "LCNP LabelMe", "code": null}, {"id": 371878, "name": "LCNP MNIST", "code": null}, {"id": 371874, "name": "LCNP NORB", "code": null}, {"id": 370904, "name": "2.0x/year between 2010\u20132025", "code": null}, {"id": 369004, "name": "Super-vector coding", "code": null}, {"id": 257013, "name": "Feedforward NN", "code": null}, {"id": 306030, "name": "ReLU (NORB)", "code": null}, {"id": 371871, "name": "Pooling CNN (Caltech 101)", "code": null}, {"id": 371872, "name": "Pooling CNN (NORB)", "code": null}, {"id": 369528, "name": "RNN LM", "code": null}, {"id": 370720, "name": "Deep Autoencoders", "code": null}, {"id": 369512, "name": "Vector Space Model", "code": null}, {"id": 371885, "name": "High Performance CNN (NORB)", "code": null}, {"id": 371842, "name": "CNN Committee (MNIST)", "code": null}, {"id": 371862, "name": "CNN Committee (NIST)", "code": null}, {"id": 371863, "name": "CNN committee (traffic sign)", "code": null}, {"id": 306033, "name": "NLP from scratch", "code": null}, {"id": 257017, "name": "Dropout (MNIST)", "code": null}, {"id": 306035, "name": "Dropout (TIMIT)", "code": null}, {"id": 366988, "name": "Unsupervised High-level Feature Learner", "code": null}, {"id": 369534, "name": "LSTM LM", "code": null}, {"id": 240132, "name": "AlexNet", "code": null}, {"id": 368007, "name": "RNN+LDA+KN5+cache", "code": null}, {"id": 371854, "name": "DNN EM segmentation", "code": null}, {"id": 369538, "name": "DistBelief Speech", "code": null}, {"id": 369541, "name": "DistBelief Vision", "code": null}, {"id": 306039, "name": "PreTrans-3L-250H", "code": null}, {"id": 369989, "name": "Multilingual DNN", "code": null}, {"id": 369974, "name": "ReLU-Speech", "code": null}, {"id": 371980, "name": "Hierarchical Scene Labeling (Stanford Background)", "code": null}, {"id": 257018, "name": "Word2Vec (large)", "code": null}, {"id": 306040, "name": "Word2Vec (small)", "code": null}, {"id": 306041, "name": "R-CNN (T-net)", "code": null}, {"id": 257106, "name": "TransE", "code": null}, {"id": 369344, "name": "RNN for 1B words", "code": null}, {"id": 240135, "name": "DQN", "code": null}, {"id": 306044, "name": "Image generation", "code": null}, {"id": 367550, "name": "OverFeat", "code": null}, {"id": 306049, "name": "GloVe (32B)", "code": null}, {"id": 306048, "name": "GloVe (6B)", "code": null}, {"id": 306051, "name": "HyperNEAT", "code": null}, {"id": 369565, "name": "Paragraph Vector", "code": null}, {"id": 369562, "name": "AdaRNN", "code": null}, {"id": 369556, "name": "Dropout: SVHN", "code": null}, {"id": 369994, "name": "Fragment embedding", "code": null}, {"id": 368752, "name": "RNN-WER", "code": null}, {"id": 306047, "name": "Multiresolution CNN", "code": null}, {"id": 371825, "name": "ACF-WIDER", "code": null}, {"id": 371884, "name": "NPD", "code": null}, {"id": 257023, "name": "VGG16", "code": null}, {"id": 306053, "name": "VGG19", "code": null}, {"id": 307046, "name": "Seq2Seq LSTM", "code": null}, {"id": 368042, "name": "SPN-4+KN5", "code": null}, {"id": 257026, "name": "GoogLeNet / InceptionV1", "code": null}, {"id": 306054, "name": "LRCN", "code": null}, {"id": 371867, "name": "TA-CNN", "code": null}, {"id": 370721, "name": "SNM-skip", "code": null}, {"id": 368346, "name": "Fractional Max-Pooling", "code": null}, {"id": 257024, "name": "ADAM (CIFAR-10)", "code": null}, {"id": 371883, "name": "VGG-Face", "code": null}, {"id": 257025, "name": "MSRA (C, PReLU)", "code": null}, {"id": 370247, "name": "TRPO", "code": null}, {"id": 306056, "name": "DQN-2015", "code": null}, {"id": 368084, "name": "genCNN + dyn eval", "code": null}, {"id": 371810, "name": "TC-DNN-BLSTM-DNN", "code": null}, {"id": 371876, "name": "U-Net", "code": null}, {"id": 371875, "name": "CFSS", "code": null}, {"id": 306061, "name": "YOLO", "code": null}, {"id": 306062, "name": "BatchNorm", "code": null}, {"id": 371870, "name": "Deep CNN + COTS", "code": null}, {"id": 371859, "name": "DCNN", "code": null}, {"id": 257027, "name": "AlphaGo Fan", "code": null}, {"id": 371882, "name": "SAF R-CNN", "code": null}, {"id": 371824, "name": "3DDFA", "code": null}, {"id": 306064, "name": "Inception v3", "code": null}, {"id": 371899, "name": "ResNet-101 (ImageNet)", "code": null}, {"id": 306065, "name": "ResNet-110 (CIFAR-10)", "code": null}, {"id": 257028, "name": "ResNet-152 (ImageNet)", "code": null}, {"id": 368031, "name": "Variational (untied weights, MC) LSTM (Large)", "code": null}, {"id": 306070, "name": "Inception-ResNet-V2", "code": null}, {"id": 306069, "name": "Inceptionv4", "code": null}, {"id": 306071, "name": "SqueezeNet", "code": null}, {"id": 371982, "name": "Double DQN", "code": null}, {"id": 371903, "name": "Template Adaptation\n", "code": null}, {"id": 336810, "name": "Dueling DQN", "code": null}, {"id": 368134, "name": "Gated HORNN (3rd order)", "code": null}, {"id": 371902, "name": "LRR-4X", "code": null}, {"id": 371891, "name": "CMS-RCNN", "code": null}, {"id": 368352, "name": "SimpleNet", "code": null}, {"id": 306073, "name": "DenseNet-264", "code": null}, {"id": 371897, "name": "LF-MMI", "code": null}, {"id": 371904, "name": "MS-ensemble-speech-recognition", "code": null}, {"id": 367493, "name": "ResNet-1001", "code": null}, {"id": 257030, "name": "GNMT", "code": null}, {"id": 368076, "name": "Pointer Sentinel-LSTM (medium)", "code": null}, {"id": 240142, "name": "Xception", "code": null}, {"id": 369001, "name": "SPIDER2", "code": null}, {"id": 368057, "name": "VD-LSTM+REAL Large", "code": null}, {"id": 369179, "name": "BIDAF", "code": null}, {"id": 370527, "name": "NAS with base 8 and shared embeddings", "code": null}, {"id": 257031, "name": "NASv3 (CIFAR-10)", "code": null}, {"id": 371895, "name": "DLDL (PASCAL)", "code": null}, {"id": 372332, "name": "ResNeXt-101 (64\u00d74d)", "code": null}, {"id": 306077, "name": "ResNeXt-50", "code": null}, {"id": 306078, "name": "PolyNet", "code": null}, {"id": 371890, "name": "HR-ResNet101", "code": null}, {"id": 371809, "name": "3DMM-CNN", "code": null}, {"id": 371866, "name": "EnhanceNet", "code": null}, {"id": 306081, "name": "YOLOv2", "code": null}, {"id": 257034, "name": "DeepStack", "code": null}, {"id": 368323, "name": "OR-WideResNet", "code": null}, {"id": 370178, "name": "MoE-Multi", "code": null}, {"id": 306083, "name": "MobileNet", "code": null}, {"id": 371241, "name": "Transformer (2017)", "code": null}, {"id": 306088, "name": "ShuffleNet v1", "code": null}, {"id": 257037, "name": "JFT", "code": null}, {"id": 368032, "name": "AWD-LSTM", "code": null}, {"id": 306089, "name": "NASNet-A", "code": null}, {"id": 368011, "name": "AWD-LSTM - 3-layer LSTM (tied) + continuous cache pointer (WT2)", "code": null}, {"id": 308274, "name": "RetinaNet-R101", "code": null}, {"id": 306090, "name": "RetinaNet-R50", "code": null}, {"id": 368056, "name": "EI-REHN-1000D", "code": null}, {"id": 368086, "name": "GL-LWGC-AWD-MoS-LSTM + dynamic evaluation (WT2)", "code": null}, {"id": 306093, "name": "SENet (ImageNet)", "code": null}, {"id": 368342, "name": "PyramidNet", "code": null}, {"id": 368023, "name": "ISS", "code": null}, {"id": 368048, "name": "LSTM + dynamic eval", "code": null}, {"id": 368044, "name": "AWD-LSTM+WT+Cache+IOG (WT2)", "code": null}, {"id": 257039, "name": "AlphaGo Zero", "code": null}, {"id": 368045, "name": "Fraternal dropout + AWD-LSTM 3-layer (WT2)", "code": null}, {"id": 368041, "name": "AWD-LSTM-MoS + dynamic evaluation (WT2, 2017)", "code": null}, {"id": 371892, "name": "DL scaling Image", "code": null}, {"id": 371900, "name": "DL scaling LM", "code": null}, {"id": 371893, "name": "DL scaling speech", "code": null}, {"id": 306099, "name": "ELMo", "code": null}, {"id": 368074, "name": "QRNN", "code": null}, {"id": 257040, "name": "IMPALA", "code": null}, {"id": 368357, "name": "TCN (P-MNIST)", "code": null}, {"id": 368102, "name": "4 layer QRNN (h=2500)", "code": null}, {"id": 257041, "name": "YOLOv3", "code": null}, {"id": 306104, "name": "ResNeXt-101 32x48d", "code": null}, {"id": 368004, "name": "Dropout-LSTM+Noise(Bernoulli) (WT2)", "code": null}, {"id": 368064, "name": "aLSTM(depth-2)+RecurrentPolicy (WT2)", "code": null}, {"id": 370176, "name": "GPT-1", "code": null}, {"id": 306105, "name": "MobileNetV2", "code": null}, {"id": 371983, "name": "FTW (For The Win)", "code": null}, {"id": 368035, "name": "Big-Little Net", "code": null}, {"id": 369018, "name": "Big-Little Net (speech)", "code": null}, {"id": 368066, "name": "AWD-LSTM-MoS+PDR + dynamic evaluation (WT2)", "code": null}, {"id": 368101, "name": "(ensemble): AWD-LSTM-DOC (fin) \u00d7 5 (WT2)", "code": null}, {"id": 369007, "name": "Transformer + Simple Recurrent Unit", "code": null}, {"id": 368059, "name": "AWD-LSTM-MoS + dynamic evaluation (WT2, 2018)", "code": null}, {"id": 368009, "name": "LSTM+NeuralCache", "code": null}, {"id": 369555, "name": "Transformer (Adaptive Input Embeddings) WT103", "code": null}, {"id": 257045, "name": "BERT-Large", "code": null}, {"id": 306108, "name": "MetaMimic", "code": null}, {"id": 368087, "name": "TrellisNet", "code": null}, {"id": 369326, "name": "Mesh-TensorFlow Transformer 2.9B (translation)", "code": null}, {"id": 370525, "name": "Mesh-TensorFlow Transformer 4.9B (language)", "code": null}, {"id": 370722, "name": "Fine-tuned-AWD-LSTM-DOC (fin)", "code": null}, {"id": 368028, "name": "Multi-cell LSTM", "code": null}, {"id": 306110, "name": "GPipe (Transformer)", "code": null}, {"id": 371826, "name": "SPN (ImageNet 128)", "code": null}, {"id": 371889, "name": "StyleGAN", "code": null}, {"id": 306111, "name": "Transformer ELMo", "code": null}, {"id": 369511, "name": "Transformer-XL (257M)", "code": null}, {"id": 306112, "name": "MT-DNN", "code": null}, {"id": 257046, "name": "Hanabi 4 player", "code": null}, {"id": 369043, "name": "GPT-2 (1.5B)", "code": null}, {"id": 366991, "name": "KataGo", "code": null}, {"id": 369189, "name": "NMT Transformer 437M", "code": null}, {"id": 369176, "name": "SciBERT", "code": null}, {"id": 368073, "name": "True-Regularization+Finetune+Dynamic-Eval", "code": null}, {"id": 368351, "name": "WeNet (Penn Treebank)", "code": null}, {"id": 368052, "name": "Transformer-XL + RMS dynamic eval", "code": null}, {"id": 368077, "name": "BERT-Large-CAS (PTB+WT2+WT103)", "code": null}, {"id": 371848, "name": "MuseNet", "code": null}, {"id": 306114, "name": "ResNeXt-101 Billion-scale", "code": null}, {"id": 368063, "name": "AWD-LSTM-DRILL + dynamic evaluation\u2020 (WT2)", "code": null}, {"id": 306115, "name": "CPC v2", "code": null}, {"id": 306116, "name": "EfficientNet-L2", "code": null}, {"id": 257051, "name": "DLRM-2020", "code": null}, {"id": 306119, "name": "XLM", "code": null}, {"id": 306120, "name": "XLNet", "code": null}, {"id": 368013, "name": "Transformer-XL Large + Phrase Induction", "code": null}, {"id": 368026, "name": "AWD-LSTM + MoS + Partial Shuffled", "code": null}, {"id": 306122, "name": "FixRes ResNeXt-101 WSL", "code": null}, {"id": 368358, "name": "LaNet-L (CIFAR-10)", "code": null}, {"id": 365388, "name": "RoBERTa Large", "code": null}, {"id": 306124, "name": "BigBiGAN", "code": null}, {"id": 368018, "name": "EN^2AS with performance reward", "code": null}, {"id": 368081, "name": "Mogrifier (d2, MoS2, MC) + dynamic eval", "code": null}, {"id": 369023, "name": "UDSMProt", "code": null}, {"id": 257055, "name": "Megatron-BERT", "code": null}, {"id": 371869, "name": "Megatron-LM (1.2B)", "code": null}, {"id": 368027, "name": "Megatron-LM (8.3B)", "code": null}, {"id": 368091, "name": "Adaptive Inputs + LayerDrop", "code": null}, {"id": 306125, "name": "ALBERT", "code": null}, {"id": 257056, "name": "AlphaX-1", "code": null}, {"id": 306126, "name": "DistilBERT", "code": null}, {"id": 369159, "name": "M4-50B", "code": null}, {"id": 257059, "name": "T5-11B", "code": null}, {"id": 257058, "name": "T5-3B", "code": null}, {"id": 306127, "name": "BART-large", "code": null}, {"id": 257060, "name": "AlphaStar", "code": null}, {"id": 368067, "name": "Base LM + kNN LM + Continuous Cache", "code": null}, {"id": 369168, "name": "XLM-RoBERTa", "code": null}, {"id": 369328, "name": "CamemBERT", "code": null}, {"id": 368005, "name": "Sandwich Transformer", "code": null}, {"id": 306128, "name": "Noisy Student (L2)", "code": null}, {"id": 306129, "name": "MoCo", "code": null}, {"id": 306130, "name": "MuZero", "code": null}, {"id": 368333, "name": "Transformer - LibriVox + Decoding/Rescoring", "code": null}, {"id": 368010, "name": "Transformer-XL DeFINE (141M)", "code": null}, {"id": 371898, "name": "StyleGAN2", "code": null}, {"id": 371947, "name": "MMLSTM (PTB)", "code": null}, {"id": 371951, "name": "MMLSTM (WT-2)", "code": null}, {"id": 257061, "name": "OpenAI Five", "code": null}, {"id": 257062, "name": "OpenAI Five Rerun", "code": null}, {"id": 306132, "name": "Big Transfer (BiT-L)", "code": null}, {"id": 257063, "name": "AlphaFold", "code": null}, {"id": 257064, "name": "Meena", "code": null}, {"id": 306134, "name": "Theseus 6/768", "code": null}, {"id": 369514, "name": "Perceiver IO (optical flow)", "code": null}, {"id": 368046, "name": "TaLK Convolution", "code": null}, {"id": 257107, "name": "ALBERT-xxlarge", "code": null}, {"id": 306135, "name": "SimCLR", "code": null}, {"id": 368060, "name": "Turing-NLG", "code": null}, {"id": 372320, "name": "FFN SwiGLU", "code": null}, {"id": 368047, "name": "Feedback Transformer", "code": null}, {"id": 370531, "name": "TCAN (WT2)", "code": null}, {"id": 368106, "name": "TransformerXL + spectrum control", "code": null}, {"id": 370529, "name": "Routing Transformer (WT-103)", "code": null}, {"id": 368107, "name": "Tensor-Transformer(1core)+PN (WT103)", "code": null}, {"id": 306136, "name": "ELECTRA", "code": null}, {"id": 306137, "name": "MetNet", "code": null}, {"id": 306140, "name": "CURL", "code": null}, {"id": 257068, "name": "Once for All", "code": null}, {"id": 365990, "name": "UnifiedQA", "code": null}, {"id": 371942, "name": "NAS+ESS (23M)", "code": null}, {"id": 368375, "name": "ContextNet", "code": null}, {"id": 368377, "name": "Conformer", "code": null}, {"id": 368733, "name": "Retrieval-Augmented Generator", "code": null}, {"id": 368341, "name": "DETR", "code": null}, {"id": 354864, "name": "GPT-3 175B (davinci)", "code": null}, {"id": 257072, "name": "GShard (dense)", "code": null}, {"id": 306144, "name": "EfficientDet", "code": null}, {"id": 370248, "name": "DeLighT", "code": null}, {"id": 306133, "name": "ERNIE-GEN (large)", "code": null}, {"id": 368338, "name": "ProBERTa", "code": null}, {"id": 369195, "name": "LUKE", "code": null}, {"id": 369190, "name": "Conformer + Wav2vec 2.0 + Noisy Student", "code": null}, {"id": 368329, "name": "mT5-XXL", "code": null}, {"id": 369146, "name": "German ELECTRA Large", "code": null}, {"id": 306145, "name": "ViT-Base/32", "code": null}, {"id": 306146, "name": "ViT-Huge/14", "code": null}, {"id": 257073, "name": "wave2vec 2.0 LARGE", "code": null}, {"id": 268376, "name": "KEPLER", "code": null}, {"id": 368138, "name": "AlphaFold 2", "code": null}, {"id": 257074, "name": "CPM-Large", "code": null}, {"id": 369335, "name": "ESM1b", "code": null}, {"id": 368078, "name": "CT-MoS (WT2)", "code": null}, {"id": 368012, "name": "ERNIE-Doc (247M)", "code": null}, {"id": 306154, "name": "CLIP (ResNet-50)", "code": null}, {"id": 257076, "name": "CLIP (ViT L/14@336px)", "code": null}, {"id": 257077, "name": "DALL-E", "code": null}, {"id": 306151, "name": "BigSSL", "code": null}, {"id": 257078, "name": "Switch", "code": null}, {"id": 366051, "name": "DeiT-B", "code": null}, {"id": 371972, "name": "DLWP", "code": null}, {"id": 368753, "name": "MSA Transformer", "code": null}, {"id": 306156, "name": "Rational DQN Average", "code": null}, {"id": 368098, "name": "SRU++ Large", "code": null}, {"id": 257104, "name": "Meta Pseudo Labels", "code": null}, {"id": 307047, "name": "Generative BST", "code": null}, {"id": 306163, "name": "M6-T", "code": null}, {"id": 371949, "name": "Unicorn", "code": null}, {"id": 355353, "name": "PLUG", "code": null}, {"id": 369015, "name": "ProtBERT-BFD", "code": null}, {"id": 371533, "name": "ProtT5-XL-U50", "code": null}, {"id": 368328, "name": "ADM", "code": null}, {"id": 369184, "name": "MedBERT", "code": null}, {"id": 257084, "name": "CogView", "code": null}, {"id": 257098, "name": "Transformer local-attention (NesT-B)", "code": null}, {"id": 368326, "name": "ByT5-XXL", "code": null}, {"id": 257085, "name": "ViT-G/14", "code": null}, {"id": 368362, "name": "CoAtNet", "code": null}, {"id": 368718, "name": "EMDR", "code": null}, {"id": 306150, "name": "DeBERTa", "code": null}, {"id": 257103, "name": "ALIGN", "code": null}, {"id": 306166, "name": "Denoising Diffusion Probabilistic Models (LSUN Bedroom)", "code": null}, {"id": 371868, "name": "StyleGAN3-R", "code": null}, {"id": 371879, "name": "StyleGAN3-T", "code": null}, {"id": 370177, "name": "EfficientNetV2-XL", "code": null}, {"id": 369017, "name": "Fold2Seq", "code": null}, {"id": 368080, "name": "Adaptive Input Transformer + RD", "code": null}, {"id": 257087, "name": "ERNIE 3.0", "code": null}, {"id": 306167, "name": "Codex", "code": null}, {"id": 273165, "name": "GOAT", "code": null}, {"id": 257088, "name": "HuBERT", "code": null}, {"id": 257089, "name": "SEER", "code": null}, {"id": 368355, "name": "6-Act Tether", "code": null}, {"id": 369177, "name": "YOLOX-X", "code": null}, {"id": 368347, "name": "W2v-BERT", "code": null}, {"id": 257090, "name": "Jurassic-1-Jumbo", "code": null}, {"id": 367497, "name": "Zidong Taichu", "code": null}, {"id": 368748, "name": "DNABERT", "code": null}, {"id": 306169, "name": "XLMR-XXL", "code": null}, {"id": 368372, "name": "FLAN 137B", "code": null}, {"id": 306170, "name": "MEB", "code": null}, {"id": 368104, "name": "PermuteFormer", "code": null}, {"id": 369563, "name": "HyperCLOVA 204B", "code": null}, {"id": 366990, "name": "PLATO-XL", "code": null}, {"id": 369140, "name": "TrOCR", "code": null}, {"id": 371948, "name": "Turing ULRv5", "code": null}, {"id": 257092, "name": "Megatron-Turing NLG 530B", "code": null}, {"id": 257093, "name": "Yuan 1.0", "code": null}, {"id": 368036, "name": "base LM+GNN+kNN", "code": null}, {"id": 369012, "name": "Eve", "code": null}, {"id": 368092, "name": "S4", "code": null}, {"id": 368135, "name": "CodeT5-base", "code": null}, {"id": 370718, "name": "Masked Autoencoders ViT-H", "code": null}, {"id": 368334, "name": "ViT-G/14 (LiT)", "code": null}, {"id": 371330, "name": "Swin Transformer V2 (SwinV2-G)", "code": null}, {"id": 368364, "name": "BASIC-L", "code": null}, {"id": 35176, "name": "Florence", "code": null}, {"id": 306174, "name": "N\u00dcWA", "code": null}, {"id": 371521, "name": "T-NLRv5 XXL", "code": null}, {"id": 367255, "name": "Gopher (280B)", "code": null}, {"id": 368065, "name": "GLaM", "code": null}, {"id": 369203, "name": "LongT5", "code": null}, {"id": 368725, "name": "Contriever", "code": null}, {"id": 368722, "name": "LDM-1.45B", "code": null}, {"id": 369192, "name": "XGLM-7.5B", "code": null}, {"id": 368055, "name": "ERNIE 3.0 Titan", "code": null}, {"id": 368093, "name": "ERNIE-ViLG", "code": null}, {"id": 369202, "name": "Detic", "code": null}, {"id": 306179, "name": "data2vec (language)", "code": null}, {"id": 306178, "name": "data2vec (speech)", "code": null}, {"id": 306177, "name": "data2vec (vision)", "code": null}, {"id": 370245, "name": "AbLang (heavy sequences)", "code": null}, {"id": 369013, "name": "OntoProtein", "code": null}, {"id": 371250, "name": "InstructGPT 1.3B", "code": null}, {"id": 371237, "name": "InstructGPT 175B", "code": null}, {"id": 371233, "name": "InstructGPT 6B", "code": null}, {"id": 306182, "name": "AlphaCode", "code": null}, {"id": 306180, "name": "RETRO-7B", "code": null}, {"id": 372324, "name": "MaskGIT (ImageNet)", "code": null}, {"id": 257096, "name": "GPT-NeoX-20B", "code": null}, {"id": 257095, "name": "LaMDA", "code": null}, {"id": 368350, "name": "ProteinBERT", "code": null}, {"id": 368348, "name": "ST-MoE", "code": null}, {"id": 368367, "name": "PolyCoder", "code": null}, {"id": 306184, "name": "DeepNet", "code": null}, {"id": 306183, "name": "Statement Curriculum Learning", "code": null}, {"id": 368361, "name": "ViT-G (model soup)", "code": null}, {"id": 368108, "name": "Segatron-XL large, M=384 + HCP", "code": null}, {"id": 371963, "name": "Make-A-Scene", "code": null}, {"id": 273166, "name": "Chinchilla", "code": null}, {"id": 273167, "name": "PaLM (540B)", "code": null}, {"id": 306186, "name": "DALL\u00b7E 2", "code": null}, {"id": 368730, "name": "BERT-RBP", "code": null}, {"id": 343968, "name": "Stable Diffusion (LDM-KL-8-G)", "code": null}, {"id": 268378, "name": "Sparse all-MLP", "code": null}, {"id": 306187, "name": "Flamingo", "code": null}, {"id": 306188, "name": "OPT-175B", "code": null}, {"id": 370974, "name": "DeBERTaV3large + KEAR", "code": null}, {"id": 306190, "name": "UL2", "code": null}, {"id": 306191, "name": "Gato", "code": null}, {"id": 306192, "name": "Imagen", "code": null}, {"id": 371857, "name": "GPT-2 Medium (FlashAttention)", "code": null}, {"id": 368110, "name": "Tranception", "code": null}, {"id": 366986, "name": "CogVideo", "code": null}, {"id": 368096, "name": "DITTO", "code": null}, {"id": 368373, "name": "CoCa", "code": null}, {"id": 306194, "name": "Parti", "code": null}, {"id": 369037, "name": "ProGen2-xlarge", "code": null}, {"id": 306195, "name": "Minerva (540B)", "code": null}, {"id": 368133, "name": "CodeT5-large", "code": null}, {"id": 306196, "name": "NLLB", "code": null}, {"id": 368746, "name": "BLOOM-176B", "code": null}, {"id": 369022, "name": "ESM2-15B", "code": null}, {"id": 369033, "name": "OmegaPLM", "code": null}, {"id": 354863, "name": "AlexaTM 20B", "code": null}, {"id": 365992, "name": "GLM-130B", "code": null}, {"id": 368716, "name": "BlenderBot 3", "code": null}, {"id": 369347, "name": "BEIT-3", "code": null}, {"id": 368374, "name": "PaLI", "code": null}, {"id": 349174, "name": "Whisper", "code": null}, {"id": 369185, "name": "DiffDock", "code": null}, {"id": 368017, "name": "Phenaki", "code": null}, {"id": 369025, "name": "GenSLM", "code": null}, {"id": 369171, "name": "Flan-PaLM 540B", "code": null}, {"id": 369006, "name": "LMSI-Palm", "code": null}, {"id": 368739, "name": "U-PaLM (540B)", "code": null}, {"id": 369151, "name": "eDiff-I", "code": null}, {"id": 368006, "name": "Mogrifier RLSTM (WT2)", "code": null}, {"id": 369155, "name": "mT0-13B", "code": null}, {"id": 369198, "name": "InternImage", "code": null}, {"id": 369545, "name": "EVA-01", "code": null}, {"id": 368109, "name": "Galactica", "code": null}, {"id": 368735, "name": "Fusion in Encoder", "code": null}, {"id": 368082, "name": "AR-LDM", "code": null}, {"id": 367574, "name": "ALM 1.0", "code": null}, {"id": 371482, "name": "Vega v2", "code": null}, {"id": 369026, "name": "RT-1", "code": null}, {"id": 368726, "name": "CaLM", "code": null}, {"id": 368002, "name": "Hybrid H3-2.7B", "code": null}, {"id": 369034, "name": "VALL-E", "code": null}, {"id": 371511, "name": "DreamerV3", "code": null}, {"id": 369009, "name": "Nucleotide Transformer", "code": null}, {"id": 368732, "name": "Ankh_large", "code": null}, {"id": 369200, "name": "MusicLM", "code": null}, {"id": 368354, "name": "DDPM-IP (CelebA)", "code": null}, {"id": 369196, "name": "BLIP-2 (Q-Former)", "code": null}, {"id": 369193, "name": "ViT-22B", "code": null}, {"id": 368340, "name": "BASIC-L + Lion", "code": null}, {"id": 367499, "name": "LLaMA-65B", "code": null}, {"id": 368750, "name": "DiT-XL/2", "code": null}, {"id": 369016, "name": "AudioGen", "code": null}, {"id": 368353, "name": "PaLM-E", "code": null}, {"id": 367636, "name": "Falcon-40B", "code": null}, {"id": 372333, "name": "GPT-4 (Jun 2023)", "code": null}, {"id": 372308, "name": "GPT-4 (Mar 2023)", "code": null}, {"id": 368999, "name": "LEP-AD", "code": null}, {"id": 368069, "name": "PanGu-\u03a3", "code": null}, {"id": 371534, "name": "SigLIP 400M", "code": null}, {"id": 369142, "name": "VideoMAE V2", "code": null}, {"id": 367081, "name": "BloombergGPT", "code": null}, {"id": 369040, "name": "Segment Anything Model", "code": null}, {"id": 368359, "name": "Incoder-6.7B", "code": null}, {"id": 369173, "name": "DINOv2", "code": null}, {"id": 369186, "name": "LLaVA", "code": null}, {"id": 368742, "name": "ImageBind", "code": null}, {"id": 368130, "name": "StarCoder", "code": null}, {"id": 365387, "name": "PaLM 2", "code": null}, {"id": 369334, "name": "InstructBLIP", "code": null}, {"id": 371486, "name": "Med-PaLM 2", "code": null}, {"id": 369165, "name": "CoEdiT-xxl", "code": null}, {"id": 366449, "name": "ONE-PEACE", "code": null}, {"id": 368125, "name": "CodeT5+", "code": null}, {"id": 368721, "name": "Goat-7B", "code": null}, {"id": 372335, "name": "DPO on Pythia-2.8B", "code": null}, {"id": 368325, "name": "PaLI-X", "code": null}, {"id": 368037, "name": "MusicGen", "code": null}, {"id": 369965, "name": "GPT-3.5 Turbo", "code": null}, {"id": 369030, "name": "HyenaDNA", "code": null}, {"id": 370971, "name": "Stable Diffusion XL (SDXL)", "code": null}, {"id": 369204, "name": "Pangu-Weather", "code": null}, {"id": 367509, "name": "InternLM", "code": null}, {"id": 369010, "name": "xTrimoPGLM -100B", "code": null}, {"id": 371811, "name": "GPT3-2.7B (FlashAttention-2)", "code": null}, {"id": 368743, "name": "Llama 2-70B", "code": null}, {"id": 368747, "name": "Llama 2-7B", "code": null}, {"id": 369002, "name": "AudioLM", "code": null}, {"id": 369038, "name": "RT-2", "code": null}, {"id": 369167, "name": "Qwen-VL", "code": null}, {"id": 367533, "name": "Jais", "code": null}, {"id": 367528, "name": "Swift", "code": null}, {"id": 369508, "name": "Falcon-180B", "code": null}, {"id": 367633, "name": "Robot Parkour", "code": null}, {"id": 368737, "name": "AlphaMissense", "code": null}, {"id": 370131, "name": "Amazon Titan", "code": null}, {"id": 372633, "name": "GPT-3.5 Turbo Instruct", "code": null}, {"id": 368729, "name": "FinGPT-13B", "code": null}, {"id": 368043, "name": "Ferret (13B)", "code": null}, {"id": 369523, "name": "RT-2-X", "code": null}, {"id": 371940, "name": "PaLI-3", "code": null}, {"id": 368129, "name": "CODEFUSION (Python)", "code": null}, {"id": 368744, "name": "DiT-XL/2 + CADS", "code": null}, {"id": 369998, "name": "ChatGLM3-6B", "code": null}, {"id": 368349, "name": "Skywork-13B", "code": null}, {"id": 368740, "name": "Yi-34B", "code": null}, {"id": 368337, "name": "BLUUMI", "code": null}, {"id": 368376, "name": "Grok-1", "code": null}, {"id": 369331, "name": "LLaVA 1.5", "code": null}, {"id": 370174, "name": "CogVLM-17B", "code": null}, {"id": 369138, "name": "mPLUG-Owl2", "code": null}, {"id": 372317, "name": "RoFormer", "code": null}, {"id": 369144, "name": "SPHINX (Llama 2 13B)", "code": null}, {"id": 369349, "name": "Volcano 13B", "code": null}, {"id": 369180, "name": "Qwen-Audio-Chat", "code": null}, {"id": 369019, "name": "Nemotron-3-8B", "code": null}, {"id": 370977, "name": "GNoME for crystal discovery", "code": null}, {"id": 369147, "name": "PPLX-70B-Online", "code": null}, {"id": 368736, "name": "Qwen-72B", "code": null}, {"id": 369011, "name": "Mamba-24M (SC09)", "code": null}, {"id": 369174, "name": "Llama Guard", "code": null}, {"id": 369169, "name": "SeamlessM4T", "code": null}, {"id": 369027, "name": "Mixtral 8x7B", "code": null}, {"id": 371938, "name": "W.A.L.T", "code": null}, {"id": 371814, "name": "VILA-13B", "code": null}, {"id": 369518, "name": "CogAgent", "code": null}, {"id": 369532, "name": "FunSearch", "code": null}, {"id": 369156, "name": "Gemini Nano-1", "code": null}, {"id": 369148, "name": "Gemini Nano-2", "code": null}, {"id": 371483, "name": "nekomata-14b", "code": null}, {"id": 372331, "name": "GQA-8-XXL", "code": null}, {"id": 369332, "name": "CoRe", "code": null}, {"id": 371245, "name": "Palmyra X 003", "code": null}, {"id": 369161, "name": "AlphaGeometry", "code": null}, {"id": 369160, "name": "Qwen-VL-Max", "code": null}, {"id": 371236, "name": "Qwen1.5-72B", "code": null}, {"id": 369170, "name": "Aya", "code": null}, {"id": 371971, "name": "Stable Diffusion 3", "code": null}, {"id": 369522, "name": "MegaScale (Production)", "code": null}, {"id": 371244, "name": "Aramco Metabrain AI", "code": null}, {"id": 369524, "name": "MM1-30B", "code": null}, {"id": 369883, "name": "DBRX", "code": null}, {"id": 369542, "name": "ReALM", "code": null}, {"id": 370170, "name": "Reka Core", "code": null}, {"id": 369516, "name": "Llama 3-70B", "code": null}, {"id": 371528, "name": "VILA1.5-13B", "code": null}, {"id": 370141, "name": "Yi-Large", "code": null}, {"id": 371524, "name": "Octo-Base", "code": null}, {"id": 371545, "name": "ALLaM\u00a0adapted 70B", "code": null}, {"id": 370106, "name": "Qwen2-72B", "code": null}, {"id": 370246, "name": "OpenVLA", "code": null}, {"id": 369982, "name": "Nemotron-4 340B", "code": null}, {"id": 370239, "name": "DeepSeek-Coder-V2 236B", "code": null}, {"id": 371516, "name": "Cambrian-1-34B", "code": null}, {"id": 369981, "name": "ESM3 (98B)", "code": null}, {"id": 371845, "name": "SenseChat 5.5", "code": null}, {"id": 371531, "name": "Mathstral", "code": null}, {"id": 370155, "name": "Llama 3.1-405B", "code": null}, {"id": 370085, "name": "Mistral Large 2", "code": null}, {"id": 370093, "name": "AFM-on-device", "code": null}, {"id": 371536, "name": "LLaVA-OV-72B", "code": null}, {"id": 370521, "name": "Table Tennis Agent", "code": null}, {"id": 370975, "name": "Jamba 1.5-Large", "code": null}, {"id": 370561, "name": "DeepSeek-V2.5", "code": null}, {"id": 371474, "name": "Qwen2.5-32B", "code": null}, {"id": 371515, "name": "Oryx 34B", "code": null}, {"id": 371475, "name": "Qwen2.5 Instruct (72B)", "code": null}, {"id": 370534, "name": "Qwen2.5-72B", "code": null}, {"id": 371840, "name": "Telechat2-115B", "code": null}, {"id": 371540, "name": "Llama 3.2 11B", "code": null}, {"id": 371329, "name": "Movie Gen Video", "code": null}, {"id": 371484, "name": "GR-2", "code": null}, {"id": 371235, "name": "Palmyra X 004", "code": null}, {"id": 371653, "name": "RDT-1B", "code": null}, {"id": 371240, "name": "NVLM-D 72B", "code": null}, {"id": 371234, "name": "NVLM-H 72B", "code": null}, {"id": 371232, "name": "NVLM-X 72B", "code": null}, {"id": 371248, "name": "Doubao-pro", "code": null}, {"id": 371242, "name": "Hunyuan-Large", "code": null}, {"id": 370973, "name": "Pixtral Large", "code": null}, {"id": 371473, "name": "Fugatto 1", "code": null}, {"id": 371477, "name": "Infinity", "code": null}, {"id": 371967, "name": "NVILA 15B", "code": null}, {"id": 371535, "name": "Llama 3.3 70B", "code": null}, {"id": 371364, "name": "EXAONE 3.5 32B", "code": null}, {"id": 371937, "name": "Apollo 7B", "code": null}, {"id": 371328, "name": "DeepSeek-V3", "code": null}, {"id": 371974, "name": "STORM-B/8", "code": null}, {"id": 371509, "name": "INTELLECT-MATH", "code": null}, {"id": 371370, "name": "DeepSeek-R1", "code": null}, {"id": 371654, "name": "Eagle 2", "code": null}, {"id": 371539, "name": "Eurus-2-7B-PRIME", "code": null}, {"id": 371865, "name": "Grok 3", "code": null}, {"id": 371365, "name": "QwQ-32B", "code": null}, {"id": 371513, "name": "Hunyuan-TurboS", "code": null}, {"id": 371821, "name": "ERNIE-4.5-VL-424B-A47B (\u6587\u5fc3\u5927\u6a21\u578b4.5)", "code": null}, {"id": 371472, "name": "EXAONE Deep 32B", "code": null}, {"id": 371941, "name": "Diffusion Renderer", "code": null}, {"id": 372634, "name": "DeepSeek-V3 (Mar 2025)", "code": null}, {"id": 371514, "name": "Llama 4 Behemoth (preview)", "code": null}, {"id": 371860, "name": "Llama 4 Maverick", "code": null}, {"id": 371843, "name": "Llama 4 Scout", "code": null}, {"id": 371529, "name": "Pangu Ultra", "code": null}, {"id": 371991, "name": "Qwen3-235B-A22B", "code": null}, {"id": 372346, "name": "DeepSeek-R1 (May 2025)", "code": null}, {"id": 372162, "name": "Qwen3 Embedding", "code": null}, {"id": 372161, "name": "Seed-1.6-Thinking", "code": null}, {"id": 371850, "name": "FGN", "code": null}, {"id": 371886, "name": "EXAONE Path 2.0", "code": null}, {"id": 371820, "name": "Grok 4", "code": null}, {"id": 371815, "name": "Kimi K2", "code": null}, {"id": 371817, "name": "EXAONE 4.0 (32B)", "code": null}, {"id": 371987, "name": "Qwen3-Coder-480B-A35B", "code": null}, {"id": 372632, "name": "Qwen3-235B-A22B (Jul 2025)", "code": null}, {"id": 372358, "name": "Qwen3-235B-A22B-Thinking (Jul 2025)", "code": null}, {"id": 371984, "name": "MindLink-72B", "code": null}, {"id": 372325, "name": "Hierarchical Reasoning Model (HPM)", "code": null}, {"id": 371989, "name": "Qwen Image", "code": null}, {"id": 372366, "name": "GLM-4.5", "code": null}, {"id": 371847, "name": "gpt-oss-120b", "code": null}, {"id": 371855, "name": "gpt-oss-20b", "code": null}, {"id": 372167, "name": "LongCat-Flash", "code": null}, {"id": 371992, "name": "Qwen3-Max", "code": null}, {"id": 372330, "name": "AgentFounder-30B", "code": null}, {"id": 372169, "name": "Qwen3-Omni-30B-A3B", "code": null}, {"id": 372365, "name": "GLM-4.6", "code": null}, {"id": 372318, "name": "Ling-1T", "code": null}, {"id": 372313, "name": "MiniMax-M2", "code": null}, {"id": 372326, "name": "Tongyi DeepResearch", "code": null}, {"id": 372341, "name": "Kimi K2 Thinking", "code": null}, {"id": 372433, "name": "Olmo 3", "code": null}, {"id": 372436, "name": "DeepSeekMath-V2", "code": null}, {"id": 372369, "name": "GLM-4.7", "code": null}, {"id": 372631, "name": "MiniMax-M2.1", "code": null}, {"id": 372368, "name": "HyperCLOVA X SEED 32B Think", "code": null}, {"id": 372370, "name": "A.X K1", "code": null}, {"id": 372364, "name": "VAETKI\n", "code": null}, {"id": 372363, "name": "K-EXAONE", "code": null}, {"id": 372367, "name": "Solar Open 100B\n", "code": null}]}}, "origins": [{"id": 14136, "title": "Parameter, Compute and Data Trends in Machine Learning", "descriptionSnapshot": "We update this chart with the latest available data from our source every month.\n\nThe authors selected the AI systems for inclusion based on the following necessary criteria:\n\u2014 Have an explicit learning component\n\u2014 Showcase experimental results\n\u2014 Advance the state of the art\n\nIn addition, the systems had to meet at least one of the following notability criteria:\n\u2014 Paper has more than 1000 citations\n\u2014 Historical importance\n\u2014 Important state-of-the-art advance\n\u2014 Deployed in a notable context\n\nThe authors note that: \"For new models (from 2020 onward) it is harder to assess these criteria, so we fall back to a subjective selection. We refer to models meeting our selection criteria as 'milestone models.\"\n", "producer": "Epoch AI", "citationFull": "Epoch AI, \u2018Parameter, Compute and Data Trends in Machine Learning\u2019. Published online at epochai.org. Retrieved from: \u2018https://epoch.ai/data/epochdb/visualization\u2019 [online resource]", "urlMain": "https://epoch.ai/mlinputs/visualization", "urlDownload": "https://epoch.ai/data/epochdb/notable_ai_models.csv", "dateAccessed": "2026-03-07", "datePublished": "2025", "license": {"url": "https://creativecommons.org/licenses/by/4.0/", "name": "CC BY 4.0"}}]}