Q3 FY24 Earnings Summary
Modern AI is a Data Center Scale Computing Workload
Data centers are becoming AI factories: Data as input, intelligence as output

[Chart: AI Training Computational Requirements — training compute in petaFLOPs (log scale, 10^2 to 10^10) by model release year, 2011 to 2023. Before Transformers, training compute grew roughly 8x every 2 years; with Transformers, roughly 215x every 2 years. Labeled models span AlexNet, VGG-19, Seq2Seq, ResNet, Inception V3, Xception, DenseNet-201, ResNeXt, Transformer, ELMo, GPT-1, BERT Large, MoCo ResNet50, XLNet, GPT-2, Megatron-LM, Wav2Vec 2.0, Microsoft T-NLG, GPT-3, MT-NLG 530B, Chinchilla, BLOOM, and PaLM.]
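The two growth rates annotated on the chart compound to very different per-year factors. The short Python calculation below, using only the 8x and 215x per-2-year figures stated on the slide, makes the gap explicit.

# Implied per-year growth factors from the slide's per-2-year figures.
pre_transformer_2yr = 8     # ~8x more training compute every 2 years (pre-Transformer era)
transformer_2yr = 215       # ~215x more training compute every 2 years (Transformer era)

pre_per_year = pre_transformer_2yr ** 0.5          # ~2.8x per year
transformer_per_year = transformer_2yr ** 0.5      # ~14.7x per year

print(f"Pre-Transformer era: ~{pre_per_year:.1f}x per year")
print(f"Transformer era:     ~{transformer_per_year:.1f}x per year")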
Large Language Models, based on the Transformer architecture, are among today's most important advanced AI technologies, with up to trillions of parameters that learn from text.

Developing them is an expensive, time-consuming process that demands deep technical expertise, distributed data center-scale infrastructure, and a full-stack accelerated computing approach.
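To make "distributed data center-scale infrastructure" concrete, the sketch below shows the basic data-parallel training pattern that large Transformer runs build on. It is a minimal illustration assuming PyTorch with the NCCL backend and a torchrun launcher; the tiny linear model, synthetic batch, and hyperparameters are placeholders, not NVIDIA's actual training stack.

import os
import torch
import torch.distributed as dist
from torch.nn.parallel import DistributedDataParallel as DDP

def main():
    # One process per GPU; torchrun sets RANK, LOCAL_RANK, and WORLD_SIZE.
    dist.init_process_group(backend="nccl")
    local_rank = int(os.environ["LOCAL_RANK"])
    torch.cuda.set_device(local_rank)

    # Placeholder module standing in for a Transformer language model.
    model = torch.nn.Linear(4096, 4096).cuda(local_rank)
    model = DDP(model, device_ids=[local_rank])

    optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4)

    for step in range(10):
        # Synthetic batch; a real run shards a tokenized text dataset across ranks.
        x = torch.randn(8, 4096, device=f"cuda:{local_rank}")
        loss = model(x).pow(2).mean()
        loss.backward()          # gradients are all-reduced across every GPU in the job
        optimizer.step()
        optimizer.zero_grad()

    dist.destroy_process_group()

if __name__ == "__main__":
    main()

Launched per node with, for example, torchrun --nproc_per_node=8 train.py; giant-scale runs layer tensor and pipeline parallelism on top of this data-parallel pattern.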
Fueling Giant-Scale AI Infrastructure
NVIDIA compute & networking: GPU | DPU | CPU