PitchSend

NVIDIA Investor Presentation Deck

UNIFIED AI TRAINING AND INFERENCE ACCELERATION

[Chart: BERT Training, relative throughput: 1x V100, 6x A100]

[Chart: BERT Inference, relative throughput: 10.6x T4, 1x V100, 7x A100 (7 MIGs)]

BERT Pre-Training Throughput using PyTorch, including (2/3) Phase 1 and (1/3) Phase 2 | Phase 1 Seq Len = 128, Phase 2 Seq Len = 512
V100: DGX-1 Server with 8x V100 using FP32 precision
A100: DGX A100 Server with 8x A100 using TF32 precision

BERT Large Inference
T4, V100: TRT 7.1, Precision = FP16, Batch Size = 256
A100 MIG: Pre-production TRT, Batch Size = 94, Precision = INT8 with Sparsity
