NVIDIA Investor Presentation Deck slide image

NVIDIA Investor Presentation Deck

Throughput (iterations/s) 10³ 10² 10¹ 10⁰ 10-1 1.5 GB matrix 13860 19600 (1 Sockets) (2 Sockets) 27566 (4 Sockets) Example: Jacobi Iteration 39204 55696 78766 (8 Sockets) (16 Sockets) (32 Sockets) Matrix Dimension 111392 (64 Sockets) 400 GB matrix 157532 (128 Sockets) (256 Sockets) 222784 - Legate CPU A-- Legate GPU + Dask Array Tuned = A = np.random.rand (N, N) b np.random.rand (N) = NumPy for i in range x = (b - Intel (ML) NumPy X = np.zeros (b.shape) d np.diag (A) R = A - np.diag (d) - ■-■ CuPy (n): np. dot (R, x) ) / d
View entire presentation