Experience consistent excellent AI and Machine Learning performance with the revolutionary NVIDIA L40S GPU, perfect for AI training, graphics rendering, video transcoding and virtualization.
Deliver multi-workload acceleration for large language model inference and training, graphics and video applications through the Ada Lovelace architecture.
The L40S GPU delivers exceptional performance with 1466 TFLOPS in Tensor operations, 212 TFLOPS in RT core performance, and 91.6 TFLOPS in Single-precision performance.
Ada Lovelace
48GB GDDR6 with ECC
18,176 pcs.
864 GB/s
350 W
Advanced tensor processing and ray tracing capabilities optimized for AI workloads and high-fidelity rendering tasks.
91.6 teraFLOPS
733 teraFLOPS
1,466 teraFLOPS
212 teraFLOPS
Enterprise-grade NVIDIA L40S GPU servers built on Ada Lovelace architecture, delivering exceptional performance for AI model training, 3D rendering, and video production workflows.
Compare performance metrics and pricing across NVIDIA GPU options to find the best fit for your AI and graphics workloads.
| L40S | A100 | H100 | |
|---|---|---|---|
| Architecture | Ada Lovelace | NVIDIA Ampere | Hopper |
| Memory | 48GB GDDR6 | 80GB HBM2e | 80GB HBM3 |
| Memory Bandwidth | 864 GB/s | 2039 GB/s | 3352 GB/s |
| FP32 | 91.6 TFLOPS | 19.5 TFLOPS | 66.9 TFLOPS |
| TF32 Tensor Core | 366 TFLOPS | 312 TFLOPS | 989 TFLOPS |
| FP16/BF16 Tensor Core | 733 TFLOPS | 624 TFLOPS | 1979 TFLOPS |
| Power | Up to 350W | Up to 400W | Up to 700W |
| Loading... | Loading... | Loading... |
Common questions about deploying and managing your NVIDIA L40S GPU-accelerated servers for AI, rendering, and video production workloads.