The NVIDIA L40S is perfect for AI training, graphics rendering, video transcoding and virtualization.
Global locations
24/7 support
Low prices
5 minutes deployment
Deliver multi-workload acceleration for large language model inference and training, graphics and video applications through the Ada Lovelace architecture.
NVIDIA L40S specs: | |
---|---|
Video memory capacity | 48GB GDDR6 with ECC |
CUDA cores | 18 176 pcs. |
Max Bandwidth | 864 GB/s |
The L40S GPU delivers 1466 TFLOPS in Tensor performance, 212 TFLOPS in RT core performance and 91.6 TFLOPS in Single-precision performance. |
Experience the power of NVIDIA L40S GPU bare metal servers
AI Model Training
The L40S GPU accelerates AI model training through the utilization of structural sparsity and the optimized TF32 format.
Ray-Tracing
The L40S enhances ray tracing performance, speeding up renders for design and engineering workflows.
Security
The L40S GPU ensures 24/7 readiness, meets NEBS Level 3 standards, and enhances data center security with secure boot technology.
DLSS 3
Enhanced rendering and frame rates through DLSS 3, leveraging deep learning innovations for higher FPS and reduced latency.
Loading... |
Loading... |
LLM Training and Inference
L40S leverages fourth-generation Tensor Cores with FP8 support, providing exceptional computing performance for accelerated training and inference of advanced LLM and Generative AI models.
Rendering and 3D graphics
3D workloads are enhanced with the NVIDIA L40S for faster rendering and increased productivity. Work in real-time on intricate designs with high-resolution textures, powering high-fidelity creative workflows.
Streaming and Video content
The NVIDIA L40S GPU boosts streaming and video workloads with three video encoding and decoding engines. Featuring AV1 encoding, it achieves breakthrough performance and improved TCO for streaming, video production and transcription workflows.
Accelerate the most demanding workloads with a dedicated server powered by the NVIDIA L40S GPU.
NVIDIA L40S Specifications: | |
---|---|
FP32 | 4891.6 teraFLOPS |
FP32 Tensor Core | 366 teraFLOPS* |
FP16 | 733 teraFLOPS* |
FP8 | 1,466 teraFLOPS* |
RT Core Performance | 212 teraFLOPS* |
Max Power Consumption | 350 W |
Quick Component Replacement
Deploy your website or application on a GPU dedicated server, backed by a 4 hour part replacement guarantee SLA.
Network Monitoring
Deploy your bare metal cloud servers instantly on a custom-built global network monitored 24/7 for optimal uptime and security.
Support
Expert support is standing by day or night via chat and email.
Deploy your L40S GPU dedicated server today!
Get started