NVIDIA L40S Dedicated Servers

The NVIDIA L40S is perfect for AI training, graphics rendering, video transcoding and virtualization.

Global locations

24/7 support

Low prices

5 minutes deployment

Deliver multi-workload acceleration for large language model inference and training, graphics and video applications through the Ada Lovelace architecture.

NVIDIA L40S specs:
Video memory capacity	48GB GDDR6 with ECC
CUDA cores	18 176 pcs.
Max Bandwidth	864 GB/s
The L40S GPU delivers 1466 TFLOPS in Tensor performance, 212 TFLOPS in RT core performance and 91.6 TFLOPS in Single-precision performance.

Experience the power of NVIDIA L40S GPU bare metal servers

AI Model Training

The L40S GPU accelerates AI model training through the utilization of structural sparsity and the optimized TF32 format.

Ray-Tracing

The L40S enhances ray tracing performance, speeding up renders for design and engineering workflows.

Security

The L40S GPU ensures 24/7 readiness, meets NEBS Level 3 standards, and enhances data center security with secure boot technology.

DLSS 3

Enhanced rendering and frame rates through DLSS 3, leveraging deep learning innovations for higher FPS and reduced latency.


Loading...


Loading...

LLM Training and Inference

L40S leverages fourth-generation Tensor Cores with FP8 support, providing exceptional computing performance for accelerated training and inference of advanced LLM and Generative AI models.

Rendering and 3D graphics

3D workloads are enhanced with the NVIDIA L40S for faster rendering and increased productivity. Work in real-time on intricate designs with high-resolution textures, powering high-fidelity creative workflows.

Streaming and Video content

The NVIDIA L40S GPU boosts streaming and video workloads with three video encoding and decoding engines. Featuring AV1 encoding, it achieves breakthrough performance and improved TCO for streaming, video production and transcription workflows.

Accelerate the most demanding workloads with a dedicated server powered by the NVIDIA L40S GPU.

NVIDIA L40S Specifications:
FP32	4891.6 teraFLOPS
FP32 Tensor Core	366 teraFLOPS*
FP16	733 teraFLOPS*
FP8	1,466 teraFLOPS*
RT Core Performance	212 teraFLOPS*
Max Power Consumption	350 W

Quick Component Replacement

Deploy your website or application on a GPU dedicated server, backed by a 4 hour part replacement guarantee SLA.

Network Monitoring

Deploy your bare metal cloud servers instantly on a custom-built global network monitored 24/7 for optimal uptime and security.

Support

Expert support is standing by day or night via chat and email.

Instant Nvidia L40S dedicated servers

Custom Nvidia L40S configurations