NVIDIA Quadro GV100
Unmatched Power. Unmatched Creative Freedom.
AI, photo realistic rendering, simulation, and VR are transforming professional workflows. Engineers can now create groundbreaking products faster. Architects can design buildings that could only have existed in their imaginations. And artists can render complex photorealistic scenes in seconds instead of hours. As applications continue to be enhanced with these technologies, professional computing tools need to keep pace.
The NVIDIA® Quadro® GV100 is reinventing the workstation to meet the demands of these next-generation workflows. It’s powered by NVDIA Quadro Volta, delivering the extreme memory capacity, scalability, and performance that designers, architects, and scientists need to create, build, and solve the impossible.
Based on a state-of-the-art 12nm FFN (FinFET NVIDIA) high-performance manufacturing process customized for NVIDIA to incorporate 5120 CUDA cores, the NVIDIA Quadro GV100 GPU is the most powerful computing platform for HPC, AI, VR and graphics workloads on professional desktops. Able to deliver more than 7.4 TFLOPS of double-precision (FP64), 14.8 TFLOPS of single-precision (FP32), 29.6 TFLOPS of half-precision (FP16), 59.3 TOPS of integer-precision (INT8), and 118.5 TFLOPs of tensor operation capability, it supports a wide range of compute-intensive workloads flawlessly.
New mixed-precision Tensor Cores purpose-built for deep learning matrix arithmetic, deliver an 8x boost in TFLOPS performance for training, compared to the previous generation. NVIDIA Quadro GV100 utilizes 640 Tensor Cores; each Tensor Core performs 64 floating point fused multiply-add (FMA) operations per clock, and each SM performs a total of 1024 individual floating point operations per clock.

Highlights
CUDA Cores | 5120 |
Tensor Cores | 640 |
Peak Double Precision FP64 Performance | 7.4 TFLOPS |
Peak Single Precision FP32 Performance | 14.8 TFLOPS |
Peak Half Precision FP16 Performance | 29.6 TFLOPS |
Peak Integer Operation (INT8) Performance | 59.3 TOPS |
Deep Learning TFLOPS | 118.5 TFLOPS |
GPU Memory | 32 GB HBM2 |
Memory Interface | 4096-bit |
Memory Bandwidth | 870 GB/s |
System Interface | PCI Express 3.0 x16 |
Display Connectors | DP 1.4 (4) |
Supercharge Rendering with AI
- Work with full fidelity, massive datasets
- Enjoy fluid visual interactivity with Ai-accelerated denoising
Bring Optimal Designs to Market Faster
- Work with higher fidelity CAE simulation models
- Explore more design options with faster solver performance
Enjoy Ultimate Immersive Experiences
- Work with complex, photorealistic datasets in VR
- Enjoy an optimal NVIDIA Holodeck experience
Realize New Opportunities with AI
- Access DL frameworks for AI development via NVIDIA NGC
- Accelerate AI training/inferencing with Tensor Cores and NVLink