Artificial intelligence for self-driving cars. Predicting our climate’s future. A new drug to treat cancer. Some of the world’s most important challenges need to be solved today, but require tremendous amounts of computing to become reality. Today’s large-scale data center relies on lots of interconnected commodity compute nodes, limiting the performance needed to drive these important workloads.
The NVIDIA® Tesla® The NVIDIA Tesla P100 is the world’s most advanced datacenter accelerator ever built, a brand new GPU architecture to deliver the world’s fastest compute node. Powered by four ground-breaking technologies with discontinuous jumps in performance, Tesla P100 enables lightning-fast nodes to deliver the highest absolute performance for HPC and deep learning workloads with infinite computing needs.
|Product Series||Tesla P100|
|Core Type||NVIDIA CUDA|
|Host Interface||PCI Express 3.0 x16|
|Stream Cores||3584 CUDA Cores|
|NVIDIA NVLink™ Interconnect Bandwidth||160 GB/s|
|PCIe x16 Interconnect Bandwidth||32 GB/s|
|CoWoS HBM2 Stacked Memory Capacity||16 GB|
|CoWoS HBM2 Stacked Memory Bandwidth||720 GB/s|
|Max Memory Bandwidth||720 GB/s|
|Peak Double Precision floating point performance (GFLOP)||5.3 TeraFLOPS|
|Peak Single Precision floating point performance (GFLOP)||10.6 TeraFLOPS|
|Half-Precision Performance||21.2 TeraFLOPS|