New A100-powered Amazon EC2 P4d instance available as NVIDIA GPUs reach 10 years on AWS.
Amazon Web Services’ first GPU instance debuted 10 years ago, with the NVIDIA M2050. At that time, CUDA-based applications were focused primarily on accelerating scientific simulations, with the rise of AI and deep learning still a ways off.
Since then, AWS has expanded its stable of cloud GPU instances to include the K80 (p2), K520 (g2), M60 (g3), V100 (p3/p3dn) and T4 (g4).
The P4d instance is AWS’s highest-performance, most cost-effective GPU-based platform for machine learning and high-performance computing applications. Compared with the default FP32 precision, the instances reduce the time to train machine learning models by up to 3x with FP16 and up to 6x with TF32.
They also provide exceptional inference performance. Just last month, NVIDIA A100 GPUs swept the MLPerf Inference benchmarks, offering up to 237x