Start growing your AI compute capacity in the cloud or in your datacenter with IPU-POD128.
IPU-POD128 is designed for straightforward deployment and integrates with standard datacenter infrastructure, including VMware virtualization and OpenStack. Slurm and Kubernetes support makes it simple to automate application deployment, scaling, and management. Virtual-IPU™ technology provides secure multi-tenancy. Developers can build model replicas within and across multiple IPU-PODs, and provision IPUs across many IPU-PODs for very large models, as sketched below.
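As an illustration of model replication, here is a minimal PopTorch sketch (the Poplar SDK's PyTorch front end) running a toy model data-parallel across IPUs. The model, layer sizes, batch size, and replication factor are placeholders, and details may vary between SDK releases.

```python
import torch
import poptorch  # Poplar SDK's PyTorch interface

# Toy classifier returning (output, loss), as PopTorch expects for training.
# All sizes and hyperparameters here are illustrative only.
class ToyClassifier(torch.nn.Module):
    def __init__(self):
        super().__init__()
        self.fc = torch.nn.Linear(128, 10)
        self.loss = torch.nn.CrossEntropyLoss()

    def forward(self, x, labels):
        out = self.fc(x)
        return out, self.loss(out, labels)

opts = poptorch.Options()
opts.replicationFactor(8)  # data-parallel replicas; scales toward 128 on a full IPU-POD128

model = ToyClassifier()
optimizer = torch.optim.SGD(model.parameters(), lr=0.01)
training_model = poptorch.trainingModel(model, options=opts, optimizer=optimizer)

# Global batch = micro-batch (2) x replication factor (8).
x = torch.randn(16, 128)
labels = torch.randint(0, 10, (16,))
output, loss = training_model(x, labels)
```

At the system level, a Virtual-IPU partition determines which IPUs a job can see; the replication factor then spreads replicas across that allocation.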
IPU-POD128 delivers world-class results, whether you want to explore innovative models and new possibilities, achieve faster time to train, higher throughput, or better performance per TCO dollar.
| Specification | IPU-POD128 |
| --- | --- |
| IPUs | 128x GC200 IPUs |
| IPU-M2000s | 32x IPU-M2000s |
| Memory | 115.2GB In-Processor-Memory™ and up to 8.2TB Streaming Memory |
| Performance | 32 petaFLOPS FP16.16, 8 petaFLOPS FP32 |
| IPU Cores | 188,416 |
| Threads | 1,130,496 |
| IPU-Fabric | 2.8Tbps |
| Host-Link | 100 GE RoCEv2 |
| Software | Poplar; TensorFlow, PyTorch, PyTorch Lightning, Keras, PaddlePaddle, Hugging Face, ONNX, HALO; OpenBMC, Redfish DMTF, IPMI over LAN, Prometheus, and Grafana; Slurm, Kubernetes; OpenStack, VMware ESXi |
| System Weight | 900kg + host servers and switches |
| System Dimensions | 32U + host servers and switches |
| Host Server | Selection of approved host servers from Graphcore partners |
| Thermal | Air-cooled |
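The aggregate figures above follow directly from Graphcore's published per-IPU GC200 specification; a quick arithmetic sanity check:

```python
# Per-IPU figures for the Colossus MK2 GC200 (Graphcore published specs).
GC200_CORES = 1_472            # processor cores per IPU
GC200_THREADS = 6 * 1_472      # 6 hardware threads per core = 8,832 threads
GC200_FP16_TFLOPS = 250        # peak FP16.16 throughput per IPU
GC200_MEMORY_GB = 0.9          # 900MB In-Processor-Memory per IPU

ipus = 128
print(ipus * GC200_CORES)              # 188,416 IPU cores
print(ipus * GC200_THREADS)            # 1,130,496 threads
print(ipus * GC200_FP16_TFLOPS / 1e3)  # 32.0 petaFLOPS FP16.16
print(ipus * GC200_MEMORY_GB)          # 115.2 GB In-Processor-Memory
```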
For more performance results, visit our Performance Results page