- Oracle claims its Zettascale10 system can hit 16 zettaFLOPS peak
- The mission makes use of round 800,000 Nvidia GPUs unfold throughout knowledge facilities
- OpenAI’s Stargate cluster in Texas runs on Oracle’s new infrastructure
Oracle has introduced what it calls the biggest AI supercomputer within the cloud, the OCI Zettascale10.
The corporate claims the system can ship 16 zettaFLOPS of peak efficiency throughout 800,000 Nvidia GPUs.
That output, when divided, equals about 20 petaflops per GPU, roughly matching the Grace Blackwell GB300 Extremely chip utilized in high-end desktop AI methods.
Community design for large-scale AI workloads
Oracle says the platform is the muse for OpenAI’s Stargate cluster in Abilene, Texas, constructed to deal with a number of the most demanding AI workloads now rising in analysis and industrial use.
“The extremely scalable customized RoCE design maximizes fabric-wide efficiency at gigawatt scale whereas retaining many of the energy centered on compute…,” mentioned Peter Hoeschele, vice chairman, Infrastructure and Industrial Compute, OpenAI.
On the core of the Zettascale10 system is Oracle Acceleron RoCE networking, designed to extend scalability and reliability for data-heavy AI operations.
This structure makes use of community interface playing cards as mini switches, linking GPUs throughout a number of remoted community planes.
The design goals to cut back latency between GPUs and permit jobs to proceed operating if one community path fails.
“That includes Nvidia full-stack AI infrastructure, OCI Zettascale10 supplies the compute cloth wanted to advance state-of-the-art AI analysis and assist organizations in every single place transfer from experimentation to industrialized AI,” mentioned Ian Buck, vice chairman of Hyperscale, Nvidia.
Oracle claims this construction can decrease prices by simplifying tiers inside the community whereas sustaining constant efficiency throughout nodes.
It additionally introduces Linear Pluggable and Receiver Optics to cut back vitality and cooling use with out slicing bandwidth.
Though Oracle’s figures are spectacular, the corporate has not supplied unbiased verification of its 16 zettaFLOPS declare.
Cloud efficiency metrics can fluctuate relying on how throughput is calculated, and Oracle’s comparability could depend on theoretical peaks fairly than sustained charges.
Provided that the system’s marketed complete equals the sum of 800,000 top-end GPUs, real-world effectivity might rely closely on community design and software program optimization.
Analysts could wait to see whether or not the configuration delivers efficiency corresponding to main AI clusters already run by different main cloud suppliers.
The Zettascale10 positions Oracle alongside different main gamers racing to supply the infrastructure behind the most effective GPUs and AI instruments.
The corporate says prospects might practice and deploy giant fashions throughout Oracle’s distributed cloud atmosphere, supported by knowledge sovereignty measures.
Oracle additionally says Zettascale10 gives operational flexibility by unbiased plane-level upkeep, permitting updates with much less downtime.
“With OCI Zettascale10, we’re fusing OCI’s Oracle Acceleron RoCE community structure with next-generation Nvidia AI infrastructure to ship multi-gigawatt AI capability at unmatched scale,” mentioned Mahesh Thiagarajan, government vice chairman, Oracle Cloud Infrastructure.
“Prospects can construct, practice, and deploy their largest AI fashions into manufacturing utilizing much less energy and can have the liberty to function throughout Oracle’s distributed cloud with sturdy knowledge and AI sovereignty…”
Nonetheless, observers observe that different suppliers are constructing their very own large-scale GPU clusters and superior cloud storage methods, which might slim Oracle’s benefit.
This technique will roll out subsequent 12 months, and solely then will it’s clear whether or not the structure can meet demand for scalable, environment friendly, and dependable AI computation.
By way of HPCWire
Observe TechRadar on Google Information and add us as a most popular supply to get our skilled information, evaluations, and opinion in your feeds. Ensure to click on the Observe button!
And naturally you can even observe TechRadar on TikTok for information, evaluations, unboxings in video type, and get common updates from us on WhatsApp too.
You may also like