“The 7700R4 behaves like a single system, with devoted deep buffers to make sure system-wide lossless transport throughout your complete Ethernet-based AI community,” Hull wrote. “DES is topology agnostic, [Ultra Ethernet Consortium (UEC)] prepared, optimized for each coaching and inference workloads, with a 100% environment friendly structure, and affords the wealthy telemetry and sensible options that the trendy AI Middle wants.”
The UEC was based final 12 months by AMD, Arista, Broadcom, Cisco, Eviden, HPE, Intel, Meta and Microsoft, and it now consists of greater than 75 distributors. The consortium is creating applied sciences geared toward rising the size, stability, and reliability of Ethernet networks to fulfill AI’s high-performance networking necessities. UEC specs will outline quite a lot of scalable Ethernet enhancements, together with higher multi-path and packet supply choices in addition to fashionable congestion and telemetry options.
“Community efficiency and availability play an vital position in extracting the very best efficiency out of our AI coaching clusters. It’s for that cause that we’ve continued to push for disaggregation within the backend community materials for our AI clusters,” in accordance with a Meta blog.