Meta’s open AI hardware vision

Engineering at Meta

Building AI clusters requires more than just GPUs. Networking and bandwidth play an important role in ensuring the clusters’ performance. Our systems consist of a tightly integrated HPC compute system and an isolated high-bandwidth compute network that connects all our GPUs and domain-specific accelerators.

OCP Summit 2024: The open future of networking hardware for AI

Engineering at Meta

DSF-based fabrics allow us to build large, non-blocking fabrics to support high-bandwidth AI clusters. DSF extends our disaggregating network systems to our VoQ-based switched systems that are powered by the open OCP-SAI standard and FBOSS, Meta’s own network operating system for controlling network switches.
