How Meta trains large language models at scale
Engineering at Meta
JUNE 12, 2024
Data center deployment Once we’ve chosen a GPU and system, the task of placing them in a data center for optimal usage of resources (power, cooling, networking, etc.) There are two leading choices in the industry that fit these requirements: RoCE and InfiniBand fabrics. Both of these options had tradeoffs.
Let's personalize your content