article thumbnail

Hedge 244: Networks for AI

Rule 11

Why is InfiniBand so popular for building AI networks? What about Ethernet for AI? Jeff Tantsura joins Tom Ammon and Russ White to discuss networks for AI workloads. link] download What are the requirements for running AI workloads over a data center fabric? What about Ethernet for AI?

article thumbnail

A RoCE network for distributed AI training at scale

Engineering at Meta

AI networks play an important role in interconnecting tens of thousands of GPUs together, forming the foundational infrastructure for training, enabling large models with hundreds of billions of parameters such as LLAMA 3.1 The growing prevalence of AI has introduced a new era of communication demands.

Network 132
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Trending Sources

article thumbnail

HN728: How Drivenets Leverages Ethernet Fabrics For AI Networking (Sponsored)

Packet Pushers

To run AI workloads, a network needs thousands of GPUs and those GPUs must operate in sync. While there are advantages to using Ethernet for AI networking (including engineers well-trained in the protocol and a robust ecosystem), it wasnt designed.

article thumbnail

Hedgehog is the AI network solution builder - plus more

How Funky

If you are actively looking to build out AI network infrastructure and want to utilize white box, cost effective switching, one of the challenges you have is what software you will use to design, deploy, operate and manage those network switches, because doing that by hand via a CLI will not be fun.

article thumbnail

Broadcom's AI Networking Solutions at Networking Field Day 32

How Funky

Broadcom presented at Networking Field Day 32 on July 26, 2023 and they presented on their AI Networking solutions. These are products and architectures that address the needs of those building out AI data center focused networks. Obviously the design will work for regular data center workloads too.

article thumbnail

Network Break 433: NVIDIA Melds Switches, DPUs For AI Networking Fabric; FTC Says Amazon Ring Employee Spied On Female Customers

Packet Pushers

This week's Network Break discusses a new Google offering to interconnect public clouds, NVIDIA's platform for AI networking fabrics using Ethernet switches and DPUs, and Cisco's latest security acquisition.

article thumbnail

NB436: Cisco AI Silicon, DEM. HPE Greenlake AI LLM. FCC Talks Bandwidth Caps.

Packet Pushers

Cisco announces AI Networking versions of SIlicon One ASICs and buys another DEM business. HPE Greenlake adds AI LLM. Cisco announces AI Networking versions of SIlicon One ASICs and buys another DEM business. HPE Greenlake adds AI LLM. FTC talks bandwidth caps. We laughed. FTC talks bandwidth caps.