Topics: Data Centers, Networking, Topology

A RoCE network for distributed AI training at scale

Engineering at Meta

AI networks play an important role in interconnecting tens of thousands of GPUs, forming the foundational infrastructure for training and enabling large models with hundreds of billions of parameters, such as Llama 3.1. Distributed training, in particular, imposes the most significant strain on data center networking infrastructure.
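
The synchronized GPU-to-GPU traffic described above comes largely from collective operations such as all-reduce. As a rough sketch (not Meta's code), the snippet below shows how a PyTorch training process might average gradients across ranks using the NCCL backend, which can run over a RoCE fabric when RDMA-capable NICs are present; the model, tensor sizes, and launcher-provided environment variables (RANK, WORLD_SIZE, LOCAL_RANK) are assumptions for illustration.

```python
# Illustrative sketch only: the kind of synchronized gradient exchange that
# stresses an AI training fabric. Assumes a torchrun-style launcher that sets
# RANK, WORLD_SIZE, LOCAL_RANK, MASTER_ADDR, and MASTER_PORT.
import os

import torch
import torch.distributed as dist


def average_gradients(model: torch.nn.Module) -> None:
    """Sum every gradient across all ranks, then divide by the world size."""
    world_size = dist.get_world_size()
    for param in model.parameters():
        if param.grad is not None:
            dist.all_reduce(param.grad, op=dist.ReduceOp.SUM)
            param.grad /= world_size


if __name__ == "__main__":
    dist.init_process_group(backend="nccl")  # NCCL can use RDMA/RoCE NICs
    torch.cuda.set_device(int(os.environ.get("LOCAL_RANK", "0")))

    model = torch.nn.Linear(1024, 1024).cuda()
    loss = model(torch.randn(32, 1024, device="cuda")).sum()
    loss.backward()
    average_gradients(model)  # every rank now holds identical averaged gradients
```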

Seamless network integration: connecting OpenShift to your data center with Apstra

Juniper

In today’s fast-paced digital world, businesses demand agility and efficiency from their IT infrastructure. The most commonly deployed templates set up a cloud-scale EVPN-VXLAN fabric.
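
As a hedged illustration of the kind of intent-based automation the post describes, the snippet below sketches creating a fabric blueprint from a template through Apstra's REST API with plain HTTP calls. The controller address, endpoint paths, payload fields, and template ID are assumptions for illustration and will differ by Apstra version; treat the official API reference as authoritative.

```python
# Rough sketch only; paths, field names, and IDs are illustrative assumptions,
# not a verified Apstra API reference.
import requests

APSTRA = "https://apstra.example.com"  # hypothetical controller address

# Authenticate and obtain an API token (endpoint assumed).
login = requests.post(
    f"{APSTRA}/api/aaa/login",
    json={"username": "admin", "password": "secret"},
    verify=False,
)
headers = {"AuthToken": login.json()["token"]}

# Instantiate a blueprint from an EVPN-VXLAN fabric template (payload assumed).
blueprint = requests.post(
    f"{APSTRA}/api/blueprints",
    headers=headers,
    json={
        "design": "two_stage_l3clos",
        "init_type": "template_reference",
        "template_id": "evpn-vxlan-dc-template",  # hypothetical template ID
        "label": "openshift-dc-fabric",
    },
    verify=False,
)
print(blueprint.json().get("id"))
```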

Kentik Bridges the Intelligence Gap for Hybrid Cloud Networks

Kentik

As Kentik’s product manager for hybrid cloud, I am always talking to infrastructure and network teams around the world to understand a day in their lives. This gives me an invaluable understanding of the challenges, goals, and priorities they face today, as well as a view into their future network monitoring needs.

Announcing Complete Azure Observability for Kentik Cloud

Kentik

Kentik customers move workloads to (and from) multiple clouds, integrate existing hybrid applications with new cloud services, migrate to Virtual WAN to secure private network traffic, and make on-premises data and applications redundant to multiple clouds – or cloud data and applications redundant to the data center.

How Meta trains large language models at scale

Engineering at Meta

Supporting GenAI at scale has meant rethinking how our software, hardware, and network infrastructure come together. Optimal connectivity between GPUs: Large-scale model training involves transferring vast amounts of data between GPUs in a synchronized fashion, which requires revisiting trade-offs made for other types of workloads.
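
One of those trade-offs is how much of the synchronized communication can be hidden behind computation. The sketch below is an illustration under assumptions (not Meta's implementation): non-blocking all-reduces are launched per gradient bucket and awaited only when the results are needed, so network transfers overlap with other work. It assumes torch.distributed has already been initialized with a backend such as NCCL.

```python
# Illustrative sketch: overlap collective communication with computation by
# issuing asynchronous all-reduces and waiting on them later.
from typing import List

import torch
import torch.distributed as dist


def start_bucket_allreduce(buckets: List[torch.Tensor]) -> list:
    """Launch a non-blocking all-reduce for each gradient bucket."""
    return [dist.all_reduce(b, op=dist.ReduceOp.SUM, async_op=True) for b in buckets]


def finish_bucket_allreduce(buckets: List[torch.Tensor], handles: list) -> None:
    """Wait for the transfers to complete, then average each bucket in place."""
    world_size = dist.get_world_size()
    for handle in handles:
        handle.wait()
    for bucket in buckets:
        bucket /= world_size
```

PyTorch's DistributedDataParallel applies the same idea automatically by bucketing gradients and reducing them while the backward pass is still running.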

Building Meta’s GenAI Infrastructure

Engineering at Meta

We are sharing details on the hardware, network, storage, design, performance, and software that help us extract high throughput and reliability for various AI workloads, including two versions of our 24,576-GPU data-center-scale cluster at Meta. We use this cluster design for Llama 3 training.