Sat.Jun 08, 2024 - Fri.Jun 14, 2024

article thumbnail

How Meta trains large language models at scale

Engineering at Meta

As we continue to focus our AI research and development on solving increasingly complex problems, one of the most significant and challenging shifts we’ve experienced is the sheer scale of computation required to train large language models (LLMs). Traditionally, our AI model training has involved a training massive number of models that required a comparatively smaller number of GPUs.

article thumbnail

Mosaic AI: Build and deploy production-quality Compound AI Systems

databricks

Over the last year, we have seen a surge of commercial and open-source foundation models showing strong reasoning abilities on general knowledge tasks.

140
140
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Trending Sources

article thumbnail

Article: Elevating Kubernetes Logging for Enhanced Observability

InfoQ Articles

In this article, we will explore the challenges, strategies, and best practices that will help you achieve seamless log management in your Kubernetes environment.

DevOps 110
article thumbnail

Addressing the Elephant in the Room – Welcome to Today’s Cloudera

Cloudera Blog

Hadoop. The first time that I really became familiar with this term was at Hadoop World in New York City some ten or so years ago. There were thousands of attendees at the event – lining up for book signings and meetings with recruiters to fill the endless job openings for developers experienced with MapReduce and managing Big Data. This was the gold rush of the 21st century, except the gold was data.

Banking 103
article thumbnail

Maintaining large-scale AI capacity at Meta

Engineering at Meta

Meta is currently operating many data centers with GPU training clusters across the world. Our data centers are the backbone of our operations, meticulously designed to support the scaling demands of compute and storage. A year ago, however, as the industry reached a critical inflection point due to the rise of artificial intelligence (AI), we recognized that to lead in the generative AI space we’d need to transform our fleet.

Fashion 138
article thumbnail

Open Sourcing Unity Catalog

databricks

We are excited to announce that we are open sourcing Unity Catalog, the industry’s first open source catalog for data and AI governance.

article thumbnail

Article: Delivering Great Developer Experiences with Platform Engineering

InfoQ Articles

Companies increasingly turn to platform engineering to help scale their development teams and increase developer experience for engineer efficiency. In this virtual panel, we’ll discuss how teams build platforms, set others up for success, work with developers who use their platform, measure their progress, and adapt to new challenges.

More Trending

article thumbnail

Serverless Jupyter Notebooks at Meta

Engineering at Meta

At Meta, Bento , our internal Jupyter notebooks platform, is a popular tool that allows our engineers to mix code, text, and multimedia in a single document. Use cases run the entire spectrum from what we call “lite” workloads that involve simple prototyping to heavier and more complex machine learning workflows. However, even though the lite workflows require limited compute, users still have to go through the same process of reserving and provisioning remote compute – a process that takes time

Server 115
article thumbnail

Introducing Databricks LakeFlow: A unified, intelligent solution for data engineering

databricks

Today, we are excited to announce Databricks LakeFlow, a new solution that contains everything you need to build and operate production data pipelines.

article thumbnail

Article: Streaming HTML – Asynchronous DOM Updates without JavaScript

InfoQ Articles

Web applications provide the best user experience when pages load quickly and display additional data as it becomes available. Developers typically use JavaScript to load data asynchronously, but this adds complexity when compared to server-side rendering. We review a technique that uses the Shadow DOM with HTTP streaming to load pages quickly and display data asynchronously without JavaScript.

article thumbnail

Making an AI Investment: How Finance Institutions are Harnessing the Power of AI and Generative AI

Cloudera Blog

Of all of the emerging tech of the last two decades, artificial intelligence (AI) is tipping the hype scale, causing organizations from all industries to rethink their digital transformation initiatives asking where it fits in. In Financial Services, the projected numbers are staggering. According to a recent McKinsey & Co. article , “The McKinsey Global Institute (MGI) estimates that across the global banking sector, [Generative AI] could add between $200 billion and $340 billion in value a

article thumbnail

Unlocking the power of mixed reality devices with MobileConfig

Engineering at Meta

MobileConfig enables developers to centrally manage a mobile app’s configuration parameters in our data centers. Once a parameter value is changed on our central server, billions of app devices automatically fetch and apply the new value without app updates. These remotely managed configuration parameters serve various purposes such as A/B testing, feature rollout, and app personalization.

article thumbnail

Introducing AI/BI: Intelligent Analytics for Real-World Data

databricks

Today, we are excited to announce Databricks AI/BI , a new type of business intelligence product built from the ground up to deeply.

137
137
article thumbnail

Next-Gen Customer Loyalty Programs with Data Streaming

Confluent

Use Confluent’s data streaming platform to bring real-time insights to customer loyalty programs, creating personalized offers that drive greater retention and revenue.

75
article thumbnail

Where Does Data Governance Fit Into Hybrid Cloud?

Cloudera Blog

At a time when artificial intelligence (AI) and tools like generative AI (GenAI) and large language models (LLMs) have exploded in popularity, getting the most out of organizational data is critical to driving business value and carving out a competitive market advantage. To reach that goal, more businesses are turning toward hybrid cloud infrastructure – with data on-premises, in the cloud, or both – as a means to tap into valuable data.

article thumbnail

Data Retention Policies Must Evolve to Address Emerging Technologies and Data Growth

Dataversity

The emergence of new technologies, including AI, IoT, and blockchain, in addition to the widespread embrace of digital transformation, has driven a dramatic increase in data. The reliance on data analytics to drive data-driven decision-making also requires large volumes of data for meaningful insights. While AI and generative AI (GenAI) tools and systems contribute to […] The post Data Retention Policies Must Evolve to Address Emerging Technologies and Data Growth appeared first on DATAVER

article thumbnail

How FactSet Implemented an Enterprise Generative AI Platform with Databricks and MLflow

databricks

“FactSet’s mission is to empower clients to make data-driven decisions and supercharge their workflows and productivity. To deliver AI-driven solutions across our entire.

Financial 135
article thumbnail

Introducing Build with Confluent: Enabling Partners to Bring Data Streaming Use Cases to Market Faster

Confluent

Build with Confluent helps system integrators develop joint solutions faster, including specialized software bundles, support from data streaming experts to certify offerings, and access to Confluent’s Go-To-Market teams to amplify audience.

75
article thumbnail

HN738: Reducing Complexity With Fortinet’s Unified SASE (Sponsored)

Packet Pushers

Fortinets Unified SASE provides consistent security controls and policies both for traditional campuses and the hybrid workforce. Nirav Shah joins us to explain how Fortinet is positioned to do this: a foundational software developed for 20 years, a network of over 140 POPs, a security lab with over 1,000 researchers, continuous ZTNA verification proxies, and.

SASE 52
article thumbnail

Conversational AI’s Quantum Leap: How RAG Is Enabling Smarter Chatbots

Dataversity

Chatbots were among the first apps that testified to the mainstream adoption of AI and inspired further innovations in the conversational space. Now, it’s time to move on from just responding bots to emphatic companions that further reduce the dependency on human intelligence. RAG-enabled chatbots are proactive in responding to and addressing queries in real […] The post Conversational AI’s Quantum Leap: How RAG Is Enabling Smarter Chatbots appeared first on DATAVERSITY.

article thumbnail

What’s New with Databricks Unity Catalog at Data + AI Summit 2024

databricks

In an era marked by rapid advancements in artificial intelligence and an explosion of data and Gen AI tools, enterprises face fragmented data.

article thumbnail

How to Use Flink SQL, Streamlit, and Kafka: Part 2

Confluent

This is the second part of our series that explains how to create a graph that updates in real time with Streamlit, Kafka, and Flink SQL.

69
article thumbnail

Hedge 230: Preparing for Layoffs

Rule 11

You will probably be laid off at least once in your career–we no longer live a world of “permanent positions,” or even a world where people are in complete control of their “work destiny.” It’s important, then, to prepare to be laid off, made redundant, or impacted by a RIF, today. Mike Bushong joins Eyvonne Sharp, Tom Ammon, and Russ White in a wide-ranging discussion about preparing to be laid off.

52
article thumbnail

AI Has the Potential to Bring Cloud Sprawl Back Under Control

Dataversity

Enterprises continue to produce an explosive volume of data each year, so much so that companies admit up to 60% of their data goes unused, and one-third of enterprises report feeling overwhelmed with rising data quantities. Cloud-based data storage solutions have made it easier than ever to store this vast amount of data, and for companies who […] The post AI Has the Potential to Bring Cloud Sprawl Back Under Control appeared first on DATAVERSITY.

Cloud 59
article thumbnail

Announcing General Availability of Predictive Optimization

databricks

We're excited to announce the General Availability of Databricks Predictive Optimization. This capability intelligently optimizes your table data layouts for faster queries and.

102
102
article thumbnail

AWS and Confluent: Meeting the Requirements of Real-Time Operations

Confluent

Discover how Confluent & AWS streamline cloud migration & data integration for government agencies, optimizing efficiency & enhancing customer experience.

article thumbnail

IPB153: Leveraging ICMPv6 to Troubleshoot Network Issues

Packet Pushers

If you dont blame destination unreachable messages on DNS servers, are you even a real network engineer? All joking aside, Johannes Weber joins the show today to teach us how to use ICMPv6 to troubleshoot network issues, pinpointing if the problem is within your network or outside of it. He walks us through identifying possible. Read more » If you dont blame destination unreachable messages on DNS servers, are you even a real network engineer?

DNS 52
article thumbnail

Ask a Data Ethicist: What Is Data Sovereignty?

Dataversity

Recently, my DATAVERSITY colleague Mark Horseman shared that he’d been getting a lot more questions about Indigenous data sovereignty. We both agreed it would make a great topic for this month’s column. What is Indigenous data sovereignty, why does it matter, and how can we learn more about it? To answer these questions, it helps […] The post Ask a Data Ethicist: What Is Data Sovereignty?

article thumbnail

Data Intelligence and AI Trends: Top products, RAG and more

databricks

Generative AI fever shows no signs of cooling off. As pressure and excitement build to execute strong GenAI strategies, data leaders and practitioners.

98
article thumbnail

AWS and Confluent: Meeting the Requirements of Real-Time Operations

Confluent

Discover how Confluent & AWS streamline cloud migration & data integration for government agencies, optimizing efficiency & enhancing customer experience.

article thumbnail

NAN065: The Excitement and Trepidation of Automation with Scott Robohn

Packet Pushers

In part two of Scott Robohns interview, Scott tells us about his experience starting his own business and co-founding the Network Automation Forum and the AutoCon conference series. He describes the strong desire among many engineers to drive network automation forward, and how AutoCon creates a community to help make that happen. He and Eric. Read more » In part two of Scott Robohns interview, Scott tells us about his experience starting his own business and co-founding the Network Automat

article thumbnail

Essential Steps to Troubleshoot A Network

Obkio

Learn the essential steps to troubleshoot network issues effectively. Start with Obkio's Network Performance Monitoring tool for comprehensive network insights.

Network 52
article thumbnail

Databricks Announces 2024 Global Partner Awards

databricks

The Databricks Partner Ecosystem, comprising over 3,800 partners worldwide, plays a pivotal role in building and delivering premier data and AI solutions globally.

92
article thumbnail

A Journey from Intern to Front-End Developer

Noction

An internship can become your ticket to a career. Cristian, a former intern turned team member, shares insights on his journey and achievements.

52
article thumbnail

PP018: RSA Recap, Including a View from the Event SOC

Packet Pushers

Drew and JJ have recovered from the overstimulation of the RSA expo floor and are ready to discuss their takeaways from the conference. They discuss the surprising emphasis on microsegmentation and storage backups, and the not-so-surprising focus on IoT security and AI-assisted products. They also pull back the curtain on what the conferences own SOC.

article thumbnail

Mitigating Risks and Ensuring Data Integrity Through Legacy Modernization

Dataversity

Legacy systems are the backbone of many businesses, especially those in the industry for decades. These systems ensure a stable and reliable platform for conducting smooth business operations. However, decade-old legacy systems can also pose risks and challenges to data integrity that can affect an organization’s overall growth and success. In response to these challenges, […] The post Mitigating Risks and Ensuring Data Integrity Through Legacy Modernization appeared first on DATAVERSITY.

article thumbnail

2024 Fortune Best Workplaces in Bay Area™ recognizes Databricks

databricks

In the dynamic, innovative landscape of the San Francisco Bay Area, Databricks stands out not just for our groundbreaking data and AI solutions.

87