June, 2024

article thumbnail

Databricks + Tabular

databricks

We are excited to announce that we have agreed to acquire Tabular, Inc, a data management company founded by Ryan Blue, Daniel Weeks.

145
145
article thumbnail

The Economics of Writing Technical Books

Architect Elevator

Yes, you can make money writing books. But never do it for that.

141
141
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Trending Sources

article thumbnail

How Meta trains large language models at scale

Engineering at Meta

As we continue to focus our AI research and development on solving increasingly complex problems, one of the most significant and challenging shifts we’ve experienced is the sheer scale of computation required to train large language models (LLMs). Traditionally, our AI model training has involved a training massive number of models that required a comparatively smaller number of GPUs.

article thumbnail

Handling Exchange Online Quarantine Message Using PowerShell

Faris Malaeb

Managing email quarantine in Exchange Online is a regular and important task for maintaining a secure and efficient email environment. Exchange Online provides robust PowerShell cmdlets to handle quarantine messages effectively. Today you will learn the essential cmdlets for managing Exchange Online quarantine messages, ensuring you can handle them with ease and precision.

Email 130
article thumbnail

Enhanced Cybersecurity with Real-Time Log Aggregation and Analysis

Confluent

Leverage Confluent’s data streaming platform to continuously ingest, process, and analyze logs to strengthen your cybersecurity and SIEM.

120
120
article thumbnail

Article: Elevating Kubernetes Logging for Enhanced Observability

InfoQ Articles

In this article, we will explore the challenges, strategies, and best practices that will help you achieve seamless log management in your Kubernetes environment.

DevOps 110
article thumbnail

Mosaic AI: Build and deploy production-quality Compound AI Systems

databricks

Over the last year, we have seen a surge of commercial and open-source foundation models showing strong reasoning abilities on general knowledge tasks.

140
140

More Trending

article thumbnail

Maintaining large-scale AI capacity at Meta

Engineering at Meta

Meta is currently operating many data centers with GPU training clusters across the world. Our data centers are the backbone of our operations, meticulously designed to support the scaling demands of compute and storage. A year ago, however, as the industry reached a critical inflection point due to the rise of artificial intelligence (AI), we recognized that to lead in the generative AI space we’d need to transform our fleet.

Fashion 138
article thumbnail

Troubleshooting Exchange Online Mailbox Provisioning Errors

Faris Malaeb

Incorrect or missed configurations can lead to errors causing mailbox provisioning failures. One such error, often encountered in the Microsoft Admin Center, is represented by the message: “Exchange Online: An unknown error has occurred. Refer to correlation ID GUID.” This post will help you understand how to identify the cause of this error and provide.

130
130
article thumbnail

How to Turn a REST API Into a Data Stream with Kafka and Flink

Confluent

Improve REST API response data w/Kafka and Flink SQL in Confluent Cloud; Automatic connector retriability combats REST flakiness; Demo w/OpenSky data.

Cloud 105
article thumbnail

Article: Zero to Performance Hero: How to Benchmark and Profile Your eBPF Code in Rust

InfoQ Articles

In this article, we will walk through creating a basic eBPF program in Rust. We will intentionally include a performance regression and then use profilers to locate and fix the bug. We will also create benchmarks and track them using a continuous benchmarking tool for CI.

DevOps 109
article thumbnail

Open Sourcing Unity Catalog

databricks

We are excited to announce that we are open sourcing Unity Catalog, the industry’s first open source catalog for data and AI governance.

article thumbnail

Cloudera Unveils Plans for Annual Pride Celebration in Cork

Cloudera Blog

Pride Month is underway and we at Cloudera are looking forward to joining the global celebration of diversity, equity and the ongoing effort for LGBTQ+ ( L esbian, G ay, B isexual, T ransgender, Q ueer/ Q uestioning) rights and recognition. Pride Month serves as a reminder that the fight for equality and equity for members of the LGBTQ+ community is not over.

108
108
article thumbnail

Serverless Jupyter Notebooks at Meta

Engineering at Meta

At Meta, Bento , our internal Jupyter notebooks platform, is a popular tool that allows our engineers to mix code, text, and multimedia in a single document. Use cases run the entire spectrum from what we call “lite” workloads that involve simple prototyping to heavier and more complex machine learning workflows. However, even though the lite workflows require limited compute, users still have to go through the same process of reserving and provisioning remote compute – a process that takes time

Server 115
article thumbnail

The Future of Insurance: A Business Analyst’s Insight into Emerging Trends and Technologies

Dataversity

The insurance industry is undergoing a revolution, mainly driven by the application of advanced emerging technologies. The application and installation of new technologies enable a better future for our industry, where customers will receive maximum efficiency, security, and flexibility. Here, we address the major technologies and trends that influence this transition, shedding light on their […] The post The Future of Insurance: A Business Analyst’s Insight into Emerging Trends and Techno

article thumbnail

The Rising Importance of AI Governance

TDAN

AI governance has become a critical topic in today’s technological landscape, especially with the rise of AI and GenAI. As CEOs express concerns regarding the potential risks with these technologies, it is important to identify and address the biggest risks.

article thumbnail

Article: Using Generative AI in Software Project Management to Bridge Domains and Accelerate Productivity

InfoQ Articles

Gen AI Assistants play to the strengths of professionals with a breadth of experience, particularly software developers who can describe what they want the LLM to complete and critically evaluate the result. These tools enable us to swiftly cross divides of domain language and scale large repetitive tasks down to interesting ones on a human scale.

article thumbnail

Introducing Databricks LakeFlow: A unified, intelligent solution for data engineering

databricks

Today, we are excited to announce Databricks LakeFlow, a new solution that contains everything you need to build and operate production data pipelines.

article thumbnail

Addressing the Elephant in the Room – Welcome to Today’s Cloudera

Cloudera Blog

Hadoop. The first time that I really became familiar with this term was at Hadoop World in New York City some ten or so years ago. There were thousands of attendees at the event – lining up for book signings and meetings with recruiters to fill the endless job openings for developers experienced with MapReduce and managing Big Data. This was the gold rush of the 21st century, except the gold was data.

Banking 103
article thumbnail

Leveraging AI for efficient incident response

Engineering at Meta

We’re sharing how we streamline system reliability investigations using a new AI-assisted root cause analysis system. The system uses a combination of heuristic-based retrieval and large language model-based ranking to speed up root cause identification during investigations. Our testing has shown this new system achieves 42% accuracy in identifying root causes for investigations at their creation time related to our web monorepo.

article thumbnail

The Cool Kids Corner: Lifelong Learning 

Dataversity

Hello! I’m Mark Horseman, and welcome to The Cool Kids Corner. This is my monthly check-in to share with you the people and ideas I encounter as a data evangelist with DATAVERSITY. (Last month, we discussed data leadership.) This month, we’re talking about lifelong learning. In the Data Management space, what drives us to learn […] The post The Cool Kids Corner: Lifelong Learning appeared first on DATAVERSITY.

article thumbnail

Running Apache Kafka® at the Edge Requires Confluent’s Enterprise-Grade Data Streaming Platform

Confluent

Deploy Apache Kafka® at the edge with Confluent to avoid complexities and constraints while accelerating innovation with an enterprise-grade data streaming platform.

80
article thumbnail

Article: Optimizing Spring Boot Config Management with ConfigMaps: Environment Variables or Volume Mounts

InfoQ Articles

Spring Boot stands out as a viable framework for its agility and streamlined workflow. Yet, effective configuration management remains a pivotal factor influencing deployment efficiency and ongoing maintenance. ConfigMaps, a feature in Kubernetes, provides configuration strategies for Spring Boot applications.

article thumbnail

Introducing AI/BI: Intelligent Analytics for Real-World Data

databricks

Today, we are excited to announce Databricks AI/BI , a new type of business intelligence product built from the ground up to deeply.

137
137
article thumbnail

Fueling Enterprise Generative AI with Data: The Cornerstone of Differentiation

Cloudera Blog

More than two-thirds of companies are currently using Generative AI (GenAI) models, such as large language models (LLMs), which can understand and generate human-like text, images, video, music, and even code. However, the true power of these models lies in their ability to adapt to an enterprise’s unique context. By leveraging an organization’s proprietary data, GenAI models can produce highly relevant and customized outputs that align with the business’s specific needs and ob

article thumbnail

The key to a happy Rust/C++ relationship

Engineering at Meta

The history of Rust at Meta goes all the way back to 2016, when we first started using it for source control. Today, it has been widely embraced at Meta and is one of our primary supported server-side languages (along with C++, Python, and Hack). But that doesn’t mean there weren’t any growing pains. Aida G., a member of one of Meta’s first Rust teams, joins Pascal Hartig ( @passy ) on the latest Meta Tech Podcast to dive into the challenges of getting Rust to interact with Meta’s large amount o

article thumbnail

Key Considerations for C-Suite Leaders Involved in Digital Transformation Initiatives

Dataversity

With spending on digital transformation initiatives worldwide projected to hit $3.9 trillion by 2027, the pressure is on organizations – and specifically the C-suite – to ensure that not only are they best positioned to tackle the digital challenges of today but that they can quickly adapt to those of tomorrow as well. C-suite leaders find themselves […] The post Key Considerations for C-Suite Leaders Involved in Digital Transformation Initiatives appeared first on DATAVERSITY.

article thumbnail

How to Scale RAG and Build More Accurate LLMs

Confluent

Struggling to get your GenAI app off the ground? You’re not alone—GenAI apps can be challenging to build. Luckily, taking an event-driven approach can make the process more manageable. Learn how.

80
article thumbnail

Article: Platform as a Runtime - The Next Step in Platform Engineering

InfoQ Articles

As systems become larger and more complex we need to take the concepts of platform engineering to a higher level – to the code level – by creating platforms and abstractions that will reduce cognitive load, help simplify and accelerate software development, and allow for easy maintenance and upgrades to the platform. Let’s move from “platform” to “Platform as a Runtime”.

article thumbnail

How FactSet Implemented an Enterprise Generative AI Platform with Databricks and MLflow

databricks

“FactSet’s mission is to empower clients to make data-driven decisions and supercharge their workflows and productivity. To deliver AI-driven solutions across our entire.

Financial 135
article thumbnail

The Importance of Recognizing Juneteenth

Cloudera Blog

Juneteenth holds profound significance in the history of freedom and equality for Black Americans. Also known as Freedom Day or Emancipation Day, Juneteenth commemorates the anniversary of June 19, 1865, when news of the Emancipation Proclamation reached Galveston, Texas, finally declaring freedom for enslaved Americans held in the Confederacy–more than two years after the proclamation was issued in on January 1, 1863.

article thumbnail

Unlocking the power of mixed reality devices with MobileConfig

Engineering at Meta

MobileConfig enables developers to centrally manage a mobile app’s configuration parameters in our data centers. Once a parameter value is changed on our central server, billions of app devices automatically fetch and apply the new value without app updates. These remotely managed configuration parameters serve various purposes such as A/B testing, feature rollout, and app personalization.

article thumbnail

Technology Short Take 179

Scott's Weblog

Welcome to Technology Short Take #179! I’m back with another set of links to articles on various data center- and IT-related topics. In the interest of full transparency, I’d like to give credit to Russ White for his “Weekend Reads” series of posts, which are similar in nature to my Technology Short Takes. If you aren’t reading Russ’ “Weekend Reads” posts, you’re missing out on a good source of useful information.

article thumbnail

Confluent Is Named Microsoft’s 2024 OSS on Azure Global Partner of the Year

Confluent

Our OSS on Azure Partner of the Year Award highlights Confluent's data streaming solution, cloud Apache Kafka, and fully integrated Azure security, management, billing, and data analytics.

Cloud 78
article thumbnail

Article: Million Dollar Lines of Code - An Engineering Perspective on Cloud Cost Optimization

InfoQ Articles

A single line of code can shape an organization's financial future. Erik Peterson, the CTO and founder at CloudZero, presented an engineering perspective on cloud cost optimization at QCon San Francisco.

article thumbnail

Databricks Named a Leader in The Forrester Wave™: AI Foundation Models for Language, Q2 2024

databricks

We are excited to announce that Forrester has recognized Databricks as a Leader in The Forrester Wave™: AI Foundation Models for Language, Q2.

131
131
article thumbnail

The Award Winning Formula: How Cloudera Empowered OCBC With Trusted Data To Unlock Business Value from AI

Cloudera Blog

Recently, Cloudera, alongside OCBC, were named winners in the“ Best Big Data and Analytics Infrastructure Implementation ” category at The Asian Banker’s Financial Technology Innovation Awards 2024. This recognition underscores the importance of trusted data when building AI and generative AI (GenAI) models and serves as a testament to the impact that reliable data can have in real world use cases.

Banking 83