Sat. Mar 16, 2024 - Fri. Mar 22, 2024

Lilac Joins Databricks to Simplify Unstructured Data Evaluation for Generative AI

databricks

Today, we are thrilled to announce that Lilac is joining Databricks. Lilac is a scalable, user-friendly tool for data scientists to search, cluster.

142

Threads has entered the fediverse

Engineering at Meta

Threads has entered the fediverse! As part of our beta experience, now available in a few countries, Threads users aged 18+ with public profiles can now choose to share their Threads posts to other ActivityPub-compliant servers. People on those servers can now follow federated Threads profiles and see, like, reply to, and repost posts from the fediverse.

Server 138

Trending Sources

Introducing Tableflow

Confluent

Seamlessly integrate Apache Kafka data into your lakehouse as Apache Iceberg tables, bridging the operational and analytical divide, with Tableflow. Read more in our blog post.

133

Cloudera’s RHEL-volution: Powering the Cloud with Red Hat

Cloudera Blog

As enterprise AI technologies rapidly reshape our digital environment, the foundation of your cloud infrastructure is more critical than ever. That’s why Cloudera and Red Hat, renowned for their open-source solutions, have teamed up to bring Red Hat Enterprise Linux (RHEL) to Cloudera on public cloud as the operating system for all of our public cloud platform images.

Cloud 115

Introducing the Databricks AI Security Framework (DASF)

databricks

We are excited to announce the release of the Databricks AI Security Framework (DASF) version 1.0 whitepaper! The framework is designed to improve.

116

Four New Apache Cassandra 5.0 Features to Be Excited About

Dataversity

With the recent beta release of Apache Cassandra 5.0, now is a great time for teams to give it a spin and discover 5.0’s most interesting and anticipated new capabilities. As I’ve poked around with the new beta, here are four features introduced with open-source Cassandra 5.0 that developer teams should be excited about: 1. Vector […]

Education 115

Best Practices for Confluent Schema Registry

Confluent

Learn the best practices for using Confluent Schema Registry, including using schema IDs, understanding subjects and versions, using data contracts, pre-registering schemas, and more.

111

More Trending

Article: Relational Data at the Edge: How Cloudflare Operates Distributed PostgreSQL Clusters

InfoQ Articles

Explore Cloudflare's distributed PostgreSQL clusters and learn how a cross-region architecture ensures resilience. Discover how data storage and access at the edge deliver massive performance gains by reducing location-sensitive latency and why architecting for degraded states is much harder than for failure states.

Logarithm: A logging engine for AI training workflows and services

Engineering at Meta

Systems and application logs play a key role in operations, observability, and debugging workflows at Meta. Logarithm is a hosted, serverless, multitenant service, used only internally at Meta, that consumes and indexes these logs and provides an interactive query interface to retrieve and view logs. In this post, we present the design behind Logarithm, and show how it powers AI training debugging use cases.

Turbocharged Training: Optimizing the Databricks Mosaic AI stack with FP8

databricks

Benchmarking for training (dense) models at scale. We demonstrate great performance (very high MFU) and highlight our use of NVIDIA's Transformer Engine, along with PyTorch FSDP and DTensor.

Confluent Cloud for Apache Flink Is Now Generally Available

Confluent

Confluent Cloud's serverless Flink offering is now available on all major clouds, offering a unified, managed platform for real-time data processing.

Cloud 93

Article: Architecting for High Availability in the Cloud with Cellular Architecture

InfoQ Articles

Cellular architecture is a design pattern that helps achieve high availability in multi-tenant applications. The goal is to design your application so that you can deploy all of its components into an isolated "cell" that is fully self-sufficient. This improves availability for your customers and helps you meet your SLAs.
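To make the cell idea concrete, here is a minimal sketch of one way requests could be pinned to cells. It is not taken from the article: the cell names, the `cell_for_tenant` helper, and the hash-based assignment are all assumptions chosen for illustration.

```python
import hashlib
from typing import List, Sequence

# Hypothetical cell names; a real deployment would load these from configuration.
CELLS = ["cell-a", "cell-b", "cell-c"]


def cell_for_tenant(tenant_id: str, cells: Sequence[str] = CELLS) -> str:
    """Pick a cell for a tenant using a stable hash of the tenant ID.

    The same tenant always maps to the same cell, so all of its traffic
    stays inside one isolated, self-sufficient deployment.
    """
    digest = hashlib.sha256(tenant_id.encode("utf-8")).digest()
    index = int.from_bytes(digest[:8], "big") % len(cells)
    return cells[index]


def tenants_affected(failed_cell: str, tenant_ids: Sequence[str],
                     cells: Sequence[str] = CELLS) -> List[str]:
    """List the tenants that live in a failed cell (the outage's blast radius)."""
    return [t for t in tenant_ids if cell_for_tenant(t, cells) == failed_cell]
```

With this kind of assignment, losing one cell affects only the tenants mapped to it, while tenants in the other cells keep working. Note that a plain modulo hash reshuffles tenants when the cell list changes, so a production router would more likely keep an explicit tenant-to-cell table.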

Cloud 97

Better video for mobile RTC with AV1 and HD

Engineering at Meta

At Meta, we support real-time communication (RTC) for billions of people through our apps, including Messenger, Instagram, and WhatsApp. We’ve seen significant benefits by adopting the AV1 codec for RTC. Here’s how we are improving the RTC video quality for our apps with tools like the AV1 codec, the challenges we face, and how we mitigate those challenges.

Bandwidth 104

Cloudera Recognized as a Great Place to Work in Ireland and Costa Rica

Cloudera Blog

We’re excited to announce that Cloudera has been named the Best Medium Workplace in Ireland, one of the Best Workplaces in Costa Rica, and one of Ireland’s Best Workplaces for Women for 2024. These recognitions underscore Cloudera’s ongoing efforts to prioritize employee well-being, professional development, and collaborative work environments.

GGML GGUF File Format Vulnerabilities

databricks

The GGUF file format is a binary file format used for storing and loading model weights for the GGML library. The library documentation.

Article: Leading tech people or staying a software engineer: What to choose? Panel Discussion

InfoQ Articles

In this virtual panel, we explore what made people decide to become a leader and how they did it. And we'll find out if we really have to leave tech forever or if there's a way back into engineering.

Data Streaming Platforms, Gen AI, and Apache Flink® Reigned Supreme at Kafka Summit London

Confluent

See the highlights from Kafka Summit London 2024, and learn about how data streaming platforms, Gen AI, and Apache Flink® are driving innovation in the data streaming community.

72

The Role of Quantum Computing in Data Science

Dataversity

Quantum computing is on the cusp of turning the data science world upside down, offering a level of processing power we’ve only dreamed of until now. This new frontier has an incredible potential to reshape the way we approach data analysis, predictive modeling, and solving the kind of complex problems that have always been a tough […]

Unlock deeper marketing insights with Hightouch Campaign Intelligence and Databricks

databricks

Next-generation customer experiences are built upon data and insights derived from various touchpoints. Through these, marketers can detect subtle differences in customer needs.

72

Article: Zero-Knowledge Proofs for the Layman

InfoQ Articles

This article will introduce you to zero-knowledge proofs, a kind of cryptography you can use to prove that you know a secret, such as a private key or the solution to a problem, without ever sharing it with an interested party. While many articles exist on the topic, this one will not require any advanced math knowledge.

Exploring Apache Flink 1.19: Features, Improvements, and More

Confluent

Read the highlights from the Flink 1.19 release, including standard YAML support for configurations, dynamic source parallelism inference, and SQL and Table API improvements.

72

The Drive Toward the Autonomous Enterprise Is a Key Focus for IT Leaders in 2024

Dataversity

According to Gartner, 80% of executives see automation as a vital thread that supports informed business decisions. And they’re right. In today’s business landscape, automation has transcended a mere “nice-to-have” and become a fundamental driver of organizational success. It’s not just transforming tasks but reshaping businesses from the inside out.

The End of Agile – Part 1 (A Brief History of Agile)

TDAN

In recent years, we have seen substantial pushback on many fronts against Agile as a viable and important project management methodology.

HN726: From Automation to Orchestration for a FinTech Network (Sponsored)

Packet Pushers

Fiserv is one of the largest payment processors in the world. In 2023 it handled more than 35 billion transactions worth $2.03 trillion US dollars. Its network is critical to the business. The organization knew it needed network automation, but early attempts got some things wrong. On today's Heavy Networking we talk about how Fiserv […]

Build, Connect, and Consume Intelligent Data Pipelines Seamlessly and Securely

Confluent

Check out the latest features on Confluent Cloud including improvements to 80+ connectors, stream governance updates, enterprise cluster savings, and much more.

The Best Methodology for Moving AI Data and Keeping It Safe

Dataversity

Artificial intelligence (AI) has the power to change the global economy and potentially, one day, every aspect of our lives. There are numerous possible uses for the technology across industries, and new AI projects and applications are frequently released to the public. The only restriction on AI’s use appears to be the inventiveness of human beings.

Legal Issues for Data Professionals: AI Creates Hidden Data and IP Legal Problems

TDAN

As data has catapulted to a new and valuable business asset class, and as AI is increasingly used in business operations, the use of AI has created hidden data and IP risks.

HW023: The Best of WLPC 2024 Phoenix

Packet Pushers

The Wireless LAN Professionals organization just had its 10th annual conference and who better to break it down than WLPC founder (and Heavy Wireless host) Keith Parsons and friend of the show Ferney Munoz. They review their favorite presentations as well as heartwarming moments. Episode Guest Ferney Munoz | Ekahau and CWNP Certified Wireless Network.

TwinLabs.ai Wins Confluent’s Data Streaming Startup Challenge

Confluent

TwinLabs.ai won the 2024 Data Streaming Startup Challenge. Explore this blog post to see what set them apart and their creative use of data streaming.

59

The Insider Threat Prevention Primer Your Company Needs

Dataversity

We know them as friends, colleagues, acquaintances, work wives or husbands, and sometimes, the competition. They are the people we spend more time with than our own families. They are our co-workers and employees. They are also our greatest cybersecurity vulnerabilities. Insider threats, which include employees, contractors, or others with direct access to company data and […]

Data Is Risky Business: The Opportunity Exists Between Keyboard and Chair

TDAN

I’m doing some research work for a thing (more on that thing later in the column).

BGP and asymmetric routing

Noction

Asymmetric routing is the situation where packets from A to B follow a different path than packets from B to A.
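The definition above can be sketched as a small check. This is an illustration only: the hop labels (documentation-range AS numbers) and the `is_asymmetric` helper are assumptions, not something from the Noction article.

```python
from typing import Sequence


def is_asymmetric(forward_path: Sequence[str], return_path: Sequence[str]) -> bool:
    """Return True when the return path is not the forward path walked in reverse."""
    return list(return_path) != list(reversed(forward_path))


# Hypothetical hop lists, written in the order each packet travels.
forward = ["A", "AS64500", "B"]               # A -> B
symmetric_return = ["B", "AS64500", "A"]      # B -> A through the same transit AS
asymmetric_return = ["B", "AS64501", "A"]     # B -> A through a different transit AS

print(is_asymmetric(forward, symmetric_return))
print(is_asymmetric(forward, asymmetric_return))
```

In practice the two hop lists would come from traceroutes run in each direction, which is why asymmetry is hard to see from only one end of the connection.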

52

Confluent Champion Amy on Driving Customer Success

Confluent

Our latest Confluent Champion post features Amy Koh, senior solutions architect at Confluent, and delves into how she is driving customer success.

52

Keeping Cloud Data Costs in Check

Dataversity

Cloud data workloads are like coffee: They come in many forms and flavors, each with different price points. Just as your daily cappuccino habit will end up costing you dozens of times more per month than brewing Folgers at home every morning, the way you configure cloud-based data resources and run queries against […]

Cloud 64

IPB147: The Network Engineering Advantages of IPv6

Packet Pushers

For years, Johannes Weber has heard network engineers around the world repeat the myth that IPv6 is more of a hassle than IPv4. So he made a list: Why IPv6 is better than IPv4. Don't worry, solving global address exhaustion isn't on it. In this episode, Johannes goes over his list with precision and passion.

The 4th Dimension of Threat Exposure Management

VIAVI Solutions

The decades-old practice of vulnerability management utilizes automated scanning methods to systematically identify and mitigate weaknesses created by unpatched operating systems and applications. In today's highly dynamic hybrid cloud environments, the number of vulnerabilities detected continues to increase, while the time it takes threat actors to recognize and exploit them continues to drop.

TCP header, TCP header size, TCP checksum mechanism, TCP header structure, options, and format

Noction

Ever wondered how data travels seamlessly over the internet? TCP headers play a crucial role in ensuring every piece of information reaches its destination intact. Learn about TCP header size, structure, checksum mechanism, and more in our latest article!
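As a rough illustration of the checksum mechanism mentioned above, the sketch below computes the 16-bit one's-complement Internet checksum described in RFC 1071, which TCP uses. It is a simplification and not taken from the linked article: a real TCP checksum is computed over a pseudo-header plus the TCP header and payload, while the hypothetical `internet_checksum` helper here just takes raw bytes.

```python
def internet_checksum(data: bytes) -> int:
    """Compute the 16-bit one's-complement checksum of `data` (RFC 1071 style)."""
    if len(data) % 2:
        data += b"\x00"  # pad odd-length input with a zero byte
    total = 0
    for i in range(0, len(data), 2):
        total += (data[i] << 8) | data[i + 1]  # add each big-endian 16-bit word
    while total >> 16:
        total = (total & 0xFFFF) + (total >> 16)  # fold carries back into 16 bits
    return ~total & 0xFFFF  # one's complement of the folded sum


# Sample bytes from the worked example in RFC 1071.
segment = bytes([0x00, 0x01, 0xF2, 0x03, 0xF4, 0xF5, 0xF6, 0xF7])
checksum = internet_checksum(segment)
print(hex(checksum))  # 0x220d

# A receiver recomputes the checksum over data plus checksum field; 0 means no error detected.
print(internet_checksum(segment + checksum.to_bytes(2, "big")))  # 0
```

The check at the end is what makes the scheme cheap to verify: the receiver does not compare two values, it just confirms that the sum over the whole segment, checksum included, comes out to zero.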

TCP 52