June, 2023

Introducing English as the New Programming Language for Apache Spark

databricks

Introduction We are thrilled to unveil the English SDK for Apache Spark, a transformative tool designed to enrich your Spark experience. Apache Spark™.

145

GPT-4 + Streaming Data = Real-Time Generative AI

Confluent

ChatGPT and data streaming can work together for any company. Learn a basic framework for using GPT-4 and streaming to build a real-world production application.

Trending Sources

Meta developer tools: Working at scale

Engineering at Meta

Every day, thousands of developers at Meta are working in repositories with millions of files. Those developers need tools that help them at every stage of the workflow while working at extreme scale. In this article we’ll go through a few of the tools in the development process. And, as an added bonus, those we talk about below are open source so you can try them yourself.

Server 132

Article: Debugging Production: eBPF Chaos

InfoQ Articles

This article shares insights into learning eBPF as a new cloud-native technology which aims to improve Observability and Security workflows. You’ll learn how chaos engineering can help, and get an insight into eBPF based observability and security use cases.

Announcing Complete Azure Observability for Kentik Cloud

Kentik

Today, the phrase “cloud migration” means a lot more than it used to – gone are the days of the simple lift and shift. Kentik customers move workloads to (and from) multiple clouds, integrate existing hybrid applications with new cloud services, migrate to Virtual WAN to secure private network traffic, and make on-premises data and applications redundant to multiple clouds – or cloud data and applications redundant to the data center.

Cloud 105

Large Language Models in the Enterprise: It’s Time to Find a Middle Ground

Dataversity

ChatGPT, the conversational chatbot released by OpenAI in November, garnered 100 million users in just two months, making it the fastest-growing consumer app in Internet history. But the technology that underpins ChatGPT is relevant and appealing to businesses as well. As you may already know, GPT stands for generative pre-trained transformer, which is the technology underlying the […] The post Large Language Models in the Enterprise: It’s Time to Find a Middle Ground appeared first on DAT

Extending Databricks Unity Catalog with an Open Apache Hive Metastore API

databricks

Today, we are excited to announce the preview of a Hive Metastore (HMS) interface for Databricks Unity Catalog, which allows any software compatible.

125

More Trending

Data Centric Revolution: Is Knowledge Ontology the Missing Link?

TDAN

You would think that after knocking around in semantics and knowledge graphs for over two decades I’d have had a pretty good idea about Knowledge Management, but it turns out I didn’t. I think in the rare event the term came up I internally conflated it with Knowledge Graphs and moved on.

Article: Debugging Outside Your Comfort Zone: Diving Beneath a Trusted Abstraction

InfoQ Articles

This article takes a deep dive through a complex outage in the main database cluster of a payments company. We’ll focus on the aftermath of the incident - the process of understanding what went wrong, recreating the outage in a test cluster, and coming up with a way to stop it from happening again, and dive deep into the internals of Postgres, and learn about how it stores data on disk.

DevOps 104

Multi-Cloud Made Simple: Announcing Kentik Observability Enhancements for AWS and Google Cloud

Kentik

Enterprises migrate to multi-cloud networks not because they want to, but because they have to. There’s an acquisition. An initiative to reduce costs. A mandate for redundancy. Regardless of the catalyst (and despite a number of benefits), one outcome is always the same: limited visibility into end-to-end performance across AWS, Azure, GCP, and on-prem.

Cloud 97

Cloud Architecture Mistakes: Organizations Need Shorter Mean Time to Recovery

Dataversity

A common complaint about cloud computing is that the costs of operating in the cloud can get very expensive. In this five-part series, I’ve examined the costly cloud architecture mistakes organizations often make that contribute to those costs, and how an independent cloud platform can solve those problems. Part one explained how organizations can quickly […] The post Cloud Architecture Mistakes: Organizations Need Shorter Mean Time to Recovery appeared first on DATAVERSITY.

Cloud 98

Lakehouse AI: a data-centric approach to building Generative AI applications

databricks

Generative AI will have a transformative impact on every business. Databricks has been pioneering AI innovations for a decade, actively collaborating with thousands.

Meta’s Evenstar is transitioning to OCP to accelerate open RAN adoption

Engineering at Meta

Meta is transferring its IP for Evenstar, a program to accelerate the adoption of open RAN technologies, to the Open Compute Project (OCP). Meta will contribute Evenstar’s radio unit design to OCP, giving the telecom industry its first open, white box radio unit solution. The TIP Open RAN community will leverage the Evenstar radio unit reference designs to drive productization, validation, and commercialization of new Open RAN hardware variants.

Self-Service GitOps for Confluent Cloud

Confluent

Learn how GitOps can work with policy-as-code systems to provide a true self-service model for managing Confluent resources.

Cloud 80

Article: Shift in Sprint Review Mindset: from Reporting to Inclusive Ideation

InfoQ Articles

Sprint Reviews should foster a dynamic environment of creativity, exploration, and continual refinement, where important product and overall business decisions are taken. In this article, we will explore the substantial mindset shift and routine change from a typical reporting-focused to interactive data-driven culture of Sprint Reviews.

98

Lack of Proper Test Data Poses Bottleneck

TDAN

The world is in a digital revolution. Business models are increasingly based on software applications. IT is operating at a faster pace than ever before and has become a vital component of modern business. The speed of application development is becoming a decisive factor for a company’s success.

Data Bias in AI – Can We Beat Evolution Using Technology?

Dataversity

Is there data bias in your business? Recent research indicates that 65% of business and IT executives believe there is currently data bias in their organization, 13% of businesses are currently addressing data bias, and 78% believe data bias will become a bigger concern as AI/ML use increases. What this indicates is that businesses and organizations simultaneously worry […] The post Data Bias in AI – Can We Beat Evolution Using Technology?

Introducing LakehouseIQ: The AI-Powered Engine that Uniquely Understands your Business

databricks

Today, we are thrilled to announce LakehouseIQ, a knowledge engine that learns the unique nuances of your business and data to power natural.

How DoorDash Built an Ensemble Learning Model for Time Series Forecasting

DoorDash Engineering

In real-world forecasting applications, it is a challenge to balance accuracy and speed. We can achieve high accuracy by running numerous models and configuration combinations and we gain speed through running fast, computationally inexpensive models. We explore a number of models and configuration combinations at DoorDash to forecast demand on our platform.

Meet Caroline, Our June 2023 Confluent Champion

Confluent

Meet Caroline Staudenraus, Account Executive in Confluent’s digital native commercial segment. Find out how she has made a difference at Confluent since she joined in 2018.

69

Article: Service Assurance in Private LTE/5G Networks

InfoQ Articles

This article talks about service assurance in the context of cellular networks, how private networks pose additional needs, and how an end-to-end service assurance framework can be designed and developed for such networks.

5G 91

Data is Risky Business: Data Ethics and Governance in the Age of LLMs

TDAN

In the last few months, we have seen the wave of Artificial Intelligence break on the shores of wide-scale business adoption and mainstream media coverage of Large Language Models, most famously ChatGPT.

4 Data Privacy Best Practices

Dataversity

Data privacy is at the heart of every prominent security threat – what are the top best practices for keeping data private? Some of the major cyber security challenges in 2023 are ransomware, hacking of cloud service vendors, and wiper malware. During ransomware attacks, bad actors obtain or encrypt sensitive information. The victims are urged to pay […] The post 4 Data Privacy Best Practices appeared first on DATAVERSITY.

Introducing Lakehouse Federation Capabilities in Unity Catalog

databricks

Data teams face many challenges to quickly access the right data primarily due to data fragmentation, time and cost involved in consolidating data.

101

We’re bringing event analytics to your data warehouse

Mixpanel

Most of the data our customers analyze in Mixpanel is from user behavior. But when you add in data from across the company (revenue, customer profiles, etc.), Mixpanel can deliver much deeper insights to help direct wider decisions about the business. Today, we’re introducing Warehouse Events to make it easier to bring in all that extra data. With Warehouse Events, you will be able to natively connect Mixpanel to tables in your data warehouse so our powerful self-serve analytics Boards and repo

64

Celebrating Pride 2023 at Confluent

Confluent

In honor of Pride Month, in this blog post, we talk about how inclusive Confluent culture is and highlight the experiences of some of our queer employees.

69

Article: Embracing ADHD and Other Neurodivergencies in Software Development Teams

InfoQ Articles

In recent years, there has been increased attention to neurodivergencies such as ADHD, hyper-sensitivity, autism, dyslexia, etc. In this article, Dietrich Moerman tells his own story about ADHD while working as a software developer and becoming a team lead, what he learned, and what he found to be working well to help people with ADHD and more to thrive in their teams and companies.

88

Auditing Database Access and Change

TDAN

The increasing burden of complying with government and industry regulations imposes significant, time-consuming requirements on IT projects and applications. And nowhere is the pressure to comply with regulations greater than on data stored in corporate databases. Organizations must be hyper-vigilant as they implement controls to protect and monitor their data.

A Field Guide for Launching and Growing a Career in Data Science

Dataversity

In recent years, the demand for data scientists has skyrocketed as organizations recognize the value of data-driven insights. Despite increased on-ramps and educational paths to a career in Data Science, there continues to be a concern amidst this increasing demand: the underrepresentation of women in Data Science and other science, technology, engineering, and mathematics (STEM) […] The post A Field Guide for Launching and Growing a Career in Data Science appeared first on DATAVERSITY.

How Databricks’ Lakehouse is helping to power a new era for TD Bank Group's Data Transformation

databricks

This blog is the first of a 3-part series chronicling TD Bank's Data Platform transformation and the enablement of their Data as a.

Banking 101

Mixpanel Ecommerce Analytics

Mixpanel

You have to know your customers well to run a successful ecommerce product or business. Take it from us: We’ve been providing event analytics to help companies build great online shopping experiences for over 10 years. As powerful as Mixpanel has been for ecommerce, today we’re expanding what it can do with the formal launch of Mixpanel Ecommerce Analytics.

Confluent Wins the 2023 Microsoft Commercial Marketplace Partner of the Year Award

Confluent

Our OSS on Azure Partner of the Year Award highlights Confluent's data streaming solution, cloud Apache Kafka, and fully integrated Azure security, management, billing, and data analytics.

Cloud 64

Article: How to Manage Full-Stack Java Development with Hilla

InfoQ Articles

This article explores Hilla, an open-source framework that offers an approach to web application development by integrating a Spring Boot Java backend with a reactive TypeScript frontend. It uses either Lit or React, combined with Vaadin’s 40+ open-source UI web components for interface creation. It also generates REST APIs and client access code, and provides a secure, stateless backend architecture.

Through the Looking Glass: The Sounds of Data, Part 2

TDAN

In Part 1 of The Sounds of Data, I wrote about the power of sonification, which translates data into sounds and music. I finished the article with a cliffhanger.

Where No (Enterprise) WAN Has Gone Before

Kentik

We all know the story. As the Enterprise entered the Mutara Nebula, Khan lost sight of his prey. Sensors were inoperable from the firefight, and finding anything in the gaseous cloud was near impossible. Khan maneuvered, stalked, and hunted his mortal enemy with the cautious vengeance of a madman tempered by misguided intelligence and patience. But this type of encounter in space, this new application of battle strategy borne from intelligence without experience, meant Khan was handicapped from

WAN 59

Announcing MLflow 2.4: LLMOps Tools for Robust Model Evaluation

databricks

LLMs present a massive opportunity for organizations of all scales to quickly build powerful applications and deliver business value. Where data scientists used.

Revenue analytics in Mixpanel

Mixpanel

When we set out to bring event analytics to everyone, we knew that analytics for company revenue data, in particular, would be a unifying force for the cause. Everyone in an org wants to drive more revenue and see the impact of their work on that revenue growth. Mixpanel can do this well today, and with the new features we have in the works, we’re going to make revenue analytics even more powerful and easy to set up.

Cloud 59