Sat.Jun 24, 2023 - Fri.Jun 30, 2023

article thumbnail

Introducing English as the New Programming Language for Apache Spark

databricks

Introduction We are thrilled to unveil the English SDK for Apache Spark, a transformative tool designed to enrich your Spark experience. Apache Spark™.

145
145
article thumbnail

Meta developer tools: Working at scale

Engineering at Meta

Every day, thousands of developers at Meta are working in repositories with millions of files. Those developers need tools that help them at every stage of the workflow while working at extreme scale. In this article we’ll go through a few of the tools in the development process. And, as an added bonus, those we talk about below are open source so you can try them yourself.

Server 132
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Trending Sources

article thumbnail

What Is an Event in the Apache Kafka Ecosystem?

Confluent

Get an introduction into the world of events and event-driven architecture in Apache Kafka. Learn what events are and the role they play in event design, event streaming, and event-driven design.

111
111
article thumbnail

Announcing Complete Azure Observability for Kentik Cloud

Kentik

Today, the phrase “cloud migration” means a lot more than it used to – gone are the days of the simple lift and shift. Kentik customers move workloads to (and from) multiple clouds, integrate existing hybrid applications with new cloud services, migrate to Virtual WAN to secure private network traffic, and make on-premises data and applications redundant to multiple clouds – or cloud data and applications redundant to the data center.

Cloud 105
article thumbnail

Lakehouse AI: a data-centric approach to building Generative AI applications

databricks

Generative AI will have a transformative impact on every business. Databricks has been pioneering AI innovations for a decade, actively collaborating with thousands.

article thumbnail

Large Language Models in the Enterprise: It’s Time to Find a Middle Ground

Dataversity

ChatGPT, the conversational chatbot released by OpenAI in November, garnered 100 million users in just two months, making it the fastest-growing consumer app in Internet history. But the technology that underpins ChatGPT is relevant and appealing to businesses as well. As you may already know, GPT stands for generative pre-trained transformer, which is the technology underlying the […] The post Large Language Models in the Enterprise: It’s Time to Find a Middle Ground appeared first on DAT

article thumbnail

Article: Embracing ADHD and Other Neurodivergencies in Software Development Teams

InfoQ Articles

In recent years, there has been increased attention to neurodivergencies such as ADHD, hyper-sensitivity, autism, dyslexia, etc. In this article, Dietrich Moerman tells his own story about ADHD while working as a software developer and becoming a team lead, what he learned, and what he found to be working well to help people with ADHD and more to thrive in their teams and companies.

88

More Trending

article thumbnail

Introducing LakehouseIQ: The AI-Powered Engine that Uniquely Understands your Business

databricks

Today, we are thrilled to announce LakehouseIQ, a knowledge engine that learns the unique nuances of your business and data to power natural.

article thumbnail

A Field Guide for Launching and Growing a Career in Data Science

Dataversity

In recent years, the demand for data scientists has skyrocketed as organizations recognize the value of data-driven insights. Despite increased on-ramps and educational paths to a career in Data Science, there continues to be a concern amidst this increasing demand: the underrepresentation of women in Data Science and other science, technology, engineering, and mathematics (STEM) […] The post A Field Guide for Launching and Growing a Career in Data Science appeared first on DATAVERSITY.

article thumbnail

Article: Comparative Analysis of Major Distributed File System Architectures: GFS vs. Tectonic vs. JuiceFS

InfoQ Articles

As storage needs continue to grow, traditional disk file systems have revealed their limitations. To address the growing storage demands, distributed file systems have emerged as dynamic and scalable solutions. In this article, we explore the design principles, innovations, and challenges addressed by three representative distributed file systems: Google File System (GFS), Tectonic, and JuiceFS.

article thumbnail

Confluent Wins the 2023 Microsoft Commercial Marketplace Partner of the Year Award

Confluent

Our OSS on Azure Partner of the Year Award highlights Confluent's data streaming solution, cloud Apache Kafka, and fully integrated Azure security, management, billing, and data analytics.

Cloud 64
article thumbnail

Introducing Lakehouse Federation Capabilities in Unity Catalog

databricks

Data teams face many challenges to quickly access the right data primarily due to data fragmentation, time and cost involved in consolidating data.

101
101
article thumbnail

What Does Exactly Happen When A Switch Is Connected To A Portfast Port?

Network Engineering

I am trying to get a thorough understanding of what does happen when a switch is connected to a portfast port on another switch that is part of a stable STP network in case of: 1- the port has no bpduguard. 2- the port has the bpduguard enabled. I know that in case 1, loops can be formed. but why? And does the port really revert to normal STP operations (LSN/LRN/FWD) as soon as it receives the first bpdu?

Port 52
article thumbnail

Article: A Comprehensive Guide to Java's New Feature: Pattern Matching for Switch

InfoQ Articles

Java brings an update with Pattern Matching for Switch. This article provides a detailed exploration of this feature, examining its support for any reference type, inclusion of null values, and introduction of guarded patterns. It also delves into the new runtime exception class - MatchException, and illustrates the compatibility of this feature with traditional switch statements.

78
article thumbnail

How Mobile Premier League Reduced Player Churn with Confluent Cloud

Confluent

Streaming data for gaming enables real-time player matchmaking, personalization, and fraud detection to deliver the best gaming experiences and reduce player churn.

Cloud 64
article thumbnail

Project Lightspeed Update - Advancing Apache Spark Structured Streaming

databricks

In this blog post, we will review the advancements in Spark Structured Streaming since we announced Project Lightspeed a year ago, from performance.

article thumbnail

Heavy Networking 688: Packet-Level Fundamentals With Chris Greer

Packet Pushers

Packet-level fundamentals are essential for network engineers to be able to diagnose and solve network and application problems. On today's Heavy Networking, we dive into the transport layer and packets with packet analysis expert and instructor Chris Greer. Packet-level fundamentals are essential for network engineers to be able to diagnose and solve network and application problems.

article thumbnail

Article: Designing the Jit Analytics Architecture for Scale and Reuse

InfoQ Articles

As a SaaS provider, analytical data at Jit needs to be useful to both their customers and to internal stakeholders. AWS services including EventBridge, Kinesis Data Firehose, and Timestream handle data ingestion and UI platforms from Mixpanel and Segment provide data visualization.

76
article thumbnail

Data Streaming Awards 2023: Call for Nominations Are Now Open!

Confluent

Call for nominations are now open for the Data Streaming Awards 2023. Learn more about the nomination process in this blog post.

59
article thumbnail

Announcing Delta Lake 3.0 with New Universal Format and Liquid Clustering

databricks

We are excited to announce Delta Lake 3.0, the next major release of the Linux Foundation open source Delta Lake Project, available in.

article thumbnail

IPv6 Buzz 129: IPv6 Architecture And Subnetting With Daryll Swer

Packet Pushers

Today's IPv6 Buzz podcast gets into IPv6 architecture and subnetting including how geography fits into IPv6 subnetting, minimum allocation sizes from the RIR to end-users, whether current RIR policies will provide sufficient address space for a future-proof IPv6 architecture, and more. Our guest is Daryll Swer. Today's IPv6 Buzz podcast gets into IPv6 architecture and subnetting including how geography fits into IPv6 subnetting, minimum allocation sizes from the RIR to end-users, whether current

52
article thumbnail

MITRE ATT&CK and How to Apply It to Your Organization

CATO Networks

MITRE ATT&CK is a popular knowledge base that categorizes the Tactics, Techniques and Procedures (TTPs) used by adversaries in cyberattacks. Created by nonprofit organization MITRE, MITRE ATT&CK equips security professionals with valuable insights to comprehend, detect, and counter cyber threats. In this blog post, we dive into the framework, explore different use cases for using it and discuss cross-community collaboration.

article thumbnail

Architecting Real-Time Analytics for Speed and Scale

Dataversity

In today’s fast-paced world, the concept of patience as a virtue seems to be fading away, as people no longer want to wait for anything. If Netflix takes too long to load or the nearest Lyft is too far, users are quick to switch to alternative options. The demand for instant results is not limited […] The post Architecting Real-Time Analytics for Speed and Scale appeared first on DATAVERSITY.

article thumbnail

What’s new with Unity Catalog at Data and AI Summit 2023

databricks

The fundamental principles of governance – accountability, compliance, quality, and transparency – that are essential for data management have now become equally imperative for.

article thumbnail

HS050: The Tech Job Debacle

Packet Pushers

Google, Microsoft, Twitter, META/FB and a few others laid off an estimated 200,000 tech and tech-adjacent folks in recent weeks. Other companies like Fedex and Amazon have made layoffs, many impacting the IT teams.What does that mean for the tech industry? Between AI and our corporate overlords are we all lucky to be employed, and should we go back to working 80 hour in-office weeks?

52
article thumbnail

How Kentik reduces the likelihood of a full-blown cyber-attack before it happens

Kentik

This is part 1 of 3 in a blog series about how to fortify your security posture with Kentik. Kentik is crucial in strengthening the security posture for our customers before, during, and after a cyber attack. We do this by using deeply enriched network data from across your entire data center, cloud, and container footprint to prevent, detect, and respond to cyber threats.

Port 52
article thumbnail

Cyber Detection: A Must-Have in Primary Storage

Dataversity

Enterprise storage is a critical component of a comprehensive corporate cybersecurity strategy. If an enterprise does not include cyber storage resilience in their measures to secure their enterprise IT infrastructure, it’s the equivalent of going on vacation and leaving the back door and back windows of your house open, so you have made it easier for criminals […] The post Cyber Detection: A Must-Have in Primary Storage appeared first on DATAVERSITY.

article thumbnail

Introducing Materialized Views and Streaming Tables for Databricks SQL

databricks

We are thrilled to announce that materialized views and streaming tables are now publicly available in Databricks SQL on AWS and Azure. Streaming.

82
article thumbnail

Day Two Cloud 200: Coaching For Accidental (And On-Purpose) Managers

Packet Pushers

Going from a tech role to manager is more than just a new gig it's a full-blown career change. On today's Day Two Cloud we talk with management coach Steve Dwire about a manager's primary responsibilities, what new managers usually get wrong, management education vs. experience, and how to get better at the job. This episode goes places we didn't expect, so come along for the ride.

Cloud 52
article thumbnail

What is Packet Duplication & How to Identify It

Obkio

Discover the secrets of packet duplication in networks, learn how to identify it, and unleash the power of Obkio's monitoring tool to tackle the issue.

article thumbnail

Enhancing Security and Asset Management with AI/ML in Cato Networks’ SASE Product

CATO Networks

We just introduced what we believe is a unique application of real-time, deep learning (DL) algorithms to network prevention. The announcement is hardly our foray into artificial intelligence (AI) and machine learning (ML). The technologies have long played a pivotal role in augmenting Cato’s SASE security and networking capabilities, enabling advanced threat prevention and efficient asset management.

SASE 52
article thumbnail

Helping Enterprises Responsibly Deploy AI

databricks

The promise of artificial intelligence (AI) is undeniable, but its enormous potential also comes with enormous responsibilities. Companies and organizations around the world.

article thumbnail

HN687 Juniper CORA Coherent Optics Enabling IPoDWDM

Packet Pushers

Its about reducing the cost and complexity of DWDM coherent optical networks. Connecting the DWDM network directly to your router removes the DWDM edge equipment which simplifies operation, reduce cost,space & power while improving provisioning time. How is Juniper entering this market and what do you need to know ? Its about reducing the cost and complexity of DWDM coherent optical networks.

Routers 52
article thumbnail

You Are Overspending on Cloud and SaaS: Here’s Why and What to Do

Dataversity

The shift to public cloud, private cloud, and SaaS is ubiquitous and occurring at an accelerating pace. The benefits of well-known cloud services and infrastructure are easier to deploy and manage and, typically, are cheaper and more efficient than operating a data center. Those same benefits, however, also introduce the potential for overspending, orphaned resources, and duplicate […] The post You Are Overspending on Cloud and SaaS: Here’s Why and What to Do appeared first on DATAVERSITY.

Cloud 52
article thumbnail

What is a UUID, and what is it used for?

Cockroach Labs

When working with a database, its common practice to use some kind of id field to provide a unique identifier for each row in a table. Imagine, for example, a customers table. We wouldnt want to use fields such as name or address as unique identifiers because its possible more than one customer could have the same name, or share the same address, or in some cases even both!

40
article thumbnail

What’s New in Data Engineering and Streaming at Data + AI Summit 2023

databricks

It's Thursday and we are fresh off a week of announcements from the 2023 Data + AI Summit. The theme of this year's.

article thumbnail

Heavy Wireless 005: How To Build A Wi-Fi Community With Ferney Munoz

Packet Pushers

Have you ever wanted to build a community of professionals in your field, but didn't know where to start? In this episode of the Heavy Wireless podcast, Keith Parsons interviews Ferney Munoz, founder of the Tes@s en Wi-Fi community in Latin America, to learn how he built a successful community of Wi-Fi professionals. Have you ever wanted to build a community of professionals in your field, but didn't know where to start?