April, 2023

Free Dolly: Introducing the World's First Truly Open Instruction-Tuned LLM

databricks

Two weeks ago, we released Dolly, a large language model (LLM) trained for less than $30 to exhibit ChatGPT-like human interactivity (aka instruction-following).

Ukraine’s Wartime Internet from the Inside

Kentik

This February marked a grim milestone in the ongoing war in Ukraine. It has now been over a year since Russian forces invaded its neighbor to the west, leading to the largest conflict in Europe since World War II. In the past year, we have used Kentik’s unique datasets to show some of the conflict’s impacts on Ukraine’s external internet connectivity, ranging from DDoS attacks and large outages to the rerouting of internet service in the southern region of Kherson.

Internet 145

Trending Sources

Article: Software Architecture and Design InfoQ Trends Report - April 2023

InfoQ Articles

This article provides an overview of how the InfoQ editorial team sees the Software Architecture and Design topic evolving in 2023, with a focus on what architects are designing for today.

Build faster with Buck2: Our open source build system

Engineering at Meta

Buck2, our new open source, large-scale build system, is now available on GitHub. Buck2 is an extensible and performant build system written in Rust and designed to make your build experience faster and more efficient. In our internal tests at Meta, we observed that Buck2 completed builds 2x as fast as Buck1. Buck2, Meta’s open source large-scale build system, is now publicly available via the Buck2 website and the Buck2 GitHub repository.

The BEST Resources to Level Up Your Data Streaming Knowledge!

Confluent

All the best data streaming resources, tips, and guides to help you learn introductory concepts, streaming architecture basics, common tools and technologies, and more.

Internal developer platforms and the cult of Kubernetes

Ben Morris

We all agree that engineering enablement is important, but platform teams can often be a fig leaf for organisational anti-patterns and overly complex Kubernetes implementations.

Enroll in our New Expert-Led Large Language Models (LLMs) Courses on edX

databricks

Enroll in the introductory course on edX today! The course will begin Summer 2023. New Large Language Model Courses with edX As Large.

More Trending

Article: The Silent Platform Revolution: How eBPF Is Fundamentally Transforming Cloud-Native Platforms

InfoQ Articles

There is a silent eBPF revolution reshaping platforms and the cloud-native world in its image, and this is its story.

Cloud 112

Deploying key transparency at WhatsApp

Engineering at Meta

WhatsApp has launched a new cryptographic security feature to automatically verify a secured connection based on key transparency. The feature requires no additional actions or steps from users and helps ensure that a conversation is secure. Key transparency solutions help strengthen the guarantee that end-to-end encryption provides to private, personal messaging applications in a transparent manner available to all.

Uniting the Machine Learning and Data Streaming Ecosystems - Part 2

Confluent

Machine learning and data streaming are a perfect match, but have diverging tech stacks. How can we overcome the pitfalls of SQL and the gulf between languages?

DoorDash identifies Five big areas for using Generative AI

DoorDash Engineering

In the wake of ChatGPT and Generative AI, DoorDash is identifying ways this new technology can enhance the customer’s ordering experience on the platform. The company is exploring the use of Generative AI, a subset of Artificial Intelligence that generates novel content based on existing data, and how it can be implemented effectively with consideration for the privacy and security of personal information.

How We Performed ETL on One Billion Records For Under $1 With Delta Live Tables

databricks

Today, Databricks sets a new standard for ETL (Extract, Transform, Load) price and performance. While customers have been using Databricks for their ETL.

Practical Steps for Enhancing Reliability in Cloud Networks - Part I

Kentik

When evaluating solutions, whether to internal problems or those of our customers, I like to keep the core metrics fairly simple: will this reduce costs, increase performance, or improve the network’s reliability? It’s often taken for granted by network specialists that there is a trade-off among these three facets. If a solution is cheap, it is probably not very performant or particularly reliable.

Cloud 104

Article: Rapid Startup of Your Cloud-Native Java Applications Without Compromise

InfoQ Articles

This article discusses the significance of startup time in cloud-native computing, highlighting challenges for JVM-based apps. It introduces Liberty InstantOn, which boosts startup times using checkpoint/restore technology, offering fast startup without compromising Java capabilities or facing static compilation trade-offs.

Cloud 108

How Device Verification protects your WhatsApp account

Engineering at Meta

WhatsApp has launched a new security feature that further helps prevent attackers from using vectors like on-device malware. This security feature, called Device Verification, requires no action or additional steps from users and helps protect your account. This feature is part of our broader work to increase security for our users from the growing threat of malware.

Server 144

Cloud Architecture Mistakes: Inadequate Showback and Chargeback Options Escalate Costs

Dataversity

In this five-part series, I’m taking a hard look at the common – and costly – mistakes organizations typically make while building a cloud architecture. Part one explained how organizations can quickly lose visibility and control over their data processing, and detailed how to avoid that mistake. Part two looked at why a DIY approach often […] The post Cloud Architecture Mistakes: Inadequate Showback and Chargeback Options Escalate Costs appeared first on DATAVERSITY.

Cloud 98

Introducing Foundational Templates

Mixpanel

We’re always working on making it easier to get started with Mixpanel. But still, it can be hard to know what questions to ask in order to begin analyzing your data. If you’re new to product analytics, the possibilities can seem endless. And even seasoned pros are looking for a way to get to insights faster. If you’re a Mixpanel user, you can try out templates now by clicking the above image.

Introducing Apache Spark™ 3.4 for Databricks Runtime 13.0

databricks

Today, we are happy to announce the availability of Apache Spark™ 3.4 on Databricks as part of Databricks Runtime 13.0. We extend our s.

Kafka Summit London 2023: Level Up Your Kafka Experience!

Confluent

Kafka Summit 2023 brings 60+ sessions, keynotes, lightning talks, and more from industry leaders. Check out the agenda, highlights, networking events, and other event info.

Network 75

Article: Dark Side of DevOps - the Price of Shifting Left and Ways to Make it Affordable

InfoQ Articles

Topics like “you build it, you run it” and “shifting testing/security/data governance left” are popular: moving things to earlier stages of software development and empowering engineers. Yet, what is the cost? What does it mean for the developers who are involved? What are the solutions that can help you keep DevOps and Shifting Left? What can we do to break the grip of the dark side?

DevOps 100

A fine-grained network traffic analysis with Millisampler

Engineering at Meta

What the research is: Millisampler is one of Meta’s latest characterization tools and allows us to observe, characterize, and debug network performance at high-granularity timescales efficiently. This lightweight network traffic characterization tool for continual monitoring operates at fine, configurable timescales. It collects time series of ingress and egress traffic volumes, number of active flows, incoming ECN marks, and ingress and egress retransmissions.

Network 121

Why CIOs Should Invest in Business Automation

Dataversity

In an increasingly challenging economic environment, it’s essential that chief information officers (CIOs) take a thoughtful approach to investing capital. The 2023 Gartner® CIO and Technology Executive Survey found that more than half of digital initiatives lag behind leadership expectations, with 59% reporting the initiatives take too long to complete and 52% reporting the initiatives take too […] The post Why CIOs Should Invest in Business Automation appeared first on DATAVERSITY.

Our next step: Analytics for everyone

Mixpanel

Companies change as they grow, and that’s no different for us. Today, we’re launching our new brand. And it’s a lot more than a refreshed look. Our last big change was a little over three years ago when we went all in on product-led growth (PLG) to get more people into Mixpanel faster and find the product and business answers they needed. We ripped up the B2B SaaS rules and playbook by rolling out a “show, don’t tell” design in the product; ungating content so you don’t have to fill out fo

A data architecture pattern to maximize the value of the Lakehouse

databricks

One of Lakehouse's outstanding achievements is the ability to combine workloads for modern use cases, such as traditional BI, machine learning & AI.

2022 Summer Intern Projects Article #3

DoorDash Engineering

DoorDash offers our summer interns the opportunity to fully integrate with Engineering teams to get the kind of real industry experience that is not taught in the classroom. This is the third blog post in a series of articles showcasing our 2022 summer intern projects. If you missed the first or second article, the links are here and here. You can read about each project below.

Banking 75

Article: Agility and Architecture

InfoQ Articles

Software architecture and agility are often portrayed as incompatible. In reality, they are mutually reinforcing - a sound architecture helps teams build better solutions in a series of short intervals, and gradually evolving a system’s architecture helps by validating and improving it over time.

Meet Steven, Our April 2023 Confluent Champion

Confluent

Steven Zhang is a senior software engineer in the Stream Processing and Analytics organization and is shaping the groundwork for Confluent's upcoming Flink integration.

Five Must-Have Characteristics of Extraordinary Data Scientists

Dataversity

There’s no better time than right now to be a data scientist. Despite recent large-scale layoffs in major tech firms, the future is bright for data managers, analysts, data wranglers, and consultants. In fact, the number of jobs requiring Data Science skills is expected to grow by 27.9% by 2026, according to the U.S. Bureau of Labor […] The post Five Must-Have Characteristics of Extraordinary Data Scientists appeared first on DATAVERSITY.

Data Professional Introspective: Capability Maturity Model Comparison

TDAN

For the Enterprise Data Management Council (EDMC), I recently concluded a detailed comparison and mapping of the DMM to the DCAM with the Council’s Product Manager, employing the Soladatus modeling tool with its robust model mapping features. As you might expect, it revealed structural differences, strengths, and gaps in both models.

Introducing AI Functions: Integrating Large Language Models with Databricks SQL

databricks

With all the incredible progress being made in the space of Large Language Models, customers have asked us how they can enable their.

NetOps for Application Developers: Understanding the Importance of Network Operations in Modern Development

Kentik

One of the great successes of software development in the last ten years has been the relatively decentralized approach to application development made available by containerization, allowing for rapid iteration, service-specific stacks, and (sometimes) elegant deployment and orchestration implementations that piece it all together. At scale, and primarily when carried out in cloud and hybrid-cloud environments, these distributed, service-oriented architectures and deployment strategies create a

Article: Unleash the Power of Open Source Java Profilers: Comparing VisualVM, JMC, and async-profiler

InfoQ Articles

This article conveys the foundational concepts and different types of Open Source Java profilers. It allows you to choose the best-suited profiler for your needs and comprehend how these tools work in principle. The aim of a profiler is to obtain information on the program execution so that a developer can see how much time a method spent executing in a given period.

Top 5 Data Technology Trends: Govern It, Stream It, Set It Free

Confluent

As businesses move to meet modern demands, these technologies ensure not only a digital transformation but also a data transformation, with new use cases surrounding real-time data.

Cloud Architecture Mistakes: The High Costs of a DIY Mindset

Dataversity

This is a five-part series about the costly mistakes organizations commonly make while building a cloud architecture. Part one explained how organizations moving to the cloud can quickly lose visibility and control over their data processing and detailed how to avoid that mistake. Part two looks at the ways doing it yourself can go wrong. What would […] The post Cloud Architecture Mistakes: The High Costs of a DIY Mindset appeared first on DATAVERSITY.

Cloud 98

Principles of Cloud Data Governance For Banks

TDAN

Banks – and their data volumes – are at the epicenter of the world’s digital transformation. The pace of change mirrors the velocity, volume, and variety of data within the industry. It is where new products, new markets, and new touchpoints mean new – often cloud-based – ways to do business in financial services.

Banking 59

Introducing MLflow 2.3: Enhanced with Native LLM Support and New Features

databricks

With over 13 million monthly downloads, MLflow has established itself as the premier platform for end-to-end MLOps, empowering teams of all sizes to.

Data-Driven Defense: Exploring Global Cybersecurity and the Human Factor

Kentik

A security breach often manifests itself in some sort of performance degradation of services — a slow network, an application that isn’t behaving correctly, or in some scenarios, a complete hard down. But this isn’t always the case. Especially when an attacker is interested in covert data exfiltration, a breach may go unnoticed for weeks, months, or even years.