Modern development workflow: Branching a database should be as easy as branching a code repository, and it should be near instantaneous.
This approach democratizes agent development, allowing domain experts to contribute directly to system improvement without deep technical expertise in AI infrastructure.
Recapping the Cloud Amplifier and Snowflake Demo
The combined power of Snowflake and Domo’s Cloud Amplifier is one of the best-kept secrets in data management right now, and we’re reaching new heights every day. If you missed our demo, we dive into the technical intricacies of architecting it below.
Why Snowflake?
We also offer hosted and on-premise versions with OCR, extra metadata, all embedding providers, and managed vector databases for teams that want a fully managed pipeline. Or book a demo: https://cal.com/shreyashn/chonkie-demo.
Traditional database management tasks, including backups, upgrades, and routine maintenance, also drain valuable time and resources, hindering innovation. By using fit-for-purpose databases, customers can run each workload on the appropriate engine, optimizing analytics for the best price-performance.
The SnapLogic Intelligent Integration Platform (IIP) enables organizations to realize enterprise-wide automation by connecting their entire ecosystem of applications, databases, big data, machines and devices, APIs, and more with pre-built, intelligent connectors called Snaps.
The assistant is connected to internal and external systems, with the capability to query various sources such as SQL databases, Amazon CloudWatch logs, and third-party tools to check the live system health status.
Creating ETL pipelines to transform log data
Preparing your data to provide quality results is the first step in an AI project.
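The transform step of such a log-processing pipeline can be sketched in a few lines. This is a minimal illustration only; the log format, field names, and sample lines below are hypothetical, not the actual CloudWatch schema used in the article.

```python
import re

# Hypothetical log format: "<timestamp> <LEVEL> <message>".
LOG_PATTERN = re.compile(r"^(?P<timestamp>\S+) (?P<level>[A-Z]+) (?P<message>.*)$")

def parse_log_lines(lines):
    """Turn raw log lines into structured records, skipping malformed ones."""
    records = []
    for line in lines:
        match = LOG_PATTERN.match(line.strip())
        if match:
            records.append(match.groupdict())
    return records

raw = [
    "2024-05-01T12:00:00Z ERROR connection refused",
    "not a log line",
    "2024-05-01T12:00:05Z INFO retry succeeded",
]
parsed = parse_log_lines(raw)
# Only the two well-formed lines survive the transform step.
```

In a real pipeline this parsing would run inside the ETL job, with the structured records loaded into the analytics store for retrieval.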
Data engineers can create and manage extract, transform, and load (ETL) pipelines directly within Unified Studio using Visual ETL. For Project name, enter a name (for example, demo). Expand your database starting from glue_db_. The admin also publishes the data to SageMaker Catalog in SageMaker Lakehouse.
With the SQL editor, you can query data lakes, databases, data warehouses, and federated data sources. Under Quick setup settings, for Name, enter a name (for example, demo). For Project name, enter a name (for example, demo). Expand your database starting from glue_db_. Choose Continue.
The general perception is that you can simply feed data into an embedding model to generate vector embeddings and then transfer these vectors into your vector database to retrieve the desired results.
How to perform a vector search
Many vector database providers promote their capabilities with descriptors like easy, user-friendly, and simple.
It’s a foundational skill for working with relational databases
Just about every data scientist or analyst will have to work with relational databases in their career. Another boon for efficient work is SQL’s simple, consistent syntax, which allows for collaboration across multiple databases.
As you can see in the demo above, it is incredibly simple to use the INFER_SCHEMA and SCHEMA EVOLUTION features to speed up data ingestion into Snowflake. There’s no need for developers or analysts to manually adjust table schemas or modify ETL (Extract, Transform, Load) processes whenever the source data structure changes.
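The idea behind automatic schema evolution can be sketched without Snowflake at all: compare an incoming record's fields to the target table's columns and add any that are missing. The sketch below uses SQLite purely for illustration; it is not Snowflake's implementation, and the table and field names are made up.

```python
import sqlite3

def evolve_schema(conn, table, record):
    """Toy schema evolution: add any columns present in the incoming
    record but missing from the table, so ingestion never breaks."""
    existing = {row[1] for row in conn.execute(f"PRAGMA table_info({table})")}
    for column in record:
        if column not in existing:
            conn.execute(f"ALTER TABLE {table} ADD COLUMN {column}")

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE events (id INTEGER)")

# A new field appears in the source data; the table adapts automatically.
evolve_schema(conn, "events", {"id": 1, "user_agent": "curl/8.0"})
conn.execute("INSERT INTO events (id, user_agent) VALUES (?, ?)", (1, "curl/8.0"))
columns = [row[1] for row in conn.execute("PRAGMA table_info(events)")]
```

Snowflake does this declaratively (ENABLE_SCHEMA_EVOLUTION on the table) rather than with hand-written DDL, which is what removes the manual maintenance burden.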
The evolution of Presto at Uber
Beginning of a data analytics journey
Uber began its analytical journey with a traditional analytical database platform at the core of its analytics. It then stood up a file-based data lake alongside the analytical database, and has since connected the Presto query engine to real-time databases.
The Lineage & Dataflow API is a good example, enabling customers to add ETL transformation logic to the lineage graph for the popular database SQL Server. In Alation, lineage provides the added advantages of being able to add data flow objects, such as ETL transformations, perform impact analysis, and manually edit lineage.
Data Wrangling: Data Quality, ETL, Databases, Big Data
The modern data analyst is expected to be able to source and retrieve their own data for analysis. Competence in data quality, databases, and ETL (Extract, Transform, Load) is essential.
Cloud Services: Google Cloud Platform, AWS, Azure.
At the time, I was at a technology conference. There was a software product demo showcasing its ability to scan every layer of your application code, and I was intrigued to see how it worked. The product collected an impressive amount of metadata, from the user interface to the database structure.
Reading & executing from .sql scripts
We can use .sql files that are opened and executed from the notebook through a database connector library.
connection_params: A dictionary containing PostgreSQL connection parameters, such as 'host', 'port', 'database', 'user', and 'password'.
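A minimal sketch of the pattern: read a .sql script and execute its statements through a DB-API connection. SQLite stands in here for a PostgreSQL connector such as psycopg2 (so the example is self-contained), and the script content is hypothetical; in practice you would load it with open("queries.sql").read() and connect using the connection_params dictionary above.

```python
import sqlite3

# Stand-in for the contents of a .sql file.
script = """
CREATE TABLE users (id INTEGER PRIMARY KEY, name TEXT);
INSERT INTO users (name) VALUES ('ada');
INSERT INTO users (name) VALUES ('grace');
"""

conn = sqlite3.connect(":memory:")
conn.executescript(script)  # runs every statement in the script, in order
names = [row[0] for row in conn.execute("SELECT name FROM users ORDER BY id")]
```

With psycopg2 the equivalent would be cursor.execute() per statement (or a single execute for a multi-statement string), since executescript is sqlite3-specific.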
The design draws on years of observing real-world Apache Spark workloads, codifying what we’ve learned into a declarative API that covers the most common patterns, including both batch and streaming flows.
Applications may draw data from different databases, sometimes in different formats. Creating a sustainable data culture means efficiently and accurately integrating data to help prevent future silos, either through scripting or Extract, Transform, and Load (ETL) tools.
We can then give you a demo, learn more about your monitoring needs, and help you to deploy or customize a solution for your organization. Tasks can be used to automate data processing workflows, such as ETL jobs, data ingestion, and data transformation. SQL commands allow users to create, modify, suspend, resume, and drop tasks.
SQL (Structured Query Language) is essential for querying and manipulating data stored in relational databases. The SELECT statement retrieves data from a database, while SELECT DISTINCT eliminates duplicate rows from the result set.
Data Warehousing and ETL Processes
What is a data warehouse, and why is it important?
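The SELECT versus SELECT DISTINCT distinction is easy to see with a tiny in-memory database. The table and column names below are illustrative only; SQLite is used so the example runs anywhere.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (customer TEXT)")
conn.executemany("INSERT INTO orders VALUES (?)",
                 [("alice",), ("bob",), ("alice",)])

# SELECT returns every row, duplicates included.
all_rows = [r[0] for r in conn.execute("SELECT customer FROM orders")]

# SELECT DISTINCT collapses duplicate rows in the result set.
unique_rows = [r[0] for r in conn.execute("SELECT DISTINCT customer FROM orders")]
```

Here all_rows has three entries while unique_rows has two, since the duplicate "alice" row is eliminated.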
Vector Database : A vector database is a specialized database designed to efficiently store, manage, and retrieve high-dimensional vectors, also known as vector embeddings. Vector databases support similarity search operations, allowing users to find vectors most similar to a given query vector.
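At its core, the similarity search a vector database performs can be sketched as a brute-force scan: score every stored vector against the query and return the top-k matches. The 3-dimensional "embeddings" and document ids below are toy values; a real system uses model-generated vectors and an index structure (e.g. HNSW) instead of a linear scan.

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm

def search(index, query, k=2):
    """Brute-force nearest-neighbor search over a dict of id -> vector."""
    scored = sorted(index.items(),
                    key=lambda item: cosine_similarity(item[1], query),
                    reverse=True)
    return [doc_id for doc_id, _ in scored[:k]]

index = {
    "doc_a": [1.0, 0.0, 0.0],
    "doc_b": [0.9, 0.1, 0.0],
    "doc_c": [0.0, 1.0, 0.0],
}
top = search(index, [1.0, 0.05, 0.0], k=2)  # most similar ids first
```

The value a dedicated vector database adds over this sketch is doing the same ranking approximately but in sub-linear time across millions of high-dimensional vectors.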
I'm JD, a Software Engineer with experience touching many parts of the stack (frontend, backend, databases, data & ETL pipelines, you name it). With over 3 years of working with ETL pipelines and REST API integrations and development, I understand how to develop and maintain robust and scalable data systems.
You can bring data from operational databases and applications into your lakehouse in near real time through zero-ETL integrations. To access RMS-backed catalog databases from Spark, each RMS database requires its own Spark session catalog configuration. For Project name, enter demo.
Databricks at SIGMOD 2025
Databricks is proud to be a platinum sponsor of SIGMOD 2025 in Berlin, Germany.
Accepted Demo Papers
Blink twice - automatic workload pinning and regression detection for Versionless Apache Spark using retries.
I've built the archival and database software on Lucee & MySQL to store images and automate processing, and I use OpenAI to analyze images and extract metadata.
Demos & tutorials: Harness Builder → https://www.youtube.com/watch?v=JfQVB_iTD1I
Both are in a pretty rough state, but usable for the intrepid.