Clustering, Data Lakes and Demo - Data Science Current

Unleashing the power of Presto: The Uber case study

IBM Journey to AI blog

SEPTEMBER 25, 2023

When a query is constructed, it passes through a cost-based optimizer, then data is accessed through connectors, cached for performance and analyzed across a series of servers in a cluster. Because of its distributed nature, Presto scales for petabytes and exabytes of data.

Data Lakes

Data Lakes Analytics Analytics Clustering

Build ML features at scale with Amazon SageMaker Feature Store using data from Amazon Redshift

Flipboard

AUGUST 17, 2023

Amazon Redshift uses SQL to analyze structured and semi-structured data across data warehouses, operational databases, and data lakes, using AWS-designed hardware and ML to deliver the best price-performance at any scale. Enter a stack name, such as Demo-Redshift. yaml locally.

ML

ML ML AWS Data Warehouse

Pictures and Highlights from ODSC Europe 2023

ODSC - Open Data Science

JULY 22, 2023

Expo Hall ODSC events are more than just data science training and networking events. On both days, we had our AI Expo & Demo Hall where over a dozen of our partners set up to showcase their latest developments, tools, frameworks, and other offerings. You can read the recap here and watch the full keynote here.

Apache Kafka

Apache Kafka Machine Learning Machine Learning Data Science

Webinars

What’s New in Apache Airflow® 3.0—And How Will It Reshape Your Data Workflows?

MORE WEBINARS

Content filtering breakthrough: Snorkel client reaches 96% recall in 3 days

Snorkel AI

MARCH 26, 2024

Snorkel Flow’s programmatic labeling process starts with labeling functions—essentially programmable rules to label data. Snorkel Flow users can build labeling functions according to various data features—from continuous variable thresholds to vector embedding clusters. Book a demo today.

Machine Learning

Machine Learning Machine Learning Data Lakes Data Science

Content filtering breakthrough: Snorkel client reaches 96% recall in 3 days

Snorkel AI

MARCH 26, 2024

Snorkel Flow’s programmatic labeling process starts with labeling functions—essentially programmable rules to label data. Snorkel Flow users can build labeling functions according to various data features—from continuous variable thresholds to vector embedding clusters. Book a demo today.

Machine Learning

Machine Learning Machine Learning Data Lakes AI

Why Silicon Valley is the Go-To Place for Artificial Intelligence

ODSC - Open Data Science

AUGUST 7, 2023

Databricks Databricks is the developer of Delta Lake, an open-source project that brings reliability to data lakes for machine learning and other cases. Their platform was developed for working with Spark and provides automated cluster management and Python-style notebooks.

Artificial Intelligence

Artificial Intelligence Artificial Intelligence Machine Learning Machine Learning

Snowflake Snowpark: cloud SQL and Python ML pipelines

Snorkel AI

MAY 26, 2023

It won’t be a long demo, it’ll be a very quick demo of what you can do and how you can operationalize stuff in Snowflake. And so data scientists might be leveraging one compute service and might be leveraging an extracted CSV for their experimentation. The demo is actually very simple.

SQL

SQL ML ML Python

Snowflake Snowpark: cloud SQL and Python ML pipelines

Snorkel AI

MAY 26, 2023

It won’t be a long demo, it’ll be a very quick demo of what you can do and how you can operationalize stuff in Snowflake. And so data scientists might be leveraging one compute service and might be leveraging an extracted CSV for their experimentation. The demo is actually very simple.

SQL

SQL ML ML Python

MLOps Landscape in 2023: Top Tools and Platforms

The MLOps Blog

JUNE 27, 2023

It provides tools and components to facilitate end-to-end ML workflows, including data preprocessing, training, serving, and monitoring. Kubeflow integrates with popular ML frameworks, supports versioning and collaboration, and simplifies the deployment and management of ML pipelines on Kubernetes clusters.

Machine Learning

Machine Learning Machine Learning ML ML

Definite Guide to Building a Machine Learning Platform

The MLOps Blog

MARCH 21, 2023

An ML platform standardizes the technology stack for your data team around best practices to reduce incidental complexities with machine learning and better enable teams across projects and workflows. We ask this during product demos, user and support calls, and on our MLOps LIVE podcast. Data engineers are mostly in charge of it.

Machine Learning

Machine Learning Machine Learning Data Scientist ML

What Does GPT-3 Mean For the Future of MLOps? With David Hershey

The MLOps Blog

JUNE 5, 2023

A lot of them are demos at that point, they’re still not products. You have your: feature store model registry data from a data lake The data is then moved across this workflow, modeled and then deployed, Now there’s a good link between your development environments and the production environment where it’s monitoring.

ML

ML ML Machine Learning Machine Learning

Building a Business with a Real-Time Analytics Stack, Streaming ML Without a Data Lake, and…

ODSC - Open Data Science

MAY 24, 2023

Building a Business with a Real-Time Analytics Stack, Streaming ML Without a Data Lake, and Google’s PaLM 2 Building a Pizza Delivery Service with a Real-Time Analytics Stack The best businesses react quickly and with informed decisions. Here’s a use case of how you can use a real-time analytics stack to build a pizza delivery service.

Data Lakes

Data Lakes ML ML Analytics

How to Build an End-To-End ML Pipeline

The MLOps Blog

MAY 9, 2023

The pipelines are interoperable to build a working system: Data (input) pipeline (data acquisition and feature management steps) This pipeline transports raw data from one location to another. Model/training pipeline This pipeline trains one or more models on the training data with preset hyperparameters. Kale v0.7.0.

ML

ML ML Machine Learning Machine Learning

Access Amazon Redshift Managed Storage tables through Apache Spark on AWS Glue and Amazon EMR using Amazon SageMaker Lakehouse

Flipboard

MAY 15, 2025

These organizations have a huge demand for lakehouse solutions that combine the best of data warehouses and data lakes to simplify data management with easy access to all data from their preferred engines. For Project name , enter demo. For Lakehouse catalog name , enter rms-catalog-demo.

AWS

AWS SQL Data Lakes Data Warehouse

Data Science Current

Unleashing the power of Presto: The Uber case study

Build ML features at scale with Amazon SageMaker Feature Store using data from Amazon Redshift

Webinars

Trending Sources

Pictures and Highlights from ODSC Europe 2023

Webinars

Content filtering breakthrough: Snorkel client reaches 96% recall in 3 days

Content filtering breakthrough: Snorkel client reaches 96% recall in 3 days

Why Silicon Valley is the Go-To Place for Artificial Intelligence

Snowflake Snowpark: cloud SQL and Python ML pipelines

Snowflake Snowpark: cloud SQL and Python ML pipelines

MLOps Landscape in 2023: Top Tools and Platforms

Definite Guide to Building a Machine Learning Platform

What Does GPT-3 Mean For the Future of MLOps? With David Hershey

Building a Business with a Real-Time Analytics Stack, Streaming ML Without a Data Lake, and…

How to Build an End-To-End ML Pipeline

Access Amazon Redshift Managed Storage tables through Apache Spark on AWS Glue and Amazon EMR using Amazon SageMaker Lakehouse

Stay Connected