Data engineers build data pipelines, also called data integration tasks or jobs, as incremental steps that perform data operations, and they orchestrate these pipelines into an overall workflow. Done well, this lets organizations harness the full potential of their data while reducing risk and lowering costs.
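As a rough illustration of that pattern, here is a minimal sketch of incremental tasks orchestrated into a workflow with Apache Airflow; the DAG id, task names, and schedule are illustrative assumptions, not details from the excerpt.

```python
# Minimal Airflow sketch: three incremental pipeline steps chained into
# one orchestrated workflow. All names here are hypothetical.
from datetime import datetime

from airflow import DAG
from airflow.operators.python import PythonOperator

def extract():
    print("pull the day's raw records")      # placeholder data operation

def transform():
    print("clean and reshape the records")   # placeholder data operation

def load():
    print("write results to the warehouse")  # placeholder data operation

with DAG(
    dag_id="example_pipeline",
    start_date=datetime(2024, 1, 1),
    schedule_interval="@daily",
    catchup=False,
) as dag:
    t_extract = PythonOperator(task_id="extract", python_callable=extract)
    t_transform = PythonOperator(task_id="transform", python_callable=transform)
    t_load = PythonOperator(task_id="load", python_callable=load)

    # Orchestration: each incremental step runs after the previous one.
    t_extract >> t_transform >> t_load
```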
OMRON's data strategy, represented on ODAP, also allowed the organization to unlock generative AI use cases focused on tangible business outcomes and enhanced productivity. Xinyi Zhou is a Data Engineer at Omron Europe, bringing her expertise to the ODAP team led by Emrah Kaya.
Snowflake’s cloud-agnosticism, separation of storage and compute resources, and ability to handle semi-structured data have established it as a best-in-class cloud data warehousing solution. Snowflake also supports data sharing and collaboration across organizations without the need for complex data pipelines.
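For instance, Snowflake stores raw JSON in VARIANT columns that can be queried with path expressions. A hedged sketch using the official Python connector follows; the connection parameters, table name, and JSON paths are all placeholders, not details from the excerpt.

```python
# Hedged sketch: querying semi-structured JSON in Snowflake from Python.
# Connection values, table name, and JSON paths are hypothetical.
import snowflake.connector

conn = snowflake.connector.connect(
    account="<account>", user="<user>", password="<password>",
    warehouse="<warehouse>", database="<db>", schema="<schema>",
)
cur = conn.cursor()
# A VARIANT column (here: payload) holds raw JSON; path syntax plus a
# ::cast pulls typed fields out without a rigid upfront schema.
cur.execute("""
    SELECT payload:device.id::STRING AS device_id,
           payload:reading::FLOAT    AS reading
    FROM raw_events
""")
for device_id, reading in cur.fetchall():
    print(device_id, reading)
conn.close()
```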
Key Features Tailored for Data Science: These platforms offer specialised features to enhance productivity. Managed services like AWS Lambda and Azure Data Factory streamline data pipeline creation, while pre-built ML models in GCP's AI Hub reduce development time. Below are key strategies for achieving this.
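To make the AWS Lambda point concrete, here is a minimal handler for one pipeline step, assuming (hypothetically) that it is triggered by S3 object-created notifications.

```python
# Minimal Lambda handler sketch for a pipeline step. The S3 event shape
# is the standard notification format; the processing is a placeholder.
import json

def lambda_handler(event, context):
    records = event.get("Records", [])
    for record in records:
        bucket = record["s3"]["bucket"]["name"]
        key = record["s3"]["object"]["key"]
        print(f"processing s3://{bucket}/{key}")  # transform/route the object here
    return {"statusCode": 200, "body": json.dumps({"processed": len(records)})}
```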
Big Data Technologies: Handling and processing large datasets using tools like Hadoop, Spark, and cloud platforms such as AWS and Google Cloud. Data Processing and Analysis: Techniques for data cleaning, manipulation, and analysis using libraries such as Pandas and NumPy in Python.
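A short, self-contained example of those cleaning and manipulation steps with Pandas and NumPy; the columns and values are invented for illustration.

```python
# Tiny cleaning sketch with pandas + NumPy; all data is made up.
import numpy as np
import pandas as pd

df = pd.DataFrame({
    "price": ["10.5", "n/a", "7.25", "13.0"],   # messy string column
    "qty":   [1, 2, None, 4],                   # has a missing value
})
df["price"] = pd.to_numeric(df["price"], errors="coerce")  # "n/a" -> NaN
df["qty"] = df["qty"].fillna(0).astype(int)                # impute missing qty
df = df.dropna(subset=["price"])                           # drop unparseable rows
df["log_price"] = np.log(df["price"])                      # NumPy transform
print(df)
```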
Purina used artificial intelligence (AI) and machine learning (ML) to automate animal breed detection at scale. Tayo Olajide is a seasoned Cloud Data Engineering generalist with over a decade of experience in architecting and implementing data solutions in cloud environments.
JuMa is tightly integrated with a range of BMW Central IT services, including identity and access management, roles and rights management, BMW Cloud Data Hub (BMW’s data lake on AWS) and on-premises databases. Furthermore, the notebooks can be integrated into the corporate Git repositories to collaborate using version control.
And they’re still a key element of the infrastructure that makes private clouds possible at many organizations. Instead of performing major surgery on their critical business systems, enterprises are opting for real-time data integration built around inherently reliable and scalable change data capture (CDC) technology.
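The core CDC idea: rather than re-querying whole tables, replicate only what changed, tracked by a monotonically increasing marker. Real CDC tools read the database's transaction log; the sketch below fakes that with a polled change table in SQLite, and every identifier in it is hypothetical.

```python
# Simplified CDC sketch: poll a change table past a high-water mark.
# Production CDC reads transaction logs; this shows only the concept.
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE change_log (change_id INTEGER PRIMARY KEY, op TEXT, payload TEXT)")
conn.execute("INSERT INTO change_log (op, payload) VALUES ('INSERT', '{\"id\": 1}')")

last_seen = 0  # high-water mark: last change already replicated

def apply_to_target(op, payload):
    print("replicating:", op, payload)  # stand-in for writing to a target system

def poll_changes():
    global last_seen
    rows = conn.execute(
        "SELECT change_id, op, payload FROM change_log "
        "WHERE change_id > ? ORDER BY change_id",
        (last_seen,),
    ).fetchall()
    for change_id, op, payload in rows:
        apply_to_target(op, payload)
        last_seen = change_id  # advance the mark so each change ships once

poll_changes()  # in production this loop runs continuously, near real time
```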
Amazon SageMaker Ground Truth is a fully managed data labeling service that provides flexibility to build and manage custom workflows. With Ground Truth, you can label image, video, and point cloud data for object detection, object tracking, and semantic segmentation tasks.
Data Engineering Summit: Our second annual Data Engineering Summit will be in-person for the first time! Like our first Data Engineering Summit, this event will bring together the leading experts in data engineering and thousands of practitioners to explore different strategies for making data actionable.
As enterprise technology landscapes grow more complex, the role of data integration is more critical than ever before. Wide support for enterprise-grade sources and targets: Large organizations with complex IT landscapes must have the capability to easily connect to a wide variety of data sources.
This open-source streaming platform enables the handling of high-throughput data feeds, ensuring that data pipelines are efficient, reliable, and capable of handling massive volumes of data in real time. Its open-source nature means it’s continually evolving, thanks to contributions from its user community.
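The excerpt doesn't name the platform, but the description matches Apache Kafka; assuming that, here is a minimal producer sketch using the kafka-python package, with the broker address and topic name as placeholders.

```python
# Hedged Kafka producer sketch (kafka-python). Broker and topic are
# placeholders; a real deployment has multiple brokers and partitions.
import json
from kafka import KafkaProducer

producer = KafkaProducer(
    bootstrap_servers="localhost:9092",
    value_serializer=lambda v: json.dumps(v).encode("utf-8"),
)
for i in range(1000):
    # send() is asynchronous and batches records for high throughput
    producer.send("events", {"event_id": i, "source": "sensor-a"})
producer.flush()  # block until all queued records are delivered
```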
These tools are used to manage big data, which is defined as data that is too large or complex to be processed by traditional means. How Did the Modern Data Stack Get Started? The rise of cloud computing and cloud data warehousing has catalyzed the growth of the modern data stack.
Whatever your approach may be, enterprise data integration has taken on strategic importance. Artificial intelligence (AI) algorithms are trained to detect anomalies. Today’s enterprises need real-time or near-real-time performance, depending on the specific application. Timing matters.
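One common way to train an algorithm for anomaly detection (not necessarily what any particular vendor does) is an unsupervised model such as scikit-learn's IsolationForest: fit it on mostly normal data, then flag records that fall far outside it.

```python
# Anomaly-detection sketch with an unsupervised model; data is synthetic.
import numpy as np
from sklearn.ensemble import IsolationForest

rng = np.random.default_rng(0)
normal = rng.normal(loc=0.0, scale=1.0, size=(1000, 2))  # typical records

model = IsolationForest(contamination=0.01, random_state=0).fit(normal)

new_points = np.array([[0.1, -0.2],   # looks like the training data
                       [8.0, 9.0]])   # far outside it
print(model.predict(new_points))      # 1 = normal, -1 = anomaly
```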
Large manufacturers are starting to use computer vision artificial intelligence (AI) to detect defects more cheaply and efficiently than human eyes can. Detecting product defects can be time-consuming, costly (both in paying people to catch defects and in the consequences of missing them), and tedious.
Cloud Adoption Will Continue Steadily Cloud computing and its inherent scalability and elasticity offer distinct advantages, especially with respect to AI/ML and advanced analytics. As cloud data platforms and powerful analytics tools gain in popularity, the march toward the cloud continues at a rapid pace.
Real-time analytics and BI: Combine data from existing sources with new data to unlock new, faster insights without the cost and complexity of duplicating and moving data across different environments. The post Exploring the AI and data capabilities of watsonx appeared first on IBM Blog.
Talend: Talend is a leading open-source ETL platform that offers comprehensive solutions for data integration, data quality, and cloud data management. It supports both batch and real-time data processing, making it highly versatile. It is well known for its data provenance and seamless data routing capabilities.
With the birth of cloud data warehouses, data applications, and generative AI, processing large volumes of data faster and cheaper is more approachable and desired than ever. First up, let’s dive into the foundation of every Modern Data Stack: a cloud-based data warehouse.
[link] Ahmad Khan, head of artificial intelligence and machine learning strategy at Snowflake, gave a presentation entitled “Scalable SQL + Python ML Pipelines in the Cloud” about his company’s Snowpark service at Snorkel AI’s Future of Data-Centric AI virtual conference in August 2022.
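For readers unfamiliar with Snowpark, the pattern the talk's title describes looks roughly like this: Python DataFrame code that compiles to SQL and runs inside Snowflake. The sketch below is not taken from the presentation; the connection settings and table names are placeholders.

```python
# Hedged Snowpark sketch: a Python DataFrame pipeline pushed down to
# Snowflake as SQL. All identifiers here are hypothetical.
from snowflake.snowpark import Session
from snowflake.snowpark.functions import avg, col

session = Session.builder.configs({
    "account": "<account>", "user": "<user>", "password": "<password>",
    "warehouse": "<warehouse>", "database": "<db>", "schema": "<schema>",
}).create()

features = (
    session.table("raw_events")                 # lazy reference, no data pulled
    .filter(col("reading").is_not_null())       # SQL WHERE, generated from Python
    .group_by("device_id")
    .agg(avg(col("reading")).alias("avg_reading"))
)
# Executes entirely inside Snowflake and materializes a feature table.
features.write.save_as_table("ml_features", mode="overwrite")
```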