Continuous Integration and Continuous Delivery (CI/CD) for Data Pipelines: a game-changer with AnalyticsCreator! The need for efficient and reliable data pipelines is paramount in data science and data engineering. They transform data into a consistent format for users to consume.
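To make the CI idea concrete, here is a minimal sketch of a pipeline check that could run on every commit; the transform, file names, and columns are hypothetical and not part of AnalyticsCreator's product.

```python
# test_pipeline.py -- a hypothetical CI check for one pipeline stage.
# Run in CI on every commit with: pytest test_pipeline.py
import pandas as pd

def transform(df: pd.DataFrame) -> pd.DataFrame:
    """Example transform: normalize column names and drop duplicate rows."""
    return df.rename(columns=str.lower).drop_duplicates()

def test_transform_output_is_consistent():
    raw = pd.DataFrame({"ID": [1, 1, 2], "Value": [10, 10, 20]})
    out = transform(raw)
    # The pipeline promises a consistent format: lowercase columns, no dupes.
    assert list(out.columns) == ["id", "value"]
    assert not out.duplicated().any()
```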
Though you may encounter the terms “data science” and “data analytics” being used interchangeably in conversations or online, they refer to two distinctly different concepts. Data science is the broader discipline of extracting knowledge from data; data analytics, meanwhile, is the act of examining datasets to extract value and find answers to specific questions.
Data engineering tools are software applications or frameworks designed to facilitate managing, processing, and transforming large volumes of data. Spark, for example, offers a rich set of libraries for data processing, machine learning, graph processing, and stream processing.
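As a small illustration of Spark's data-processing API, the PySpark snippet below aggregates events by day; the file and column names are assumptions for the example.

```python
# Minimal PySpark sketch (assumes a local Spark install; the CSV path
# and column names are hypothetical).
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = SparkSession.builder.appName("example").getOrCreate()

# Read raw events and count them per calendar day.
events = spark.read.csv("events.csv", header=True, inferSchema=True)
daily = (
    events
    .groupBy(F.to_date("timestamp").alias("day"))
    .agg(F.count("*").alias("events"))
)
daily.show()
spark.stop()
```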
Data engineering is a crucial field that plays a vital role in the data pipeline of any organization. It is the process of collecting, storing, managing, and analyzing large amounts of data, and data engineers are responsible for designing and implementing the systems and infrastructure that make this possible.
Data engineers are essential professionals responsible for designing, constructing, and maintaining an organization’s data infrastructure. They create data pipelines, ETL processes, and databases to facilitate smooth data flow and storage.
Learning these tools is crucial for building scalable data pipelines; data science courses covering these tools, with a job guarantee for career growth, are on offer. Introduction: Imagine a world where data is a messy jungle, and we need smart tools to turn it into useful insights.
Google Cloud Vertex AI provides a unified environment for both automated model development with AutoML and custom model training using popular frameworks. Qwak is a fully managed, accessible, and reliable ML platform to develop and deploy models and monitor the entire machine learning pipeline.
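For orientation, a rough sketch of the Vertex AI SDK's AutoML flow is below; the project, bucket, dataset, and column names are placeholders, and the current google-cloud-aiplatform documentation should be the reference.

```python
# Hypothetical Vertex AI AutoML training flow; all identifiers here
# (project, bucket, columns) are placeholders.
from google.cloud import aiplatform

aiplatform.init(project="my-project", location="us-central1")

# Register a tabular dataset from Cloud Storage.
dataset = aiplatform.TabularDataset.create(
    display_name="churn-data",
    gcs_source="gs://my-bucket/churn.csv",
)

# Launch an AutoML tabular classification job against it.
job = aiplatform.AutoMLTabularTrainingJob(
    display_name="churn-automl",
    optimization_prediction_type="classification",
)
model = job.run(dataset=dataset, target_column="churned")
```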
Summary: The fundamentals of Data Engineering encompass essential practices like data modelling, warehousing, pipelines, and integration. Understanding these concepts enables professionals to build robust systems that facilitate effective data management and insightful analysis. What is Data Engineering?
The Institute of Analytics is a non-profit organization that provides data science and analytics courses, workshops, certifications, research, and development. The courses and workshops cover a wide range of topics, from basic data science concepts to advanced machine learning techniques.
It simplifies feature access for model training and inference, significantly reducing the time and complexity involved in managing data pipelines. Additionally, Feast promotes feature reuse, so the time spent on data preparation is greatly reduced. Matúš Chládek is a Senior Engineering Manager for ML Ops at Zeta Global.
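To show what simplified feature access looks like in practice, here is a minimal Feast online lookup, assuming a feature repository already exists; the feature view and entity names are illustrative.

```python
# Minimal Feast online feature lookup (assumes a feature repo exists at
# repo_path; the feature view "driver_stats" and entity are hypothetical).
from feast import FeatureStore

store = FeatureStore(repo_path=".")

# Fetch the latest feature values for one entity at inference time.
features = store.get_online_features(
    features=[
        "driver_stats:trips_today",
        "driver_stats:avg_rating",
    ],
    entity_rows=[{"driver_id": 1001}],
).to_dict()
print(features)
```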
Unfortunately, even the data science industry — which should recognize tabular data’s true value — often underestimates its relevance in AI. Many mistakenly equate tabular data with business intelligence rather than AI, leading to a dismissive attitude toward its sophistication.
The following points illustrate some of the main reasons why data versioning is crucial to the success of any data science and machine learning project. Storage space: one of the reasons for versioning data is to be able to keep track of multiple versions of the same data, which obviously need to be stored as well.
In order to fully leverage this vast quantity of collected data, companies need a robust and scalable data infrastructure to manage it — data modeling, data cleanup, and so on. This is where Fivetran and the Modern Data Stack come in. Fivetran is a critical member of the Modern Data Stack. What is Fivetran?
How can we build up toward our vision in terms of solvable data problems and specific data products? What are the building blocks (e.g., data sources or simpler data models) of the data products we want to build? A data product may aim to automate a decision, or it may aim to assist a human decision-maker. What are we working towards?
In today’s landscape, AI is becoming a major focus in developing and deploying machine learning models. It isn’t just about writing code or creating algorithms — it requires robust pipelines that handle data, model training, deployment, and maintenance. Model Training: Running computations to learn from the data.
I’ve moved from building user interfaces and backend systems to designing data models, creating data pipelines, and gaining valuable insights from complex datasets. This journey has reshaped my career from a software generalist to a data specialist. Outside of work, what's your life like?
As you can imagine, data science is a pretty loose term or big-tent idea overall. Though just about every industry imaginable utilizes the skills of a data-focused professional, each has its own challenges, needs, and desired outcomes. What makes this job title unique is the “Swiss army knife” approach to data.
Production App - Build resilient and modular production pipelines with automation, scale, testing, observability, versioning, security, risk handling, etc. Monitoring - Monitor all resource, data, model, and application metrics to ensure performance.
This blog will delve into ETL tools, exploring the top contenders and their roles in modern data integration. Let’s unlock the power of ETL tools for seamless data handling. ETL is a process for moving and managing data from various sources to a central data warehouse.
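A toy end-to-end example of that process, with a CSV file as the source and SQLite standing in for the warehouse (the file names and schema are made up):

```python
# Toy ETL job: extract from CSV, transform in pandas, load into a
# SQLite "warehouse". File names and columns are hypothetical.
import sqlite3
import pandas as pd

# Extract: pull raw order records from the source file.
orders = pd.read_csv("orders.csv")  # e.g., order_id, amount, country

# Transform: normalize country codes and drop malformed rows.
orders["country"] = orders["country"].str.upper().str.strip()
orders = orders.dropna(subset=["order_id", "amount"])

# Load: write the cleaned table into the central store.
with sqlite3.connect("warehouse.db") as conn:
    orders.to_sql("orders", conn, if_exists="replace", index=False)
```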
It includes processes that trace and document the origin of data, models, and associated metadata, as well as pipelines for audits. A data store lets a business connect existing data with new data and discover new insights with real-time analytics and business intelligence.
DagsHub is a centralized platform to host and manage machine learning projects, including code, data, models, experiments, annotations, model registry, and more! Intermediate Data Pipeline: build data pipelines using DVC for automation and versioning of open-source machine learning projects.
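As a taste of DVC-based data versioning, the snippet below reads one pinned revision of a tracked file via DVC's Python API; the repository URL, path, and tag are placeholders.

```python
# Read a specific version of a DVC-tracked dataset straight from a repo
# (repo URL, file path, and revision below are placeholders).
import dvc.api

with dvc.api.open(
    "data/train.csv",
    repo="https://github.com/example/project",
    rev="v1.0",  # a Git tag, branch, or commit pinning the data version
) as f:
    first_line = f.readline()
print(first_line)
```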
At this level, the data science team will be small or nonexistent. Businesses will then require more information-literate staff, but they’ll need to contend with an ongoing shortage of data scientists. By now, data scientists have witnessed success optimizing internal operations and external offerings through AI.
Generative AI can be used to automate the data modeling process by generating entity-relationship diagrams or other types of data models, and to assist in the UI design process by generating wireframes or high-fidelity mockups. GPT-4 data pipelines can, for instance, transform JSON to a SQL schema instantly, as with Blockstream’s public Bitcoin API.
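To illustrate the underlying transformation (without a language model in the loop), here is a hand-rolled JSON-to-CREATE-TABLE sketch; the type mapping and table name are simplified assumptions.

```python
# Infer a SQL CREATE TABLE statement from one JSON record.
# The type mapping and table name are deliberately simplified.
import json

TYPE_MAP = {str: "TEXT", int: "INTEGER", float: "REAL", bool: "BOOLEAN"}

def json_to_create_table(record: dict, table: str) -> str:
    cols = ", ".join(
        f"{name} {TYPE_MAP.get(type(value), 'TEXT')}"
        for name, value in record.items()
    )
    return f"CREATE TABLE {table} ({cols});"

sample = json.loads('{"txid": "ab12", "fee": 142, "confirmed": true}')
print(json_to_create_table(sample, "transactions"))
# -> CREATE TABLE transactions (txid TEXT, fee INTEGER, confirmed BOOLEAN);
```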
Union of business and data teams: the success of ML projects lies in the strong collaboration between the data team and the business team. Such a continuous alliance with the business team helps the data science team create ML models that have the potential to add significant business value.
The right data architecture can help your organization improve data quality because it provides the framework that determines how data is collected, transported, stored, secured, used, and shared for business intelligence and data science use cases. What does a modern data architecture do for your business?
By leveraging version control, testing, and documentation features, dbt Core enables teams to ensure data quality and consistency across their pipelines while integrating seamlessly with modern data warehouses. Aside from migrations, Data Source is also great for data quality checks and can generate data pipelines.
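For a sense of how that fits into automation, a minimal programmatic invocation of dbt Core is sketched below; it assumes dbt-core 1.5+ and an existing dbt project in the working directory.

```python
# Minimal programmatic dbt invocation (requires dbt-core >= 1.5 and an
# existing dbt project with a valid profile in the working directory).
from dbt.cli.main import dbtRunner

runner = dbtRunner()

# Build the models, then run the project's schema tests.
for args in (["run"], ["test"]):
    result = runner.invoke(args)
    if not result.success:
        raise SystemExit(f"dbt {args[0]} failed")
```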
For data science practitioners, productization is key, just like any other AI or ML technology. This will require investing resources in the entire AI and ML lifecycle, including building the data pipeline, scaling, automation, integrations, addressing risk and data privacy, and more.
Three experts from Capital One’s data science team spoke as a panel at our Future of Data-Centric AI conference in 2022. Please welcome to the stage, Senior Director of Applied ML and Research, Bayan Bruss; Director of Data Science, Erin Babinski; and Head of Data and Machine Learning, Kishore Mosaliganti.
Security is Paramount: implement robust security measures to protect sensitive time-series data. Integration with Data Pipelines and Analytics: TSDBs often work in tandem with other data tools to create a comprehensive data ecosystem for analysis and insights generation.
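As one concrete example of a TSDB sitting inside a pipeline, the snippet below queries recent points with the InfluxDB 2.x Python client; the URL, token, org, bucket, and measurement names are placeholders.

```python
# Query the last hour of temperature readings from InfluxDB 2.x.
# URL, token, org, bucket, and measurement are placeholders.
from influxdb_client import InfluxDBClient

with InfluxDBClient(url="http://localhost:8086",
                    token="my-token", org="my-org") as client:
    query = '''
        from(bucket: "sensors")
          |> range(start: -1h)
          |> filter(fn: (r) => r._measurement == "temperature")
    '''
    for table in client.query_api().query(query):
        for record in table.records:
            print(record.get_time(), record.get_value())
```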
The solution thus allows data workloads to scale independently of one another while seamlessly handling data warehousing, data lakes, data sharing, and engineering. You’ll be empowered to truncate and reprocess data if bugs are detected, and it provides an excellent raw data source for data scientists.
With collaborative coding tools, DagsHub provides a central location for data science teams to visualize, compare, and review their experiments, eliminating the need to set up any infrastructure. DagsHub detects and supports DVC's metrics and params file formats, and it also sets up a DVC remote where we can version our data.
What frameworks and operating models have you seen work well? The firms that get data governance and management “right” bring people together and leverage a set of capabilities: (1) Agile; (2) Six Sigma; (3) data science; and (4) project management tools. Establishing a solid vision and mission is key.
Advanced Analytics: Snowflake’s platform is purposefully engineered to cater to the demands of machine learning and AI-driven data science applications in a cost-effective manner. No Hardware Provisioning: no hardware to provision, just a t-shirt-sized warehouse available as needed within seconds.
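To make the "warehouse in seconds" point tangible, here is a brief sketch with the Snowflake Python connector; the account and credentials are placeholders.

```python
# Create a t-shirt-sized Snowflake warehouse on demand.
# Account and credentials below are placeholders.
import snowflake.connector

conn = snowflake.connector.connect(
    account="my_account", user="my_user", password="...",
)
cur = conn.cursor()

# An XSMALL warehouse that suspends itself after 60 idle seconds.
cur.execute(
    "CREATE WAREHOUSE IF NOT EXISTS analytics_wh "
    "WITH WAREHOUSE_SIZE = 'XSMALL' AUTO_SUSPEND = 60"
)
cur.execute("SELECT CURRENT_VERSION()")
print(cur.fetchone())
conn.close()
```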
Data can change a lot, models may evolve quickly, and dependencies can become outdated, which makes it hard to maintain consistency or reproducibility. With weak version control, teams can face problems like inconsistent data, model drift, and clashes in their code.
[Figure: a typical machine learning pipeline with various stages highlighted. Source: Author] Common types of machine learning pipelines: in line with the stages of the ML workflow (data, model, and production), an ML pipeline comprises three different pipelines that solve different workflow stages. Happy pipelining!
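A compact sketch of the data-and-model portion of that idea, using scikit-learn's Pipeline on a toy dataset (the dataset and the evaluation standing in for the production stage are illustrative):

```python
# Chain a data stage (scaling) and a model stage (training) with
# scikit-learn, then evaluate as a stand-in for the production stage.
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.linear_model import LogisticRegression

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

pipe = Pipeline([
    ("scale", StandardScaler()),      # data stage
    ("model", LogisticRegression(max_iter=1000)),  # model stage
])
pipe.fit(X_train, y_train)

# Gate promotion to production on holdout performance.
print("holdout accuracy:", pipe.score(X_test, y_test))
```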
You must ensure continuous governance and security of your AI models and systems to prevent bias, data leaks, or any unauthorized AI interactions. The partnership between Databricks and Gencore AI enables enterprises to develop AI applications with robust security measures, optimized data pipelines, and comprehensive governance.
Simply put, focusing solely on data analysis, coding, or modeling will no longer cut it for most corporate jobs. In this regard, I believe the future of data science belongs to those who can connect the dots and deliver results across the entire data lifecycle.