Data Scientist, Deep Learning and ML

What Comes After HDF5? Seeking a Data Storage Format for Deep Learning

KDnuggets

NOVEMBER 9, 2021

In this article we are discussing that HDF5 is one of the most popular and reliable formats for non-tabular, numerical data. But this format is not optimized for deep learning work. This article suggests what kind of ML native data format should be to truly serve the needs of modern data scientists.

Deep Learning

Deep Learning Deep Learning Data Scientist ML

A Comprehensive Guide on Hyperparameter Tuning and its Techniques

Analytics Vidhya

FEBRUARY 21, 2022

This article was published as a part of the Data Science Blogathon. Image designed by the author – Shanthababu Introduction Every ML Engineer and Data Scientist must understand the significance of “Hyperparameter Tuning (HPs-T)” while selecting your right machine/deep learning model and improving the performance of the model(s).

Deep Learning

Deep Learning Deep Learning Data Scientist Data Science

10 AI Conferences in the USA (2025): Connect with Top AI and Data Minds

Data Science Dojo

FEBRUARY 13, 2025

If you want to stay ahead in the world of big data, AI, and data-driven decision-making, Big Data & AI World 2025 is the perfect event to explore the latest innovations, strategies, and real-world applications. This event offers cutting-edge discussions, hands-on workshops, and deep dives into AI advancements.

Big Data

Big Data Big Data AI AI

Webinars

What’s New in Apache Airflow® 3.0—And How Will It Reshape Your Data Workflows?

MORE WEBINARS

Remote Data Science Jobs: 5 High-Demand Roles for Career Growth

Data Science Dojo

OCTOBER 31, 2024

For data scientists, this shift has opened up a global market of remote data science jobs, with top employers now prioritizing skills that allow remote professionals to thrive. Here’s everything you need to know to land a remote data science job, from advanced role insights to tips on making yourself an unbeatable candidate.

Data Science

Data Science Data Scientist Machine Learning Machine Learning

Journeying into the realms of ML engineers and data scientists

Dataconomy

MAY 16, 2023

Machine learning engineer vs data scientist: two distinct roles with overlapping expertise, each essential in unlocking the power of data-driven insights. As businesses strive to stay competitive and make data-driven decisions, the roles of machine learning engineers and data scientists have gained prominence.

Data Scientist

Data Scientist ML ML Machine Learning

Revolutionize your ML workflow: 5 drag and drop tools for streamlining your pipeline

Data Science Dojo

APRIL 3, 2023

Drag and drop tools have revolutionized the way we approach machine learning (ML) workflows. Gone are the days of manually coding every step of the process – now, with drag-and-drop interfaces, streamlining your ML pipeline has become more accessible and efficient than ever before. H2O.ai H2O.ai

ML

ML ML Machine Learning Machine Learning

Accelerate your ML lifecycle using the new and improved Amazon SageMaker Python SDK – Part 1: ModelTrainer

AWS Machine Learning Blog

DECEMBER 12, 2024

The new SDK is designed with a tiered user experience in mind, where the new lower-level SDK ( SageMaker Core ) provides access to full breadth of SageMaker features and configurations, allowing for greater flexibility and control for ML engineers. This is usually achieved by providing the right set of parameters when using an Estimator.

ML

ML ML Python AWS

Survey: Massive Retooling Around Large Language Models Underway

insideBIGDATA

OCTOBER 14, 2023

A recent survey of data scientists and engineers revealed that over half (53.3%) of today’s machine learning (ML) teams are planning on deploying a large language model (LLM) application of their own into production “within the next 12 months” or “as soon as possible”.

Data Scientist

Data Scientist Machine Learning Machine Learning ML

Generative AI – Understanding the ethics and societal impact of emerging trends

Data Science Dojo

MARCH 31, 2023

Artificial intelligence (AI), machine learning (ML), and data science have become some of the most significant topics of discussion in today’s technological era. Matul, who has experience working as an AI scientist at amazon, focused on dialogue machines and natural language understanding.

Data Scientist

Data Scientist AI Data Science AI

How to Visualize Deep Learning Models

The MLOps Blog

NOVEMBER 14, 2023

Deep learning models are typically highly complex. While many traditional machine learning models make do with just a couple of hundreds of parameters, deep learning models have millions or billions of parameters. This is where visualizations in ML come in.

Deep Learning

Deep Learning Deep Learning Data Scientist Machine Learning

Map Earth’s vegetation in under 20 minutes with Amazon SageMaker

AWS Machine Learning Blog

OCTOBER 16, 2024

Amazon SageMaker supports geospatial machine learning (ML) capabilities, allowing data scientists and ML engineers to build, train, and deploy ML models using geospatial data. Identify areas of interest We begin by illustrating how SageMaker can be applied to analyze geospatial data at a global scale.

ML

ML ML Clustering Machine Learning

Build and deploy ML models using Maximo Visual Inspection

IBM Data Science in Practice

MARCH 21, 2023

Deep learning models built using Maximo Visual Inspection (MVI) are used for a wide range of applications, including image classification and object detection. These models train on large datasets and learn complex patterns that are difficult for humans to recognize. What are the types of image processing ML models?

ML

ML ML Deep Learning Deep Learning

Accelerating AI/ML development at BMW Group with Amazon SageMaker Studio

Flipboard

NOVEMBER 24, 2023

In an increasingly digital and rapidly changing world, BMW Group’s business and product development strategies rely heavily on data-driven decision-making. With that, the need for data scientists and machine learning (ML) engineers has grown significantly.

ML

ML ML AWS AI

How to Become a Generative AI Engineer in 2025?

Towards AI

JANUARY 29, 2025

Generative AI is powered by advanced machine learning techniques, particularly deep learning and neural networks, such as Generative Adversarial Networks (GANs) and Variational Autoencoders (VAEs). Roles like AI Engineer, Machine Learning Engineer, and Data Scientist are increasingly requiring expertise in Generative AI.

Deep Learning

Deep Learning Deep Learning Machine Learning Machine Learning

Things Data Scientists Should Know About Productionizing Machine Learning

ODSC - Open Data Science

FEBRUARY 9, 2023

It is often too much to ask for the data scientist to become a domain expert. However, in all cases the data scientist must develop strong domain empathy to help define and solve the right problems. Nina Zumel and John Mount, Practical Data Science with R, 2nd Ed. But this statement also goes upstream.

Data Scientist

Data Scientist Machine Learning Machine Learning ML

A Comprehensive Step-by-Step Guide to Become an Industry Ready Data Science Professional

Analytics Vidhya

FEBRUARY 24, 2021

ArticleVideo Book Introduction to Artificial Intelligence and Machine Learning Artificial Intelligence (AI) and its sub-field Machine Learning (ML) have taken the world by storm. The post A Comprehensive Step-by-Step Guide to Become an Industry Ready Data Science Professional appeared first on Analytics Vidhya.

Data Science

Data Science Artificial Intelligence Artificial Intelligence Machine Learning

How Booking.com modernized its ML experimentation framework with Amazon SageMaker

AWS Machine Learning Blog

FEBRUARY 12, 2024

Sharing in-house resources with other internal teams, the Ranking team machine learning (ML) scientists often encountered long wait times to access resources for model training and experimentation – challenging their ability to rapidly experiment and innovate. If it shows online improvement, it can be deployed to all the users.

ML

ML ML AWS Machine Learning

Unlocking insights and enhancing customer service: Intact’s transformative AI journey with AWS

AWS Machine Learning Blog

OCTOBER 16, 2024

It uses deep learning to convert audio to text quickly and accurately. Amazon Transcribe offers deep learning capabilities, which can handle a wide range of speech and acoustic characteristics, in addition to its scalability to process anywhere from a few hundred to over tens of thousands of calls daily, also played a pivotal role.

AWS

AWS AI AI Machine Learning

Top data science conferences you must attend in 2023

Data Science Dojo

JANUARY 13, 2023

Women in Data Science (WiDS) – California, United States Women in Data Science (WiDS) is an annual conference held at Stanford University, California, United States and other locations worldwide. The conference is focused on the representation, education, and achievements of women in the field of data science.

Data Science

Data Science Data Mining Data Mining Data Mining

Train and deploy ML models in a multicloud environment using Amazon SageMaker

AWS Machine Learning Blog

SEPTEMBER 20, 2023

In these scenarios, as you start to embrace generative AI, large language models (LLMs) and machine learning (ML) technologies as a core part of your business, you may be looking for options to take advantage of AWS AI and ML capabilities outside of AWS in a multicloud environment.

ML

ML ML Azure AWS

A comprehensive comparison of RPA and ML

Dataconomy

MARCH 27, 2023

However, while RPA and ML share some similarities, they differ in functionality, purpose, and the level of human intervention required. In this article, we will explore the similarities and differences between RPA and ML and examine their potential use cases in various industries. What is machine learning (ML)?

ML

ML ML Machine Learning Machine Learning

Package and deploy classical ML and LLMs easily with Amazon SageMaker, part 1: PySDK Improvements

Flipboard

NOVEMBER 30, 2023

Amazon SageMaker is a fully managed service that enables developers and data scientists to quickly and effortlessly build, train, and deploy machine learning (ML) models at any scale. SageMaker makes it straightforward to deploy models into production directly through API calls to the service.

ML

ML ML AWS Python

Improve the performance of your Generative AI applications with Prompt Optimization on Amazon Bedrock

AWS Machine Learning Blog

NOVEMBER 29, 2024

About the Authors Shreyas Subramanian is a Principal Data Scientist and helps customers by using generative AI and deep learning to solve their business challenges using AWS services. Chris Pecora is a Generative AI Data Scientist at Amazon Web Services.

AI

AI AI ML ML

Top 5 large language models and generative AI bootcamps

Data Science Dojo

OCTOBER 27, 2023

Data Science Dojo Large Language Models Bootcamp The Data Science Dojo Large Language Models Bootcamp is a 5-day in-person bootcamp that teaches you everything you need to know about large language models (LLMs) and their real-world applications. Who should attend?

Natural Language Processing

Natural Language Processing AI Data Science AI

The innovators behind intelligent machines: A look at ML engineers

Dataconomy

MAY 2, 2023

The machine learning systems developed by Machine Learning Engineers are crucial components used across various big data jobs in the data processing pipeline. Additionally, Machine Learning Engineers are proficient in implementing AI or ML algorithms. Is ML engineering a stressful job?

ML

ML ML Machine Learning Machine Learning

5 Reasons Why SQL is Still the Most Accessible Language for New Data Scientists

ODSC - Open Data Science

APRIL 6, 2023

For budding data scientists and data analysts, there are mountains of information about why you should learn R over Python and the other way around. But why is SQL, or Structured Query Language , so important to learn? These are used to extract, transform, and load (ETL) data between different systems.

SQL

SQL Data Scientist Database Data Science

Use Snowflake as a data source to train ML models with Amazon SageMaker

AWS Machine Learning Blog

MARCH 8, 2023

Amazon SageMaker is a fully managed machine learning (ML) service. With SageMaker, data scientists and developers can quickly and easily build and train ML models, and then directly deploy them into a production-ready hosted environment. We add this data to Snowflake as a new table.

ML

ML ML AWS Python

Life beyond the leaderboard

DrivenData Labs

MAY 12, 2025

competition, winning solutions used deep learning approaches from facial recognition tasks (particularly ArcFace and EfficientNet) to help the Bureau of Ocean and Energy Management and NOAA Fisheries monitor endangered populations of beluga whales by matching overhead photos with known individuals.

Algorithm

Algorithm Machine Learning Machine Learning Deep Learning

Cloud Data Science 4

Data Science 101

JANUARY 24, 2020

It was an exciting cloud data science week. Microsoft DP-100 Certification Updated – The Microsoft Data Scientist certification exam has been updated to cover the latest Azure Machine Learning tools. Choosing the Right ML Tools – This video walks thru the Google Machine Learning Decision Pyramid.

Cloud Data

Cloud Data Data Science Azure Machine Learning

Announcing new Jupyter contributions by AWS to democratize generative AI and scale ML workloads

AWS Machine Learning Blog

MAY 10, 2023

Project Jupyter is a multi-stakeholder, open-source project that builds applications, open standards, and tools for data science, machine learning (ML), and computational science. Given the importance of Jupyter to data scientists and ML developers, AWS is an active sponsor and contributor to Project Jupyter.

ML

ML ML AWS AI

A Comprehensive Step-by-Step Guide to Become an Industry-Ready Data Science Professional

Analytics Vidhya

NOVEMBER 6, 2020

Introduction to Artificial Intelligence and Machine Learning Artificial Intelligence (AI) and its sub-field Machine Learning (ML) have taken the world by storm. The post A Comprehensive Step-by-Step Guide to Become an Industry-Ready Data Science Professional appeared first on Analytics Vidhya.

Data Science

Data Science Artificial Intelligence Artificial Intelligence Machine Learning

Accelerate development of ML workflows with Amazon Q Developer in Amazon SageMaker Studio

AWS Machine Learning Blog

SEPTEMBER 23, 2024

Machine learning (ML) projects are inherently complex, involving multiple intricate steps—from data collection and preprocessing to model building, deployment, and maintenance. To start our ML project predicting the probability of readmission for diabetes patients, you need to download the Diabetes 130-US hospitals dataset.

ML

ML ML AWS Data Scientist

Improving ML Datasets with Cleanlab, a Standard Framework for Data-Centric AI

ODSC - Open Data Science

MARCH 22, 2023

Be sure to check out his session, “ Improving ML Datasets with Cleanlab, a Standard Framework for Data-Centric AI ,” there! Anybody who has worked on a real-world ML project knows how messy data can be. Our goal is to enable all developers to find and fix data issues as effectively as today’s best data scientists.

ML

ML ML Data Scientist AI

Build ML features at scale with Amazon SageMaker Feature Store using data from Amazon Redshift

Flipboard

AUGUST 17, 2023

Amazon Redshift is the most popular cloud data warehouse that is used by tens of thousands of customers to analyze exabytes of data every day. SageMaker Studio is the first fully integrated development environment (IDE) for ML. The next step is to build ML models using features selected from one or multiple feature groups.

ML

ML ML AWS Data Warehouse

AI vs. Machine Learning vs. Deep Learning vs. Neural Networks: What’s the difference?

IBM Journey to AI blog

JULY 6, 2023

While artificial intelligence (AI), machine learning (ML), deep learning and neural networks are related technologies, the terms are often used interchangeably, which frequently leads to confusion about their differences. Machine learning is a subset of AI. It can ingest unstructured data in its raw form (e.g.,

Deep Learning

Deep Learning Deep Learning Machine Learning Machine Learning

MLCoPilot: Empowering Large Language Models with Human Intelligence for ML Problem Solving

Towards AI

MAY 3, 2023

They investigate the most suitable algorithms, identify the best weights and hyperparameters, and might even collaborate with fellow data scientists in the community to develop an effective strategy. This is where ML CoPilot enters the scene. Vector databases can store them and are designed for search and data mining.

ML

ML ML Machine Learning Machine Learning

Open-source packages for using speech data in ML

DrivenData Labs

APRIL 8, 2025

These applications are all enabled by a strong ecosystem of open-source Python packages for working with image data. Packages like rasterio and pydicom make it possible for data scientists to contribute without becoming experts in satellites or medical imagery. Overall, we recommend openSMILE for general ML applications.

ML

ML ML Machine Learning Machine Learning

MLOps: A complete guide for building, deploying, and managing machine learning models

Data Science Dojo

AUGUST 24, 2023

ML models have grown significantly in recent years, and businesses increasingly rely on them to automate and optimize their operations. However, managing ML models can be challenging, especially as models become more complex and require more resources to train and deploy. What is MLOps?

Machine Learning

Machine Learning Machine Learning ML ML

Efficiently train models with large sequence lengths using Amazon SageMaker model parallel

AWS Machine Learning Blog

NOVEMBER 27, 2024

Mixed Precision Training with FP8 As shown in figure below, FP8 is a datatype supported by NVIDIA’s H100 and H200 GPUs, enables efficient deep learning workloads. More details about FP8 can be found at FP8 Formats For Deep Learning. Surya Kari is a Senior Generative AI Data Scientist at AWS.

AWS

AWS Clustering ML ML

Top 10 Deep Learning Platforms in 2024

DagsHub

JULY 25, 2024

Source: Author Introduction Deep learning, a branch of machine learning inspired by biological neural networks, has become a key technique in artificial intelligence (AI) applications. Deep learning methods use multi-layer artificial neural networks to extract intricate patterns from large data sets.

Deep Learning

Deep Learning Deep Learning Machine Learning Machine Learning

MLOps and DevOps: Why Data Makes It Different

O'Reilly Media

OCTOBER 19, 2021

As with many burgeoning fields and disciplines, we don’t yet have a shared canonical infrastructure stack or best practices for developing and deploying data-intensive applications. What does a modern technology stack for streamlined ML processes look like? Why: Data Makes It Different. All ML projects are software projects.

ML

ML ML Data Scientist AWS

Build a dynamic, role-based AI agent using Amazon Bedrock inline agents

AWS Machine Learning Blog

FEBRUARY 13, 2025

About the authors Ishan Singh is a Generative AI Data Scientist at Amazon Web Services, where he helps customers build innovative and responsible generative AI solutions and products. With a strong background in AI/ML, Ishan specializes in building Generative AI solutions that drive business value. Nitin Eusebius is a Sr.

AI

AI AI AWS ML

How Sportradar used the Deep Java Library to build production-scale ML platforms for increased performance and efficiency

AWS Machine Learning Blog

APRIL 19, 2023

The DJL is a deep learning framework built from the ground up to support users of Java and JVM languages like Scala, Kotlin, and Clojure. The DJL is a deep learning framework built from the ground up to support users of Java and JVM languages like Scala, Kotlin, and Clojure. We recently developed four more new models.

ML

ML ML Deep Learning Deep Learning

How Veriff decreased deployment time by 80% using Amazon SageMaker multi-model endpoints

AWS Machine Learning Blog

OCTOBER 16, 2023

As an AI-powered solution, Veriff needs to create and run dozens of machine learning (ML) models in a cost-effective way. These models range from lightweight tree-based models to deep learning computer vision models, which need to run on GPUs to achieve low latency and improve the user experience.

Data Scientist

Data Scientist ML ML AWS

What Comes After HDF5? Seeking a Data Storage Format for Deep Learning

A Comprehensive Guide on Hyperparameter Tuning and its Techniques

Webinars

Trending Sources

10 AI Conferences in the USA (2025): Connect with Top AI and Data Minds

Webinars

Remote Data Science Jobs: 5 High-Demand Roles for Career Growth

Journeying into the realms of ML engineers and data scientists

Revolutionize your ML workflow: 5 drag and drop tools for streamlining your pipeline

Accelerate your ML lifecycle using the new and improved Amazon SageMaker Python SDK – Part 1: ModelTrainer

Survey: Massive Retooling Around Large Language Models Underway

Generative AI – Understanding the ethics and societal impact of emerging trends

How to Visualize Deep Learning Models

Map Earth’s vegetation in under 20 minutes with Amazon SageMaker

Build and deploy ML models using Maximo Visual Inspection

Accelerating AI/ML development at BMW Group with Amazon SageMaker Studio

How to Become a Generative AI Engineer in 2025?

Things Data Scientists Should Know About Productionizing Machine Learning

A Comprehensive Step-by-Step Guide to Become an Industry Ready Data Science Professional

How Booking.com modernized its ML experimentation framework with Amazon SageMaker

Unlocking insights and enhancing customer service: Intact’s transformative AI journey with AWS

Top data science conferences you must attend in 2023

Train and deploy ML models in a multicloud environment using Amazon SageMaker

A comprehensive comparison of RPA and ML

Package and deploy classical ML and LLMs easily with Amazon SageMaker, part 1: PySDK Improvements

Improve the performance of your Generative AI applications with Prompt Optimization on Amazon Bedrock

Top 5 large language models and generative AI bootcamps

The innovators behind intelligent machines: A look at ML engineers

5 Reasons Why SQL is Still the Most Accessible Language for New Data Scientists

Use Snowflake as a data source to train ML models with Amazon SageMaker

Life beyond the leaderboard

Cloud Data Science 4

Announcing new Jupyter contributions by AWS to democratize generative AI and scale ML workloads

A Comprehensive Step-by-Step Guide to Become an Industry-Ready Data Science Professional

Accelerate development of ML workflows with Amazon Q Developer in Amazon SageMaker Studio

Improving ML Datasets with Cleanlab, a Standard Framework for Data-Centric AI

Build ML features at scale with Amazon SageMaker Feature Store using data from Amazon Redshift

AI vs. Machine Learning vs. Deep Learning vs. Neural Networks: What’s the difference?

MLCoPilot: Empowering Large Language Models with Human Intelligence for ML Problem Solving

Open-source packages for using speech data in ML

MLOps: A complete guide for building, deploying, and managing machine learning models

Efficiently train models with large sequence lengths using Amazon SageMaker model parallel

Top 10 Deep Learning Platforms in 2024

MLOps and DevOps: Why Data Makes It Different

Build a dynamic, role-based AI agent using Amazon Bedrock inline agents

How Sportradar used the Deep Java Library to build production-scale ML platforms for increased performance and efficiency

How Veriff decreased deployment time by 80% using Amazon SageMaker multi-model endpoints

Stay Connected