Data Scientist, EDA and ML - Data Science Current

Predicting the 2024 U.S. Presidential Election Winner Using Machine Learning

Towards AI

NOVEMBER 4, 2024

The points to cover in this article are as follows: Generating synthetic data to illustrate ML modelling for election outcomes. Providing some insights into how data scientists might approach real-life election predictions. Model Fitting and Training: Various ML models trained on sub-patterns in data.

Machine Learning

Machine Learning Machine Learning Exploratory Data Analysis EDA

I Won $10,000 in a Machine Learning Competition — Here’s My Complete Strategy

Flipboard

JUNE 16, 2025

I’ve worked as a data scientist in FinTech for six years. The data came as a .parquet file that I downloaded using duckdb. You don’t need a PhD to be a data scientist or win an ML competition.

Machine Learning

Machine Learning Machine Learning Data Science Artificial Intelligence

Harness the power of AI and ML using Splunk and Amazon SageMaker Canvas

AWS Machine Learning Blog

AUGUST 12, 2024

Instead, organizations are increasingly looking to take advantage of transformative technologies like machine learning (ML) and artificial intelligence (AI) to deliver innovative products, improve outcomes, and gain operational efficiencies at scale. Data is presented to the personas that need access using a unified interface.

ML

ML ML AWS AI

Modernize and migrate on-premises fraud detection machine learning workflows to Amazon SageMaker

AWS Machine Learning Blog

JUNE 5, 2025

In this post, we share how Radial optimized the cost and performance of their fraud detection machine learning (ML) applications by modernizing their ML workflow using Amazon SageMaker. Businesses' need for fraud detection models: ML has proven to be an effective approach to fraud detection compared with traditional approaches.

Machine Learning

Machine Learning Machine Learning AWS ML

ML Collaboration: Best Practices From 4 ML Teams

The MLOps Blog

DECEMBER 28, 2022

The onset of the pandemic has triggered a rapid increase in the demand for and adoption of ML technology. Building an ML team: Following the surge in ML use cases that have the potential to transform business, leaders are making a significant investment in ML collaboration, building teams that can deliver on the promise of machine learning.

ML

ML ML Data Scientist Machine Learning

Different Plots Used in Exploratory Data Analysis (EDA)

Heartbeat

JANUARY 24, 2024

The importance of EDA in the machine learning world is well known to its users. Making visualizations is one of the finest ways for data scientists to explain data analysis to people outside the business. Exploratory data analysis can help you comprehend your data better, which can aid in future data preprocessing.

Exploratory Data Analysis

Exploratory Data Analysis EDA Data Analysis Data Analysis

Navigating the Exciting Stages: The Journey of a Machine Learning Project Life Cycle

Towards AI

FEBRUARY 3, 2024

From predicting customer behavior to automating many tasks, machine learning has shown its capacity to convert raw data into actionable insights. However, converting raw data into actionable insights is not determined by ML algorithms alone. The success of any ML project depends on a well-structured lifecycle.

Machine Learning

Machine Learning Machine Learning Exploratory Data Analysis Data Scientist

ML | Data Preprocessing in Python

Pickl AI

DECEMBER 3, 2024

Introduction: Data preprocessing is a critical step in the Machine Learning pipeline, transforming raw data into a clean and usable format. With the explosion of data in recent years, it has become essential for data scientists and Machine Learning practitioners to understand and effectively apply preprocessing techniques.

Python

Python ML ML Exploratory Data Analysis

LLMOps demystified: Why it’s crucial and best practices for 2023

Data Science Dojo

AUGUST 28, 2023

Similar to traditional Machine Learning Ops (MLOps), LLMOps necessitates a collaborative effort involving data scientists, DevOps engineers, and IT professionals. Some projects may necessitate a comprehensive LLMOps approach, spanning tasks from data preparation to pipeline production.

Exploratory Data Analysis

Exploratory Data Analysis Data Preparation Machine Learning Machine Learning

31 Questions that Shape Fortune 500 ML Strategy

Towards AI

JUNE 5, 2023

As such, my intention with this blog is not to duplicate those definitions but rather to encourage you to question and evaluate your current ML strategy. While ML algorithms and code play a crucial role in success, they are just a small piece of the large puzzle.

ML

ML ML Data Scientist EDA

11 Open Source Data Exploration Tools You Need to Know in 2023

ODSC - Open Data Science

FEBRUARY 24, 2023

These tools will help make your initial data exploration process easy. ydata-profiling: The primary goal of ydata-profiling is to provide a one-line Exploratory Data Analysis (EDA) experience in a consistent and fast solution. This tool automatically detects problems in an ML dataset.

Exploratory Data Analysis

Exploratory Data Analysis Data Visualization Data Analysis Data Analysis

Data Acquisition & Exploration: Exploring 5 Key MLOps Questions using AWS SageMaker

Towards AI

JUNE 24, 2023

The ‘31 Questions that Shape Fortune 500 ML Strategy’ highlighted key questions to assess the maturity of an ML system. A robust ML platform offers managed solutions to easily address these aspects. [Collaboration] How can multiple data scientists collaborate in real-time on the same dataset?

AWS

AWS Data Scientist ML ML

Accelerate client success management through email classification with Hugging Face on Amazon SageMaker

AWS Machine Learning Blog

SEPTEMBER 12, 2023

The machine learning (ML) model classifies new incoming customer requests as soon as they arrive and redirects them to predefined queues, which allows our dedicated client success agents to focus on the contents of the emails according to their skills and provide appropriate responses. A test endpoint is deployed for testing purposes.

Data Science

Data Science Data Scientist AWS ML

Building ML Platform in Retail and eCommerce

The MLOps Blog

MAY 31, 2023

And eCommerce companies have a ton of use cases where ML can help. The problem is, with more ML models and systems in production, you need to set up more infrastructure to reliably manage everything. And because of that, many companies decide to centralize this effort in an internal ML platform. But how to build it?

ML

ML ML Algorithm Machine Learning

Is your model good? A deep dive into Amazon SageMaker Canvas advanced metrics

AWS Machine Learning Blog

JULY 31, 2023

Although machine learning (ML) can provide valuable insights, ML experts were needed to build customer churn prediction models until the introduction of Amazon SageMaker Canvas. It also enables you to evaluate the models using advanced metrics as if you were a data scientist.

ML

ML ML Data Preparation Machine Learning

How To Learn Python For Data Science?

Pickl AI

NOVEMBER 4, 2024

Its robust ecosystem of libraries and frameworks tailored for Data Science, such as NumPy, Pandas, and Scikit-learn, contributes significantly to its popularity. Moreover, Python’s straightforward syntax allows Data Scientists to focus on problem-solving rather than grappling with complex code.

Data Science

Data Science Python Machine Learning Machine Learning

Predicting new and existing product sales in semiconductors using Amazon Forecast

AWS Machine Learning Blog

APRIL 6, 2023

& AWS Machine Learning Solutions Lab (MLSL). Machine learning (ML) is being used across a wide range of industries to extract actionable insights from data to streamline processes and improve revenue generation. For further assistance with designing and developing ML solutions, please feel free to get in touch with the MLSL team.

Machine Learning

Machine Learning Machine Learning ML ML

Data Science Career FAQs Answered: Educational Background

Mlearning.ai

MAY 23, 2023

Answering one of the most common questions I get asked as a Senior Data Scientist: What skills and educational background are necessary to become a data scientist? To become a data scientist, a combination of technical skills and educational background is typically required.

Data Science

Data Science Data Scientist Machine Learning Machine Learning

Vertex AI: Guide to Google’s Unified Machine Learning Platform

Pickl AI

AUGUST 28, 2024

Introduction: In the rapidly evolving landscape of Machine Learning, Google Cloud’s Vertex AI stands out as a unified platform designed to streamline the entire Machine Learning (ML) workflow. This unified approach enables seamless collaboration among data scientists, data engineers, and ML engineers.

Machine Learning

Machine Learning Machine Learning ML ML

Analyze Amazon SageMaker spend and determine cost optimization opportunities based on usage, Part 2: SageMaker notebooks and Studio

AWS Machine Learning Blog

MAY 30, 2023

Since its introduction, we have helped hundreds of customers optimize their workloads, set guardrails, and improve the visibility of their machine learning (ML) workloads’ cost and usage. Notebooks contain everything needed to run or recreate an ML workflow. SageMaker manages creating the instance and related resources.

AWS

AWS ML ML EDA

Things You Can do Using Kangas Library in Data Science

Heartbeat

FEBRUARY 13, 2023

It is designed to make it easy to track and monitor experiments and conduct exploratory data analysis (EDA) using popular Python visualization frameworks. Introducing Kangas: A powerful software application for working with large amounts of multimedia data. We pay our contributors, and we don’t sell ads.

Data Science

Data Science Python Deep Learning Deep Learning

Monitoring Your Time Series Model in Comet

Heartbeat

MARCH 21, 2023

We will carry out some EDA on our dataset, and then we will log the visualizations onto the Comet experimentation website or platform. Time Series Models: Time series models are a type of statistical model used to analyze and make predictions about data that is collected over time. Without further ado, let’s begin.

Exploratory Data Analysis

Exploratory Data Analysis EDA Machine Learning Machine Learning

Enhancing Customer Churn Prediction with Continuous Experiment Tracking

Heartbeat

SEPTEMBER 28, 2023

To address this challenge, data scientists harness the power of machine learning to predict customer churn and develop strategies for customer retention. Continuous Experiment Tracking with Comet ML: Comet ML is a versatile tool that helps data scientists optimize machine learning experiments.

Machine Learning

Machine Learning Machine Learning Support Vector Machines ML

Unveiling Market Dynamics: Winners of the Google Trends Analysis and Predictive Modeling

Ocean Protocol

MAY 24, 2024

The challenge required a detailed analysis of Google Trends data, integration of additional data sources, and the application of advanced ML methods to predict market behaviors. Data scientists across various expertise levels engaged in this challenge to determine Google Trends’ impact on cryptocurrency valuations.

EDA

EDA Exploratory Data Analysis Data Scientist ML

Machine Learning Operations (MLOPs) with Azure Machine Learning

ODSC - Open Data Science

JULY 19, 2023

Machine Learning Operations (MLOps) can significantly accelerate how data scientists and ML engineers meet organizational needs. A well-implemented MLOps process not only expedites the transition from testing to production but also offers ownership, lineage, and historical data about ML artifacts used within the team.

Machine Learning

Machine Learning Machine Learning Azure Data Science

Roadmap to Learn Data Science for Beginners and Freshers in 2023

Becoming Human

MAY 15, 2023

Note: Now, start joining Data Science communities on social media platforms. These communities will help you stay up to date in the field, because experienced data scientists post there, and you can talk with them so they can guide you on your journey.

Data Science

Data Science Machine Learning Machine Learning Database

How I cleared AWS Machine Learning Specialty with three weeks of preparation (I will burst some…

Mlearning.ai

FEBRUARY 2, 2023

How I cleared AWS Machine Learning Specialty with three weeks of preparation (I will burst some myths of the online exam). How I prepared for the test, my emotional journey during preparation, and my actual exam experience. Introduction: I recently took and passed the AWS ML certification exam on 29th Dec 2022.

Machine Learning

Machine Learning Machine Learning AWS ML

Introducing our New Book: Implementing MLOps in the Enterprise

Iguazio

DECEMBER 14, 2023

Drawing from their extensive experience in the field, the authors share their strategies, methodologies, tools and best practices for designing and building a continuous, automated and scalable ML pipeline that delivers business value. The book is poised to address these exact challenges.

ML

ML ML Data Science Data Preparation

Explaining PCA

Mlearning.ai

MARCH 22, 2023

Principal Component Analysis (PCA) is an essential algorithm in a data scientist's toolkit. This makes it particularly useful for analyzing large datasets with many variables, where it can be difficult to visualize and interpret the data.

Data Scientist

Data Scientist Machine Learning Machine Learning EDA

Announcing the Winners of ‘The NFL Fantasy Football’ Data Challenge

Ocean Protocol

SEPTEMBER 29, 2023

This data challenge took NFL player performance data and fantasy points from the last 6 seasons to calculate forecasted points to be scored in the 2024 NFL season that began Sept. AI / ML offers tools to give a competitive edge in predictive analytics, business intelligence, and performance metrics.

Cross Validation

Cross Validation Predictive Analytics Exploratory Data Analysis EDA

Even More Demo Sessions Coming to ODSC East to Help You Build AI Better

ODSC - Open Data Science

APRIL 29, 2023

a comprehensive approach to the ML pipeline. Latest trends/methods in Feature Engineering for Time Series Forecasting (Dr. Joshua Gordon, Senior Data Scientist, DotData): This workshop will introduce you to the fundamentals and practical applications of feature engineering as they apply to time series forecasting.

Data Science

Data Science Exploratory Data Analysis AI AI

Decoding METAR Data: Insights from the Ocean Protocol Data Challenge

Ocean Protocol

MARCH 11, 2024

METAR, Miami International Airport (KMIA) on March 9, 2024, at 15:00 UTC. In the recently concluded data challenge hosted on Desights.ai, participants used exploratory data analysis (EDA) and advanced artificial intelligence (AI) techniques to enhance aviation weather forecasting accuracy.

Exploratory Data Analysis

Exploratory Data Analysis Machine Learning Machine Learning EDA

Tracking Your Sentiment Analysis With Comet

Heartbeat

JANUARY 30, 2023

But they need a lot of labeled training data, and the dataset could be biased. In order to accomplish this, we will perform some EDA on the Disneyland dataset, and then we will view the visualization on the Comet experimentation website or platform. In this article, we’ll learn how to link Comet with Disneyland Sentiment Analysis.

EDA

EDA Machine Learning Machine Learning Exploratory Data Analysis

Meet the winners of the Kelp Wanted challenge

DrivenData Labs

APRIL 10, 2024

Michal Wierzbinski. Place: 2nd Place. Prize: $3,000. Hometown: Rabka-Zdroj (near the city of Cracow), Poland. Username: xultaeculcis. Social Media: GitHub, LinkedIn. Background: ML Engineer specializing in building Deep Learning solutions for the Geospatial industry in a cloud native fashion. What motivated you to compete in this challenge?

Deep Learning

Deep Learning Deep Learning Machine Learning Machine Learning

Feature Engineering in Machine Learning

Pickl AI

JANUARY 3, 2024

The growing application of Machine Learning also draws interest towards its subsets that add power to ML models. Key takeaways: Feature engineering transforms raw data for ML, enhancing model performance and significance. EDA, imputation, encoding, scaling, extraction, outlier handling, and cross-validation ensure robust models.

Machine Learning

Machine Learning Machine Learning Exploratory Data Analysis Cross Validation

Top 10 Data Science Projects on GitHub

Pickl AI

JUNE 7, 2023

Data Science projects require you to work on different projects and track changes in your project using version control. If you want to become an efficient Data Scientist and grab that job role you’ve been looking for, you need to work on GitHub for Data Science projects.

Data Science

Data Science Deep Learning Deep Learning Clustering

Artificial Intelligence Using Python: A Comprehensive Guide

Pickl AI

JULY 12, 2024

Here are a few of the key concepts that you should know: Machine Learning (ML): This is a type of AI that allows computers to learn without being explicitly programmed. Machine Learning algorithms are trained on large amounts of data, and they can then use that data to make predictions or decisions about new data.

Artificial Intelligence

Artificial Intelligence Artificial Intelligence Python Natural Language Processing

Linear Regression for tech start-up company Cars4U in Python

Mlearning.ai

FEBRUARY 28, 2023

As a data scientist at Cars4U, I had to come up with a pricing model that can effectively predict the price of used cars and can help the business in devising profitable strategies using differential pricing. Bivariate EDA: Contrary to intuition, Kilometers_Driven does not seem to have a relationship with the price.

Python

Python EDA Exploratory Data Analysis Data Analysis

Create and visualize image data with Kangas for computer vision tasks

Heartbeat

MAY 24, 2023

Create DataGrids with image data using Kangas, and load and visualize image data from Hugging Face. Visualizing data to carry out a detailed EDA, especially for image data, is critical.

Deep Learning

Deep Learning Deep Learning EDA ML

Nurturing a Strong Data Science Foundation for Beginners

Mlearning.ai

JULY 11, 2023

For instance, feature engineering and exploratory data analysis (EDA) often require the use of visualization libraries like Matplotlib and Seaborn. In the data science industry, effective communication and collaboration play a crucial role. Moreover, tools like Power BI and Tableau can produce remarkable results.

Data Science

Data Science Exploratory Data Analysis Azure Power BI

Sentiment Analysis with Python and Streamlit

Heartbeat

JANUARY 25, 2023

Build and deploy your own sentiment classification app using Python and Streamlit. Nowadays, working on tabular data is not the only thing in Machine Learning (ML). Data formats like image, video, text, etc., This approach is mostly used for small datasets where ML models cannot be effective.

Python

Python Deep Learning Deep Learning ML

Harnessing Machine Learning on Big Data with PySpark on AWS

ODSC - Open Data Science

AUGUST 9, 2023

This is a straightforward and mostly clear-cut question: most of us can likely classify a dish as a dessert or not simply by reading its name, which makes it an excellent candidate for a simple ML model. Step 3: Train, Test, and Evaluate Model. Once the data is processed and transformed, we can split it into a training set and a testing set.

Machine Learning

Machine Learning Machine Learning AWS Big Data

Room Occupancy Detection

Heartbeat

FEBRUARY 6, 2024

From the above EDA, it is clear that the room's temperature, light, and CO2 levels are good occupancy indicators. Editorially independent, Heartbeat is sponsored and published by Comet, an MLOps platform that enables data scientists & ML teams to track, compare, explain, & optimize their experiments.

Exploratory Data Analysis

Exploratory Data Analysis Data Analysis Data Analysis Machine Learning

Beyond prompting: getting production quality LLM performance with Snorkel Flow

Snorkel AI

AUGUST 9, 2023

ML practitioners are increasingly coming to appreciate that while foundation models like LLMs provide a fantastic foundation for AI applications, best results are achieved with additional data-centric development. We corrected these via prompt where possible and then later via additional labeled data for fine-tuning.

EDA

EDA AI AI Data Scientist
