With Connected Sheets, a business user can open a sheet, enter data for a new property (square footage, number of bedrooms, location), and have a formula call a BQML model to return a price estimate. No Python or API wrangling needed - just a Sheets formula calling a model.
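The same prediction can also be issued from Python. This is a minimal sketch, assuming a hypothetical BQML model named `my_dataset.price_model`, illustrative feature names, and the google-cloud-bigquery client with credentials configured:

```python
def build_predict_sql(model: str, features: dict) -> str:
    """Build an ML.PREDICT query for a BQML regression model.

    `model` and the feature names are placeholders for illustration.
    """
    cols = ", ".join(
        f"{v!r} AS {k}" if isinstance(v, str) else f"{v} AS {k}"
        for k, v in features.items()
    )
    return f"SELECT * FROM ML.PREDICT(MODEL `{model}`, (SELECT {cols}))"


def predict_price():
    """Run the prediction (requires google-cloud-bigquery and credentials)."""
    from google.cloud import bigquery  # third-party, imported lazily

    client = bigquery.Client()
    sql = build_predict_sql(
        "my_dataset.price_model",
        {"square_footage": 1500, "bedrooms": 3, "location": "Austin"},
    )
    for row in client.query(sql).result():
        print(dict(row))
```

Call `predict_price()` from an environment with BigQuery access; `build_predict_sql` alone shows the shape of the generated SQL.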
In this post, I’ll show you exactly how I did it with detailed explanations and Python code snippets, so you can replicate this approach for your next machine learning project or competition.
Entirely new paradigms rise quickly: cloud computing, data engineering, machine learning engineering, mobile development, and large language models. To further complicate things, topics like cloud computing, software operations, and even AI don’t fit nicely within a university IT department.
Michelle Yi, Co-Founder of Generationship: Michelle Yi is a technology leader who specializes in machine learning and cloud computing. She can teach you about Data Analysis, Java, Python, PostgreSQL, Microservices, Containers, Kubernetes, and some JavaScript.
Python: The demand for Python remains high due to its versatility and extensive use in web development, data science, automation, and AI. Python, which became the most used language in 2024, is the top choice for job seekers who want to pursue any career in AI. However, the competition is high.
In the ever-expanding world of data science, the landscape has changed dramatically over the past two decades. Once defined by statistical models and SQL queries, today’s data practitioners must navigate a dynamic ecosystem that includes cloud computing, software engineering best practices, and the rise of generative AI.
Senior/Staff+ Engineer. Good at Go and Kubernetes (understands how to manage stateful services in a multi-cloud environment). We have a Python service in our Recommendation pipeline, so some ML/Data Science knowledge would be good. Python/Django deeply internalized; ideally Vue (or React) skills as well.
Amazon S3 allows users to store and retrieve files quickly and securely from anywhere. Users can combine S3 with other services to build numerous scalable […]. The post Using AWS S3 with Python boto3 appeared first on Analytics Vidhya.
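As a sketch of how boto3 is typically used for such transfers (the bucket and paths are hypothetical; boto3 and AWS credentials are required for the actual upload/download):

```python
def parse_s3_uri(uri: str) -> tuple[str, str]:
    """Split 's3://bucket/key' into (bucket, key)."""
    if not uri.startswith("s3://"):
        raise ValueError(f"not an S3 URI: {uri}")
    bucket, _, key = uri[len("s3://"):].partition("/")
    return bucket, key


def upload_file(local_path: str, uri: str) -> None:
    """Upload a local file to S3 (needs boto3 and AWS credentials)."""
    import boto3  # third-party, imported lazily

    bucket, key = parse_s3_uri(uri)
    boto3.client("s3").upload_file(local_path, bucket, key)


def download_file(uri: str, local_path: str) -> None:
    """Download an S3 object to a local file."""
    import boto3  # third-party, imported lazily

    bucket, key = parse_s3_uri(uri)
    boto3.client("s3").download_file(bucket, key, local_path)
```

Usage would be e.g. `upload_file("report.csv", "s3://my-bucket/reports/report.csv")` from an environment with AWS access.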
Google Firebase aims to replace conventional backend servers for web and mobile applications by offering multiple services on the same platform, like authentication, a real-time database, Firestore (NoSQL database), cloud functions, […]. The post Introduction to Google Firebase Cloud Storage using Python appeared first on Analytics Vidhya.
Data engineering is a crucial field that plays a vital role in the data pipeline of any organization. It is the process of collecting, storing, managing, and analyzing large amounts of data, and data engineers are responsible for designing and implementing the systems and infrastructure that make this possible.
This article was published as a part of the Data Science Blogathon. In this article, we will learn to connect to the Snowflake database. The post One-stop-shop for Connecting Snowflake to Python! appeared first on Analytics Vidhya.
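Connecting from Python generally goes through the snowflake-connector-python package. A minimal sketch, assuming that package is installed and the account details are placeholders:

```python
def connection_params(account: str, user: str, password: str,
                      warehouse: str, database: str, schema: str) -> dict:
    """Collect the keyword arguments snowflake.connector.connect expects."""
    return {
        "account": account,
        "user": user,
        "password": password,
        "warehouse": warehouse,
        "database": database,
        "schema": schema,
    }


def run_query(params: dict, sql: str):
    """Execute a query against Snowflake and fetch all rows.

    Requires snowflake-connector-python and valid credentials.
    """
    import snowflake.connector  # third-party, imported lazily

    conn = snowflake.connector.connect(**params)
    try:
        cur = conn.cursor()
        try:
            cur.execute(sql)
            return cur.fetchall()
        finally:
            cur.close()
    finally:
        conn.close()
```

`run_query(connection_params("xy12345", "me", "secret", "COMPUTE_WH", "DEMO_DB", "PUBLIC"), "SELECT CURRENT_VERSION()")` would return the server version, given real credentials.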
This article was published as a part of the Data Science Blogathon. Overview ETL (Extract, Transform, and Load) is a very common technique in data engineering. It involves extracting the operational data from various sources, transforming it into a format suitable for business needs, and loading it into data storage systems.
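The three steps can be sketched end to end in pure Python, with SQLite standing in for the data storage system (column names are illustrative):

```python
import csv
import io
import sqlite3


def extract(csv_text: str) -> list:
    """Extract: read raw rows from a CSV source."""
    return list(csv.DictReader(io.StringIO(csv_text)))


def transform(rows: list) -> list:
    """Transform: coerce types and drop rows with a missing amount."""
    out = []
    for r in rows:
        if r["amount"]:
            out.append((r["order_id"], float(r["amount"])))
    return out


def load(rows: list, conn: sqlite3.Connection) -> int:
    """Load: write the transformed rows into a warehouse table."""
    conn.execute("CREATE TABLE IF NOT EXISTS orders (order_id TEXT, amount REAL)")
    conn.executemany("INSERT INTO orders VALUES (?, ?)", rows)
    return conn.execute("SELECT COUNT(*) FROM orders").fetchone()[0]
```

In a real pipeline the source would be an operational database or API and the target a warehouse, but the extract/transform/load split stays the same.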
In the contemporary age of Big Data, Data Warehouse Systems and Data Science Analytics Infrastructures have become essential components for organizations to store, analyze, and make data-driven decisions. Infrastructure as Code (IaC) can be a game-changer in this scenario.
Here are a few of the things that you might do as an AI Engineer at TigerEye: - Design, develop, and validate statistical models to explain past behavior and to predict future behavior of our customers’ sales teams - Own training, integration, deployment, versioning, and monitoring of ML components - Improve TigerEye’s existing metrics collection and (..)
Introduction Many different datasets are available for data scientists, machine learning engineers, and data engineers. Finding the best tools to evaluate each dataset […] The post Understanding Dask in Depth appeared first on Analytics Vidhya.
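Dask scales pandas-style operations by splitting data into partitions. A minimal sketch, assuming the dask[dataframe] package and a hypothetical glob of CSV files:

```python
def suggest_npartitions(n_rows: int, rows_per_partition: int = 1_000_000) -> int:
    """Rough partition count so each chunk stays a manageable size."""
    return max(1, -(-n_rows // rows_per_partition))  # ceiling division


def mean_by_group(csv_glob: str, group_col: str, value_col: str):
    """Lazily compute a grouped mean over many CSVs (needs dask[dataframe]).

    Nothing is read until .compute() triggers the parallel execution.
    """
    import dask.dataframe as dd  # third-party, imported lazily

    df = dd.read_csv(csv_glob)
    return df.groupby(group_col)[value_col].mean().compute()
```

E.g. `mean_by_group("sales-*.csv", "region", "amount")` would scan every matching file in parallel rather than loading them all into memory at once.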
Introduction Elasticsearch is a search platform with quick search capabilities. It is a Lucene-based search engine developed in Java but supports clients in various languages such as Python, C#, Ruby, and PHP. It takes unstructured data from multiple sources as input and stores it […].
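From Python, a basic full-text query looks roughly like this. A sketch assuming the 8.x elasticsearch client package and a hypothetical index name:

```python
def match_query(field: str, text: str, size: int = 10) -> dict:
    """Build a basic full-text match query body for Elasticsearch."""
    return {"query": {"match": {field: text}}, "size": size}


def run_search(hosts: list, index: str, field: str, text: str, size: int = 10):
    """Execute the query against a cluster (needs the elasticsearch package)."""
    from elasticsearch import Elasticsearch  # third-party, imported lazily

    es = Elasticsearch(hosts)
    body = match_query(field, text, size)
    return es.search(index=index, query=body["query"], size=body["size"])
```

`run_search(["http://localhost:9200"], "articles", "title", "spark")` would return the top matching documents, given a running cluster.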
This explains the current surge in demand for data engineers, especially in data-driven companies. That said, if you are determined to be a data engineer, getting to know about big data and careers in big data comes in handy. You should learn how to write Python scripts and create software.
Accordingly, one of the most in-demand roles is that of the Azure Data Engineer, which you might be interested in. The following blog will help you learn about the Azure Data Engineer job description, salary, and certification course. How to Become an Azure Data Engineer?
Summary: The fundamentals of Data Engineering encompass essential practices like data modelling, warehousing, pipelines, and integration. Understanding these concepts enables professionals to build robust systems that facilitate effective data management and insightful analysis. What is Data Engineering?
Data science and data engineering are incredibly resource intensive. By using cloud computing, you can easily address a lot of these issues, as many data science cloud options have databases on the cloud that you can access without needing to tinker with your hardware.
Data science bootcamps are intensive short-term educational programs designed to equip individuals with the skills needed to enter or advance in the field of data science. They cover a wide range of topics, from Python, R, and statistics to machine learning and data visualization.
Computer science, math, statistics, programming, and software development are all skills required in NLP projects. Cloud Computing, APIs, and Data Engineering: NLP experts don’t go straight into conducting sentiment analysis on their personal laptops. Knowing some SQL is also essential.
Team Building the right data science team is complex. With a range of role types available, how do you find the perfect balance of Data Scientists, Data Engineers and Data Analysts to include in your team? The Data Engineer: Not everyone working on a data science project is a data scientist.
Introduction Data science has taken over all economic sectors in recent times. To achieve maximum efficiency, every company strives to use various data at every stage of its operations.
The Biggest Data Science Blogathon is now live! Martin Uzochukwu Ugwu: Analytics Vidhya is back with the largest data-sharing knowledge competition, the Data Science Blogathon. “Knowledge is power. Sharing knowledge is the key to unlocking that power.”
Data science is one of India’s rapidly growing and in-demand industries, with far-reaching applications in almost every domain. Not just the leading technology giants in India but medium and small-scale companies are also betting on data science to revolutionize how business operations are performed.
This article was published as a part of the Data Science Blogathon. Introduction AWS Lambda is a serverless computing service that lets you run code in response to events while having the underlying compute resources managed for you automatically.
Introduction What is an API? In simple terms, an API is a messenger; let’s understand this with some examples. Let’s say you are hungry and you need to cook something at home. If you want to make noodles, you just take the ingredients out of the cupboard, fire up the stove, and make it yourself. This […].
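In code, "handing your order to the messenger" amounts to composing a request URL and reading back the reply. A sketch using only the standard library (the endpoint is a hypothetical example):

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen


def build_url(base: str, **params) -> str:
    """Compose a request URL: the 'order' handed to the API."""
    return f"{base}?{urlencode(params)}" if params else base


def get_json(url: str) -> dict:
    """Send the request and parse the JSON reply (network access required)."""
    with urlopen(url) as resp:
        return json.loads(resp.read().decode())
```

E.g. `get_json(build_url("https://api.example.com/menu", dish="noodles", qty=2))` would ask the (hypothetical) service for a result instead of cooking it yourself.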
Hey, are you the data science geek who spends hours coding, learning a new language, or just exploring new avenues of data science? If all of these describe you, then this Blogathon announcement is for you! The post Data Science Blogathon 28th Edition appeared first on Analytics Vidhya.
Introduction Azure Functions is a serverless computing service provided by Azure that gives users a platform to write code in response to a variety of events without having to provision or manage infrastructure. Azure Functions allows developers […] The post How to Develop Serverless Code Using Azure Functions?
This article was published as a part of the Data Science Blogathon. Introduction A Data Warehouse is built by combining data from multiple. The post A Brief Introduction to the Concept of Data Warehouse appeared first on Analytics Vidhya.
Introduction Are you curious about the latest advancements in the data tech industry? Perhaps you’re hoping to advance your career or transition into this field. In that case, we invite you to check out DataHour, a series of webinars led by experts in the field.
Introduction Data has become an essential part of our daily lives in today’s digital age. From searching for a product on e-commerce platforms to placing an order and receiving it at home, we are constantly generating and consuming data.
This article was published as a part of the Data Science Blogathon. Introduction An ultimate beginner’s guide to Apache Spark & RDDs! As we have all observed, the growth of data helps companies get insights, and those insights are used to grow the business.
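The classic first RDD example is a word count. Below, the tallying logic is shown as a plain function, then as the equivalent RDD pipeline; a sketch assuming pyspark is installed and `sc` is an existing SparkContext (the file path is hypothetical):

```python
from collections import Counter


def word_counts(lines) -> dict:
    """The map/reduce logic: split lines into words and tally them."""
    counts = Counter()
    for line in lines:
        counts.update(line.split())
    return dict(counts)


def word_counts_rdd(sc, path: str):
    """Same computation expressed as an RDD pipeline (needs pyspark)."""
    return (sc.textFile(path)            # one RDD element per line
              .flatMap(lambda line: line.split())   # map: line -> words
              .map(lambda w: (w, 1))                # pair each word with 1
              .reduceByKey(lambda a, b: a + b)      # reduce: sum per word
              .collect())                           # bring results to driver
```

The RDD version distributes the same split/tally steps across the cluster's partitions instead of one local loop.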
What do machine learning engineers do? They analyze data and select appropriate algorithms. Programming skills: To excel in machine learning, one must be proficient in programming languages such as Python, R, Java, and C++, and have knowledge of statistics, probability theory, linear algebra, and calculus.
This article was published as a part of the Data Science Blogathon. Introduction In this article, I will be demonstrating how to deploy. The post Deploying PySpark Machine Learning models with Google Cloud Platform using Streamlit appeared first on Analytics Vidhya.
Key Skills: Proficiency in programming languages like Python and R; strong understanding of data preprocessing and algorithm development. Data Scientist: Data Scientists analyze complex data sets to extract meaningful insights that inform business decisions. Key Skills: Proficiency in programming languages like Python and SQL.
For the SageMaker Processing job, you can configure the Spark event log location directly from the SageMaker Python SDK. With the Spark UI hosted on SageMaker, machine learning (ML) and data engineering teams can use scalable cloud compute to access and analyze Spark logs from anywhere and speed up their project delivery.
The data would be further interpreted and evaluated to communicate the solutions to business problems. There are various other professionals involved in working with Data Scientists. This includes Data Engineers, Data Analysts, IT architects, software developers, etc.
Mustafa Hajij introduced TopoX, a comprehensive Python suite for topological deep learning. This session demonstrated how to leverage these tools using Python and PyTorch, offering attendees practical techniques to apply in their research and projects. Introduction to Containers for Data Science/Data Engineering with Michael A.
Introduction to Containers for Data Science/Data Engineering Michael A Fudge | Professor of Practice, MSIS Program Director | Syracuse University’s iSchool In this hands-on session, you’ll learn how to leverage the benefits of containers for DS and data engineering workflows.