How did you get started in data science? Like many data scientists in the 2010s, I stumbled my way into the field. Afterward, I worked as a research assistant at the Fiscal Research Center, a research group at GSU, on a project measuring income mobility in Georgia using government administrative data.
Are you looking to get a job in big data? The Bureau of Labor Statistics reports that there were over 31,000 people working in this field back in 2018. However, it is not easy to build a career in big data: to get a job as a data scientist, you need to be able to answer interview questions accurately, articulately, and succinctly.
How do object detection algorithms work? There are two main categories. Two-stage algorithms first propose candidate regions and then classify and refine each one; single-stage algorithms do the whole process in a single pass through one neural network model.
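Both detector families ultimately produce scored bounding boxes and prune duplicates with non-maximum suppression, which rests on intersection-over-union (IoU). A minimal, framework-free sketch of the IoU computation (the (x1, y1, x2, y2) box format is an assumption for illustration):

```python
def iou(box_a, box_b):
    """Intersection-over-union of two axis-aligned boxes given as (x1, y1, x2, y2)."""
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b
    # Corners of the intersection rectangle (may be empty).
    ix1, iy1 = max(ax1, bx1), max(ay1, by1)
    ix2, iy2 = min(ax2, bx2), min(ay2, by2)
    inter = max(0, ix2 - ix1) * max(0, iy2 - iy1)
    union = (ax2 - ax1) * (ay2 - ay1) + (bx2 - bx1) * (by2 - by1) - inter
    return inter / union if union else 0.0
```

In non-maximum suppression, boxes whose IoU with a higher-scoring box exceeds a threshold are discarded.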
Tensor Processing Units (TPUs) are essential for processing large amounts of data efficiently, particularly in deep learning applications. Developed by Google, these devices are application-specific integrated circuits (ASICs) that accelerate AI workloads, particularly tasks related to neural networks and deep learning.
Overview of TensorFlow: TensorFlow emerged as a key tool for data scientists and statisticians, facilitating the implementation of machine learning models. Its dataflow architecture enables users to execute complex statistical analyses and build sophisticated models efficiently.
Caner Turkmen is a Senior Applied Scientist at Amazon Web Services, where he works on research problems at the intersection of machine learning and forecasting. Before joining AWS, he worked in the management consulting industry as a data scientist, serving the financial services and telecommunications sectors.
While most ML classes teach students to model a fixed dataset, experienced data scientists know that improving the data brings higher ROI than tinkering with models. Our goal is to enable all developers to find and fix data issues as effectively as today's best data scientists. How does cleanlab work?
I serve as the Principal Data Scientist at a prominent healthcare firm, where I lead a small team dedicated to addressing patient needs. Over the past 11 years in the field of data science, I've witnessed significant transformations.
In general, a data scientist should have a basic understanding of the following concepts related to kernels in machine learning: 1. Support Vector Machine: a Support Vector Machine (SVM) is a supervised learning algorithm used for classification and regression analysis. What are kernels?
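To make the kernel idea concrete, here is a pure-Python sketch of two kernels commonly used with SVMs. The gamma default is an arbitrary illustrative choice, not a value from the article:

```python
import math

def linear_kernel(x, y):
    """Linear kernel: k(x, y) = <x, y>, the plain dot product."""
    return sum(a * b for a, b in zip(x, y))

def rbf_kernel(x, y, gamma=0.5):
    """RBF (Gaussian) kernel: k(x, y) = exp(-gamma * ||x - y||^2).
    Implicitly maps inputs into an infinite-dimensional feature space."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-gamma * sq_dist)
```

An SVM never computes the feature map explicitly; it only evaluates k(x, y) between pairs of points (the "kernel trick").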
The data set only helps if the model is well trained and supervised. A Mongolian pharmaceutical company ran a pilot study in 2018 to detect fake drugs, an initiative with the potential to save hundreds of thousands of lives. Experts train AI specifically to fight counterfeit products.
Predictive analytics: Predictive analytics leverages historical data and statistical algorithms to make predictions about future events or trends. Machine learning and AI analytics: Machine learning and AI analytics leverage advanced algorithms to automate the analysis of data, discover hidden patterns, and make predictions.
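To make the "historical data plus statistical algorithms" definition concrete, here is a toy sketch of predictive analytics: fit a least-squares trend line to a series and extrapolate one step ahead. This is a minimal illustration, not any product's method:

```python
def fit_trend(ys):
    """Least-squares line through (0, ys[0]), (1, ys[1]), ...; returns (slope, intercept)."""
    n = len(ys)
    xs = range(n)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    slope = (sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
             / sum((x - mean_x) ** 2 for x in xs))
    return slope, mean_y - slope * mean_x

def predict_next(ys):
    """Predict the next value by extrapolating the fitted trend one step."""
    slope, intercept = fit_trend(ys)
    return slope * len(ys) + intercept
```

Real predictive-analytics pipelines add seasonality, exogenous features, and uncertainty estimates, but the core idea is the same: learn a pattern from history, then extrapolate.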
Today’s data management and analytics products have infused artificial intelligence (AI) and machine learning (ML) algorithms into their core capabilities. These modern tools will auto-profile the data, detect joins and overlaps, and offer recommendations. 2) Line of business is taking a more active role in data projects.
The player tracking data contains each player's position, direction, acceleration, and more (in x, y coordinates). There are around 3,000 punt plays and 4,000 kickoff plays from four NFL seasons (2018–2021). The data distributions for punts and kickoffs are different.
Incorporating computer vision methods and algorithms into robots enables them to view and understand their environment. Object recognition and tracking algorithms include the CamShift algorithm, the Kalman filter, and the particle filter, among others.
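Of the tracking algorithms listed, the Kalman filter is the easiest to sketch. Below is a minimal 1-D version that smooths noisy measurements of a roughly constant value; the noise parameters q and r are illustrative assumptions, and real trackers use multi-dimensional state (position plus velocity):

```python
def kalman_1d(measurements, q=1e-3, r=0.25, x0=0.0, p0=1.0):
    """Minimal 1-D Kalman filter.
    q: process noise variance, r: measurement noise variance,
    x0/p0: initial state estimate and its variance."""
    x, p = x0, p0
    estimates = []
    for z in measurements:
        # Predict: state assumed constant, uncertainty grows by process noise.
        p = p + q
        # Update: blend prediction and measurement via the Kalman gain.
        k = p / (p + r)
        x = x + k * (z - x)
        p = (1 - k) * p
        estimates.append(x)
    return estimates
```

Feeding it a constant measurement shows the estimate converging from the prior toward the measured value while the gain shrinks.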
An additional 2018 study found that each systematic literature review (SLR) takes nearly 1,200 total hours per project. This ongoing process straddles the intersection of evidence-based medicine, data science, and artificial intelligence (AI).
The structure of the causal model is initially learned from data, while subject-matter expertise (trusted literature or empirical beliefs) is used to postulate additional dependencies and independencies between random variables and intervention variables, and to assert that the structure is causal.
After all, this is what machine learning really is: a series of algorithms rooted in mathematics that can iterate some internal parameters based on data. This is understandable: a 2018 PwC report, "Will Robots Really Steal Our Jobs?", suggested that 30% of UK jobs will be affected by automation by the 2030s.
Our solution is based on the DINO algorithm and uses the SageMaker distributed data parallel library (SMDDP) to split the data over multiple GPU instances. The images document the land cover, or physical surface features, of ten European countries between June 2017 and May 2018.
The same report estimates that in 2018 alone, AI contributed $2 trillion to global GDP. While it has been fun for data scientists to test what machine learning can do, companies that invest huge resources into their AI solutions want to see results. "AI could contribute up to $15.7 trillion to the global economy by 2030."
Together with data stores, foundation models make it possible to create and customize generative AI tools for organizations across industries that are looking to optimize customer care, marketing, HR (including talent acquisition), and IT functions. Google created BERT, an open-source model, in 2018.
We also demonstrate the performance of our state-of-the-art point cloud-based product lifecycle prediction algorithm. Challenges: one of the challenges we faced with fine-grained (micro-level) modeling, such as product-level models for sales prediction, was missing sales data. We next calculated the MAPE for the actual sales values.
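Since the passage mentions computing MAPE against actual sales, here is the standard formula as a small helper. Skipping zero actuals is our own guard against division by zero, not necessarily what the authors did:

```python
def mape(actuals, forecasts):
    """Mean absolute percentage error, in percent.
    MAPE = 100 * mean(|actual - forecast| / |actual|), skipping zero actuals."""
    pairs = [(a, f) for a, f in zip(actuals, forecasts) if a != 0]
    return 100.0 * sum(abs((a - f) / a) for a, f in pairs) / len(pairs)
```

Its sensitivity to near-zero actuals is one reason sparse or missing sales data makes product-level evaluation hard.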
Quantitative evaluation We utilize 2018–2020 season data for model training and validation, and 2021 season data for model evaluation. We design an algorithm that automatically identifies the ambiguity between these two classes as the overlapping region of the clusters. Each season consists of around 17,000 plays.
In 1996, Moret founded the ACM Journal of Experimental Algorithmics, and he remained editor-in-chief of the journal until 2003. About the Authors: Xin Huang is a Senior Applied Scientist for Amazon SageMaker JumpStart and Amazon SageMaker built-in algorithms. He focuses on developing scalable machine learning algorithms.
By 2018, about 200 ICO projects were funded. "I have nothing but optimism for this space because if you look back to 2014 or 2018 and where we are now, it's an exponential curve, and we're still early." I want individuals to have the ability to take control of their own data and to monetize it in whatever way.
Using historical data from the 2018–2023 Mexico Grand Prix and race data from the 2024 season, participants will analyze variables such as lap times, stint numbers, tire compounds, and pit stop timing. Whether you're a seasoned data pro or just starting out, there's a place for you in our vibrant community of data scientists.
After 116 years in business, legendary guitar maker Gibson filed for bankruptcy in 2018. A commitment to using data to become a customer-first organization rather than a self-described "developer-led community." Collaborating, they were able to align interests and algorithms to generate a breakthrough data product.
As per a recent report by Nasscom and Zynga, the number of data science jobs in India is set to grow from 2,720 in 2018 to 16,500 by 2025. Top 5 Colleges to Learn Data Science (Online Platforms): 1. The focus of this e-learning platform is to build proficiency in data science. Course Fee: the course fee starts from Rs.
Data scientists have been so preoccupied with whether they could build an algorithm that they didn't stop to think about whether they should. The report identifies key accountability practices around the principles of governance, data, performance, and monitoring to help federal agencies and others use AI responsibly.
SageMaker enables Marubeni to run ML and numerical optimization algorithms in a single environment. His team applies data science and digital technologies to support Marubeni Power's growth strategies. Before joining Marubeni, Hernan was a Data Scientist at Columbia University. He holds a Ph.D. in Computer Engineering.
Consider a scenario where legal practitioners are armed with clever algorithms capable of analyzing, comprehending, and extracting key insights from massive collections of legal papers. Algorithms can automatically detect and extract key items. But what if there was a technique to quickly and accurately solve this language puzzle?
By leveraging the power of neural networks, deep learning techniques breathe new life into recommendation algorithms, empowering them to handle complex data and surpass the limitations of their predecessors. Netflix’s movies and TV shows are recommended based on user ratings, viewing history, and platform interactions.
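As a toy illustration of the embedding-based scoring that neural recommenders build on, items can be ranked by dot product between a learned user vector and learned item vectors. The vectors and item ids below are made up for the sketch; a real system would learn them from ratings, viewing history, and interactions:

```python
def recommend(user_vec, item_vecs, k=2):
    """Rank items by dot-product affinity with the user embedding; return top-k ids."""
    scores = {item_id: sum(u * v for u, v in zip(user_vec, vec))
              for item_id, vec in item_vecs.items()}
    return sorted(scores, key=scores.get, reverse=True)[:k]
```

Deep models replace the plain dot product with learned nonlinear interactions, but retrieval still reduces to scoring candidates against a user representation.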
Solution overview: Ground Truth is a fully self-served and managed data labeling service that empowers data scientists, machine learning (ML) engineers, and researchers to build high-quality datasets. To learn more about Ground Truth, refer to Label Data, Amazon SageMaker Data Labeling FAQs, and the AWS Machine Learning Blog.
The data can be accessed via a PhysioNet repository, and details of the data access process can be found here [1]. The eICU data is ideal for developing ML algorithms, decision support tools, and advancing clinical research. We defined it as a binary classification task, where each data sample spans a 1-hour window.
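A hedged sketch of how such a binary classification task can be framed (the helper and the event-timestamp format are our assumptions, not the study's code): slice the record into fixed 1-hour windows and attach a 0/1 label to each window depending on whether the target event occurs inside it.

```python
from datetime import datetime, timedelta

def make_windows(events, start, end, window=timedelta(hours=1)):
    """Split [start, end) into fixed-size windows; label a window 1 if any
    event timestamp falls inside it, else 0 (binary classification targets)."""
    windows = []
    t = start
    while t < end:
        label = int(any(t <= e < t + window for e in events))
        windows.append((t, label))
        t += window
    return windows
```

Each (window start, label) pair then becomes one training sample, with features aggregated over that hour.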
At its core, Meta-Learning equips algorithms with the aptitude to quickly grasp new tasks and domains based on past experiences, paving the way for unparalleled problem-solving skills and generalization abilities. Reptile is a meta-learning algorithm that falls under model-agnostic meta-learning approaches.
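The Reptile update itself is strikingly simple: adapt a copy of the parameters to one task with a few gradient steps, then move the original parameters a fraction of the way toward the adapted copy. Here is a scalar-parameter sketch under our own toy assumptions (a quadratic task loss and illustrative learning rates):

```python
def reptile_step(theta, task_grad, inner_lr=0.01, inner_steps=5, meta_lr=0.1):
    """One Reptile meta-update on a scalar parameter.
    task_grad: gradient of one task's loss at a given parameter value."""
    adapted = theta
    for _ in range(inner_steps):
        adapted -= inner_lr * task_grad(adapted)  # inner-loop SGD on the task
    # Reptile update: theta <- theta + meta_lr * (adapted - theta)
    return theta + meta_lr * (adapted - theta)
```

Repeating this over many sampled tasks nudges theta toward an initialization from which each task is reachable in a few gradient steps, without the second-order terms MAML requires.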
LeCun received the 2018 Turing Award (often referred to as the "Nobel Prize of Computing"), together with Yoshua Bengio and Geoffrey Hinton, for their work on deep learning. He is also one of the main creators of the DjVu image compression technology (together with Léon Bottou and Patrick Haffner).
Laying the groundwork: Collecting ground truth data The foundation of any successful agent is high-quality ground truth data—the accurate, real-world observations used as reference for benchmarks and evaluating the performance of a model, algorithm, or system. Andrew Gordon Wilson before joining Amazon in 2018.
Algorithmic accountability: explainability ensures accountability in machine learning and AI systems. Audience and context: interpretability primarily targets researchers, data scientists, or experts interested in understanding the model's behavior and improving its performance.
Data scientists can build upon generalized FMs and fine-tune custom versions with domain-specific or task-specific training data. It is based on GPT and uses machine learning algorithms to generate code suggestions as developers write. That's where the "foundation" in foundation models comes in.
A 2018 study from McKinsey estimated the total global unclaimed value potential from AI/ML at $10-15 trillion, and Foundation Models will enable valuable applications that the firm couldn’t conceive of four years ago. Arora said that the approach yielded a lift in accuracy on all of the popular LLMs they tested on.
BERT, an open-source machine learning model for NLP, was developed by Google in 2018, but it had some limitations; to address them, a team at Facebook developed RoBERTa (Robustly Optimized BERT Pre-training Approach), a modified BERT model, in 2019.
Data processing and manipulation: Tools provide the necessary functionality for an agent to process and manipulate data. This includes cleaning and transforming data, performing calculations, or applying machine learning algorithms. He co-developed the Lush programming language with Léon Bottou.
Language model pretraining By far the biggest news in NLP research over 2018 was the success of language model pretraining. In 2018, a number of papers showed that a simple language modelling objective worked well for LSTM models. This is exactly what algorithms like word2vec, GloVe and FastText set out to solve. Devlin et al.
al, 2015) is a twist on the word2vec family of algorithms that lets you learn more interesting word vectors. However, established test sets often don’t correspond well to the data being used, or the definition of similarity that the application requires.