How to build reusable data cleaning pipelines with scikit-learn

Snorkel AI

Jason Goldfarb, senior data scientist at State Farm, gave a presentation entitled “Reusable Data Cleaning Pipelines in Python” at Snorkel AI’s Future of Data-Centric AI virtual conference in August 2022. It has always amazed me how much time the data cleaning portion of my job takes to complete.
