article thumbnail

How to build reusable data cleaning pipelines with scikit-learn

Snorkel AI

While there are a lot of benefits to using data pipelines, they’re not without limitations. Traditional exploratory data analysis is difficult to accomplish using pipelines given that the data transformations achieved at each step are overwritten by the proceeding step in the pipeline. JG : Exactly.

article thumbnail

How to build reusable data cleaning pipelines with scikit-learn

Snorkel AI

While there are a lot of benefits to using data pipelines, they’re not without limitations. Traditional exploratory data analysis is difficult to accomplish using pipelines given that the data transformations achieved at each step are overwritten by the proceeding step in the pipeline. JG : Exactly.