The Solution

How Skrub helps

Skrub is an open-source Python library for preprocessing dataframes in machine learning pipelines. It builds on popular dataframe libraries such as pandas and Polars, providing high-level tools for data exploration, cleaning, and feature engineering without replacing the underlying dataframe structures.