Skrub (formally Dirty_cat)
skrub is a python package for machine-learning on dataframes. It includes our research on encoding non curated categories. It provides encoders that are robust to morphological variants, such as typos, in the category strings. It can be considered as a drop in replacement for “One Hot Encoder” from scikit-learn.
Website: find API documentation and examples on the package