We recently did a couple of talks about our
vtreat data treatment package: one for the Python version, and one for the R version. If you are fitting machine learning models on messy real-world data, then you might find
vtreat useful. Do check out one of the introductory talks below.
- Preparing Messy Data for Supervised Learning at PyData Los Angeles 2019
- Advanced Data Preparation for Supervised Machine Learning for the Why R webinar May 7. 2020
The talks are essentially the same; pick the one in your preferred programming language.
For more documentation on