Cleaning Confused Collections of Characters


Success in data science projects depends on clean input data. Text data is often badly encoded, lacks data types and is inconsistent. This talk is aimed at the intermediate Pythonista: I’ll cover the time-saving tools I use at ModelInsight to clean and normalise my data, so you can work on new projects more easily.
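
To give a flavour of the three problems named above (bad encoding, inconsistency, missing data types), here is a minimal sketch using only the Python standard library. It is not the ModelInsight tooling covered in the talk; the function names repair_mojibake, normalise and coerce are hypothetical, and the encoding repair assumes one specific failure mode (UTF-8 bytes mistakenly decoded as Latin-1).

```python
import unicodedata


def repair_mojibake(text: str) -> str:
    """Undo one common mis-decoding: UTF-8 bytes that were read as Latin-1.

    Assumption: this is the only kind of damage present. If the round trip
    fails, the text is returned unchanged.
    """
    try:
        return text.encode("latin-1").decode("utf-8")
    except (UnicodeEncodeError, UnicodeDecodeError):
        return text


def normalise(text: str) -> str:
    """Apply Unicode NFKC normalisation and collapse runs of whitespace."""
    text = unicodedata.normalize("NFKC", text)
    return " ".join(text.split())


def coerce(value: str):
    """Try to give a string field a data type: int, then float, else str."""
    for cast in (int, float):
        try:
            return cast(value)
        except ValueError:
            pass
    return value


if __name__ == "__main__":
    raw = ["caf\u00c3\u00a9", "  42 ", "3.5", "n/a"]
    cleaned = [coerce(normalise(repair_mojibake(field))) for field in raw]
    print(cleaned)
```

Real data usually needs more than this (mixed encodings, dates, locale-specific numbers), which is where dedicated tools save time.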


