Tools and Tricks from a Pragmatic Data Scientist

Description

PyData Amsterdam 2016

In this talk I will share some of my favourite tools and tricks that I use every day as a Data Scientist. They help me solve all kinds of problems, from statistical modeling all the way to scalability issues. Expect machine learning, math, algorithms and, of course, Python. All of them are necessary to be a Pragmatic Data Scientist.

Abstract

The Pragmatic Data Scientist Catalog:

  1. kNN: from slow to fast using math.
  2. Clustering: from k-Means to spherical clustering in a breeze.
  3. Missing values in classification: from bad to good using old truth.
  4. Power law: bucketing the beast.
  5. The King of proportions Ranking.
  6. Hyperparameter search: one tool to rule them all.
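
As an illustration of item 2, here is a minimal sketch of spherical clustering, assuming it means k-means run on unit-length vectors so that points are grouped by cosine similarity rather than Euclidean distance. The abstract does not say how the talk implements it: the function names `normalize`, `dot` and `spherical_kmeans`, the parameters, and the pure-Python style are all assumptions made for this example, not the speaker's code.

```python
import math
import random


def normalize(v):
    """Scale a vector to unit length (zero vectors are returned unchanged)."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v] if norm > 0 else list(v)


def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def spherical_kmeans(points, k, n_iter=20, seed=0):
    """Hypothetical helper: k-means on unit vectors, with each centroid
    projected back onto the unit sphere after every update."""
    rng = random.Random(seed)
    data = [normalize(p) for p in points]
    centroids = rng.sample(data, k)  # start from k of the normalized points
    labels = [0] * len(data)
    for _ in range(n_iter):
        # Assignment step: for unit vectors, the largest dot product
        # is the largest cosine similarity.
        labels = [max(range(k), key=lambda j: dot(p, centroids[j])) for p in data]
        # Update step: average the members, then renormalize the mean.
        for j in range(k):
            members = [p for p, lab in zip(data, labels) if lab == j]
            if members:  # keep the old centroid if a cluster is empty
                mean = [sum(col) / len(members) for col in zip(*members)]
                centroids[j] = normalize(mean)
    return labels, centroids


# Two points near the x-axis and two near the y-axis, with different magnitudes.
labels, centroids = spherical_kmeans([[1, 0.1], [10, 2], [0.1, 1], [2, 10]], k=2)
```

In this sketch the points are grouped by direction, not by magnitude, which is the usual reason to prefer a spherical variant over plain k-means, for example on text vectors of very different lengths.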

Python code for all the tricks and tools will be available on GitHub for everyone to use, change and challenge.
