Contribute Media
A thank you to everyone who has made this possible: Read More

Word Embeddings for fun and profit in Gensim

Description

PyData London 2016

Python has great open source libraries to extract data from its most raw format - the human readable text. We will discuss a family of algorithms called word embeddings - Word2Vec being most famous and how they can be used in practice using Gensim package

A tour of word embeddings, their Python implementations and their use in the industry.

We will start with theory and academic results for word2vec, glove, swivel and Word Movers Distance. Then proceed to their Python open source implementations mainly in the Gensim package

Slides available here: https://speakerdeck.com/tmylk/word-embeddings-for-fun-and-profit-with-gensim-pydata-london-2016

Improve this page