This tutorial aims to introduce beginner Python learners to the diverse types of data a data scientist may face at work: time series, images, videos, text, graphs, geospatial data. In two hours participants will become comfortable handling each type of data and extracting interesting information from it. Participants will also learn tips and tricks how to handle large datasets of each kind.
Data are all around us and come in all sizes and shapes. As a beginner Python learner you often dream to become a master of all data types and handle them with ease: but soon you get intimidated by piles of libraries and unfamiliar jargon surrounding specific data formats and processing tools. The good news is that once you overcome the initial hurdles, you realize that you can analyze these diverse datasets using similar methodology.
The goal of this tutorial is to provide participants with hands-on experience working with diverse types of data: text, sound, video, graphs, GIS. Each data module will include an example with a small dataset to introduce the properties of the data type, and one with a larger dataset to equip the learners with tools to handle the vast data in the wild. Throughout the tutorial we will illustrate how same data mining techniques can be used on different types of data.