Contribute Media
A thank you to everyone who has made this possible: Read More

R2 DAY2-03 Information extraction with Python - jiawei chen (PyCon APAC 2015)

Description

Speaker: jiawei chen

This talk will present a named entity recognition (NER) system for extracting attributes and values, like person, company, place or time, from various of text data. I will introduce how to combine several python tools to build this system. First, use a python written annotation tool BRAT to create a custom annotated corpus. Second, use python to link CRFsuite, training a Conditional Random Fields model to labeling our list of text data, the labeling result will be further analyzed by pandas and scikit-learn.

About the speaker

A search engineer, usually like to study machine learning and natural language processing.

頭銜 search engineer

https://tw.pycon.org/2015apac/zh/program/61

Details

Improve this page