CES Data Science: Information Extraction
Information extraction is the process of deriving structured information (such as alive(Elvis)) from digital text (such as the sentence "Elvis is alive"). We will focus on factual and semantic information extraction: named entity recognition, entity disambiguation, instance extraction, and fact extraction. We will also touch upon applications of both information extraction and the Semantic Web, including industrial systems such as Google’s Knowledge Graph, IBM’s Watson question answering system, and Facebook’s Open Graph, as well as academic projects such as YAGO, DBpedia, and NELL. Finally, we will talk about how the big Internet companies use the information that they extract and gather, and how we can protect ourselves against these practices.
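To make the opening example concrete, here is a minimal sketch of how the sentence "Elvis is alive" could be turned into the fact alive(Elvis). It is only an illustration, not course material: the single hand-written pattern and the function names `extract_fact` and `format_fact` are assumptions made for this example, and real information extraction systems rely on far more robust techniques.

```python
import re

# Hypothetical toy pattern (an assumption for illustration only):
# "<Entity> is <property>" is read as the fact property(Entity).
PATTERN = re.compile(r"^(?P<entity>[A-Z]\w*) is (?P<property>[a-z]+)\.?$")


def extract_fact(sentence):
    """Return (predicate, argument) for sentences like 'Elvis is alive', else None."""
    match = PATTERN.match(sentence.strip())
    if match is None:
        return None
    return (match.group("property"), match.group("entity"))


def format_fact(fact):
    """Render a (predicate, argument) pair in the notation predicate(argument)."""
    predicate, argument = fact
    return f"{predicate}({argument})"


fact = extract_fact("Elvis is alive")
print(format_fact(fact))  # prints: alive(Elvis)
```

A single pattern like this breaks on almost any variation of the sentence, which is one reason the tasks listed above (named entity recognition, disambiguation, fact extraction) are treated as separate problems.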
Course title: Information Extraction
Time: Thursday 2018-06-21 and Friday 2018-06-22
Place: Télécom ParisTech, Site Dareau, 37-39, rue Dareau, 75014 Paris, Room DB003
Evaluation: The class is evaluated through the lab work, which every student completes individually.
Teachers:
- Fabian Suchanek
- Julien Romero (for the labs)
- Dhouha Bouamor (from Wipolo)
Schedule:
- Thursday 2018-06-21:
- Friday 2018-06-22:
  - 9:00 - 10:30 Dhouha Bouamor: Information Extraction at Wipolo
  - 10:45 - 12:15 Data security & privacy
  - Lunch break
  - 14:00 - 17:00 Lab (with Julien Romero)