Conditions of usage Creative Commons License BY-NC Share

Knowledge Base Construction

Data&Knowledge program of Paris-Saclay University
© 2015 Fabian M. Suchanek, with Luis Galárraga


In this class, we will give an overview of Information Extraction techniques and the Semantic Web. Information extraction is the process of deriving structured information (such as alive(Elvis)) from digital text (such as the sentence “Elvis is alive”). The lecture will focus on factual and semantic information extraction, i.e., we will cover named entity recognition, entity disambiguation, instance extraction, fact extraction, and ontological information extraction. We will also touch upon applications of both Information Extraction and the Semantic Web, such as Google's Knowledge Graph and Knowledge Vault and IBM's Watson question answering system, as well as academic projects such as YAGO, DBpedia, and NELL.
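As a rough illustration of the idea, the following sketch turns the sentence “Elvis is alive” into the fact alive(Elvis). It is only a toy under strong assumptions: the pattern and the function name extract_facts are hypothetical, chosen for this example, and it handles only sentences of the form "<Name> is <word>". The techniques covered in the lecture go well beyond a single regular expression.

```python
import re

# Hypothetical pattern (an assumption for this sketch): a capitalized
# name, the word "is", and a lowercase word that serves as the predicate.
PATTERN = re.compile(r"\b([A-Z][a-z]+) is ([a-z]+)\b")


def extract_facts(text):
    """Return facts of the form predicate(Entity) found in the text."""
    return [f"{predicate}({entity})" for entity, predicate in PATTERN.findall(text)]


if __name__ == "__main__":
    print(extract_facts("Elvis is alive."))
```

A pattern this naive also matches sentences such as "Paris is in France" (yielding in(Paris)), which is one reason why fact extraction needs more careful methods.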


Course title: Information Extraction
Language: English
Location: Télécom ParisTech, 46 rue Barrault, 75013 Paris, France
Room: variable, please see the calendar
Time: Every Friday of the second period, starting on 2015-11-27

The class will be evaluated by work in the labs and by the final exam. The labs will be a combination of practical work (programming) and exam-like exercises. Every student works on their own. The lab work is to be handed in at the beginning of the next lecture.

The final grades are now available here.
Students who did not pass: please contact the lecturer for a re-exam.


Day          Session 1                  Session 2
2015-11-27   Motivation                 Knowledge Rep.
2015-12-04   Named Entity Recognition   Lab 1
2015-12-11   Named Entity Annotation    Lab 2
2015-12-18   Instance Extraction        Lab 3
2016-01-08   Unstructured Sources       Lab 4
2016-01-15   Reasoning                  Lab 5
2016-01-22   Mining                     Lab 6
The schedule beyond the current date is tentative. The PDF slides are provided for convenience only; the SVG slides are the authoritative ones.