Knowledge Graph Construction
Content
Language Models have revolutionized natural language processing. Yet, they can say wrong things in a very convincing way — they hallucinate. One solution to this problem can come from structured data such as knowledge graphs, which can serve to correct and inform the model. In this class, we will see how to bridge the gap between natural language (the sentence “Elvis is alive”) and structured information (the statement alive(Elvis)). We will cover the technical steps of information extraction: named entity recognition, entity disambiguation, and fact extraction. For each of them, we will see different methods: fine-tuning language models, prompt engineering, and training-free procedures. Finally, we will talk about techniques for knowledge cleaning: link prediction, entity alignment and rule mining.
Grading
The course is graded by 6 labs and a final exam.
- Final grade: 50% labs + 50% exam
- Final grade at re-take: 50% original labs + 50% reexam
Discussing assignments together is allowed, but each student must write their own solution. No sharing of code, plagiarism entails a grade of 0 for the lab/exam.
Teachers:
Schedule
The class takes place at Telecom Paris.
- Introduction (2024-11-15, room 1C47)
-
- Introduction to Information Extraction
- Knowledge Graphs
- Current research topics
- Named Entity Recognition and Classification (2024-11-22, room 1D22)
-
- Knowledge Representation
- Named Entity Recognition and Classification
- A quick refresher of Regular Expressions
- Lab: training-free NERC
- Typing and Disambiguation (2024-11-29, room 1A222)
-
- Prompt Engineering
- Entity Disambiguation
- Lab: Disambiguation with prompt engineering (without Fabian)
- Fact extraction (2024-12-06, room 1A222)
-
- Fact Extraction
- Dependency Parsing
- Constrained Decoding
- Lab: Fact extraction by constrained decoding
- Fact Extraction by Reasoning (2024-12-13, room 1A222)
-
- Extractive Entity Typing
- Information Extraction by Reasoning
- Lab: Max Sat
- Rule Mining (2024-12-20, room 1A222)
-
- Rule Mining
- Link Prediction
- Lab: Rule mining
- Semantic Web (2025-01-10, room 1A222)
-
- Semantic Web
- Reasoning on the Semantic Web (RDF, OWL, SHACL)
- Lab: KB cleanup
- Exam (2025-01-17, room 0C04)
-
The exam is “closed-book”: no materials are allowed except for a pen.