Knowledge BasesA knowledge base (KB) is a structured, computer-processable description of the world. A KB can be thought of as a graph, in which the nodes are entities and the edges are relations. Here is an example: KBs serve all kinds of purposes, such as natural language understanding, intelligent assistance, or machine translation. Our research is guided by specific real-world problems on KBs. I work together with my colleagues and students in the DIG Team of Télécom ParisTech and the Max Planck Institute. We aim to formalize problems, to design principled models for their solution, and to develop real systems that produce that solution.
ProjectsHere are the projects that we currently work on:
- YAGO is a large open source knowledge base constructed from WordNet, Wikipedia, and other sources.
- Schema mining
- Our goal is to deduce schema information automatically from schema-free knowledge bases. We work on mining rules, mining keys, mining completeness, and mining obligatory attributes of classes.
- Medical imaging
- We work on mapping brain activity to scientific terms (in collaboration with INRIA Saclay).
- Information extraction
- We work on repairing regular expressions automatically.
Older projects are
- Combinatorial creativity: a field of research concerned with making computers creative.
- DIVINA: A system that helps internet users make sure that their internet accounts are safe and secure.
- IBEX: An approach to harvest entities such as people, commercial products, and books from the Web.
- Semantic Culturomics: This project combines knowledge from newspapers and knowledge bases to mine trends in history and society.
- PARIS: PARIS is a project to learn mappings between knowledge bases.
- LEILA and SOFIE: These are projects that extract ontological information from natural language texts. The projects are no longer actively maintained.
- Watermarking: This project developed methods to protect ontological knowledge against plagiarism.
Students / PostDocs
- Nedeljko Radulovic (PhD student, 2018-, co-advised with Albert Bifet)
- Thomas Pellisier Tanon (PhD student, 2017-, co-advised with Antoine Amarilli)
- Jonathan Lajus (PhD student, 2016-)
- Jérôme Dockès (PhD student, 2016-, co-advised with Gaël Varoquaux)
- Julien Romero (PhD student, 2017-)
Former Students / PostDocs
- Camille Bourgaux (Postdoc, 2017-2018; now at CNRS)
- Thomas Rebele (PhD student, 2015-2018; now at Orchestra Networks)
- Katerina Tzompanaki (Postdoc in 2016, since 2016 associate professor at the University of Cergy)
- Danai Symeonidou (Postdoc in 2015, since 2015 researcher at INRA)
- Luis Galárraga (PhD 2012-2016, since 2017 researcher at INRIA Rennes)