University of Grenoble Alpes, 2nd year Master of science in informatics (MoSIG), specialty Artificial intelligence and the web

Fundamentals of Data Processing and Distributed Knowledge

Lecturers
Jérôme Euzenat (Jerome : Euzenat # inria : fr)
Pierre Genevès (Pierre : Geneves # inria : fr)
Language
English
Credits
36h, 6 ETCS
Evaluation
Marks are given after All documents allowed.
Official web site (totally obsolete)
WMM9MO60
MoSIG web site
FDPDK

Teams

Objective

Modern computing increasingly takes advantage of large amounts of distributed data and knowledge. This is grounded on theoretical principles borrowing to several fields of computer science such as programming languages, data bases, structured documentation, logic and artificial intelligence. The goal of this course is to present some of them, the problems that they solve and those that they uncover. The course considers two perspectives on data and knowledge: interpretation (what they mean), analysis (what they reveal) and processing (how can they be traversed efficiently and transformed safely).

The course offers a semantic perspective on distributed knowledge. Distributed knowledge may come from data sources using different ontologies on the semantic web, autonomous software agents learning knowledge or social robots interacting with different interlocutors. The course adopts a synthetic view on these. It first presents principles of the semantics of knowledge representation (RDF, OWL). Ontology alignments are then introduced to reduce the heterogeneity between distributed knowledge and their exploitation for answering federated queries is presented. A practical way for cooperating agents to evolve their knowledge is cultural knowledge evolution that is then illustrated. Finally, the course defines dynamic epistemic logics as a way to model the communication of knowledge and beliefs.

The course also introduces a perspective on programming language foundations, algorithms and tools for processing structured information, and in particular tree-shaped data. It consists in an introduction to relevant theoretical tools with an application to NoSQL (not only SQL) and XML technologies in particular. Theories and algorithmic toolboxes such as fundamentals of tree automata and tree logics are introduced, with applications to practical problems found for extracting information. Applications include efficient query evaluation, memory-efficient validation of document streams, robust type-safe processing of documents, static analysis of expressive queries, and static type-checking of programs manipulating structured information. The course also aims at presenting challenges, important results, and open issues in the area.

Place and time

Lectures are on Wednesday from 9h45 to 12h45 (most of the time in H202).

Planning (2023-2024)

This can be consulted on the official timetable Students>Ensimag>masters>mosig 2

DateTitleRoomLecturer
27/09Core XML (XML, Schemas, Parsing)H202PG
04/10Programming with XML (Streaming Validation, XPath, XQuery)H202PG
18/10Foundations of XML Types (Tree Grammars, Tree Automata)H202PG
25/10Tree Logics (FO, MSO)H202PG
08/11Tree Logics continued (μ-calculus)H202PG
15/11H202PG
22/11Knowledge, web, agents, etc.H202JE
29/11Ontology networksH202JE
06/12Belief revisionH202JE
13/12Distributed query evaluationH202JE
20/12Social and cultural knowledge evolutionH202JE
10/01Logics of knowledgeH208JE

Outline and documents

Foundations for processing tree-structured information

Lecturer: Pierre Genevès

Slides and relevant teaching material are available from: http://pierre.geneves.net/teaching-mosig.html.

Semantics of distributed knowledge

Lecturer: Jérôme Euzenat

This part of the course is now collected into its Separate page

Previous exams

In previous years, we had 3h exams at the end of the course. Starting in 2010-2011, we have two exams. This aims at being sure that the students know what is expected from them. In addition here are some past exams.

Here are some questions of an exam proposed at EPFL in 2009 and their corrections (in English) for the XML part only.

Here is the exam of 2008-2009 (in French) and its correction (in English) for the semantic web part only.

Here is the exam of 2009-2010 (in French or English) and its correction (in English) for the semantic web part only.

Here is the exam of 2010-2011 (in French or English) and its correction (in English) for the semantic web part only.

Here is the exam of 2012-2013 (in English) for the semantic web and social network part and its correction (in English) for the semantic web part only.

Here is the exam of 2013-2014 (in English) for the semantic web and social network part and its correction (in English) for the semantic web part only.

Here is the exam of 2014-2015 (in English) for the semantic web and social network part and its correction (in English) for the semantic web part only.

Here is the midterm exam of 2015-2016 and its correction for the semantic web part (in English).

Here is the midterm exam of 2016-2017 and its correction for the semantic web part (in English).

https://moex.inria.fr/teaching/fdk/