University of Grenoble Alpes, 2nd year Master of science in informatics (MoSIG), specialty Artificial intelligence and the web

Fundamentals of Data Processing and Distributed Knowledge

Lecturers
Jérôme Euzenat (Jerome : Euzenat # inria : fr)
Pierre Genevès (Pierre : Geneves # inria : fr)
Language
English
Credits
36h, 6 ETCS
Evaluation
Marks are given after All documents allowed.
Official web site
WMM9MO60
MoSIG web site
FDPDK

Teams

Objective

Modern computing increasingly takes advantage of large amounts of distributed data and knowledge. This is grounded on theoretical principles borrowing to several fields of computer science such as programming languages, data bases, structured documentation, logic and artificial intelligence. The goal of this course is to present some of them, the problems that they solve and those that they uncover. The course considers two perspectives on data and knowledge: interpretation (what they mean), analysis (what they reveal) and processing (how can they be traversed efficiently and transformed safely).

The course offers a semantic perspective on distributed knowledge. Distributed knowledge may come from data sources using different ontologies on the semantic web, autonomous software agents learning knowledge or social robots interacting with different interlocutors. The course adopts a synthetic view on these. It first presents principles of the semantics of knowledge representation (RDF, OWL). Ontology alignments are then introduced to reduce the heterogeneity between distributed knowledge and their exploitation for answering federated queries is presented. A practical way for cooperating agents to evolve their knowledge is cultural knowledge evolution that is then illustrated. Finally, the course defines dynamic epistemic logics as a way to model the communication of knowledge and beliefs.

The course also introduces a perspective on programming language foundations, algorithms and tools for processing structured information, and in particular tree-shaped data. It consists in an introduction to relevant theoretical tools with an application to NoSQL (not only SQL) and XML technologies in particular. Theories and algorithmic toolboxes such as fundamentals of tree automata and tree logics are introduced, with applications to practical problems found for extracting information. Applications include efficient query evaluation, memory-efficient validation of document streams, robust type-safe processing of documents, static analysis of expressive queries, and static type-checking of programs manipulating structured information. The course also aims at presenting challenges, important results, and open issues in the area.

Place and time

Lectures are on Wednesday from 9h45 to 12h45.

Schedule (2024-2025)

This can be consulted on the official timetable Students>Ensimag>masters>mosig 2

DateTitleRoomLecturer
25/09Core XML (XML, Schemas, Parsing)H102PG
02/10Programming with XML (Streaming Validation, XPath, XQuery)H101PG
09/10Foundations of XML Types (Tree Grammars, Tree Automata)H101PG
23/10Tree Logics (FO, MSO)H102PG
06/11Tree Logics continued (μ-calculus)C004PG
13/11C009PG
20/11Knowledge, web, agents, etc.C004JE
27/11Ontology networksC004JE
04/12Belief revisionC004JE
11/12Distributed query evaluationC005JE
18/12Social and cultural knowledge evolutionC004JE
08/01Logics of knowledgeC004JE

Outline and documents

Foundations for processing tree-structured information

Lecturer: Pierre Genevès

Slides and relevant teaching material are available from: http://pierre.geneves.net/teaching-mosig.html.

Semantics of distributed knowledge

Lecturer: Jérôme Euzenat

This part of the course is now collected into its Separate page.

The lecture notes are the only document that you have to look at and they are available here.

Exams

The course is evaluated with two exams one after the first part and the other after the second. This aims at being sure that the students know what is expected from them. In addition here are some past exams. Example of such exams are provided in the corresponding parts.

In case of second chance exams, these are given online through a suited evaluation software.