RKDE 2023

1st International Tutorial and Workshop on
Responsible Knowledge Discovery in Education

Side event at ECML-PKDD 2023 Conference
September 18th 2023, OGR Turin, Italy

Photo by OGR Torino

Call for papers

By offering a large number of highly diverse resources, learning platforms have been attracting lots of participants, and the interactions with these systems have generated a vast amount of learning-related data. Their collection, processing and analysis have promoted a significant growth of machine learning and knowledge discovery approaches and have opened up new opportunities for supporting and assessing educational experiences in a data-driven fashion. Being able to understand students' behavior and devise models able to provide data-driven decisions pertaining to the learning domain is a primary property of learning platforms, aiming at maximizing learning outcomes.

However, the use of knowledge discovery in education also raises a range of ethical challenges including transparency, reliability, fairness, and inclusiveness. The purpose of RKDE, Responsible Knowledge Discovery in Education, is to encourage principled research that will lead to the advancement of explainable, transparent, ethical and fair data mining and machine learning in the context of educational data. RKDE is an event organized into two moments: a tutorial to introduce the audience to the topic, and a workshop to discuss recent advances in the research field. The tutorial will provide a broad overview of the state of the art on the major applications for responsible approaches and their relationship with the educational context. Moreover, it will present hands-on case studies that practically shows how knowledge discovery tasks can be responsibly addressed in education. The workshop will seek top-quality submissions addressing uncovered important issues related to ethical, fair, explainable and transparent data mining and machine learning in education. Papers should present research results in any of the topics of interest for the workshop as well as application experiences, tools and promising preliminary ideas. RKDE asks for contributions from researchers, academia and industries, working on topics addressing these challenges primarily from a technical point of view, but also from a legal, ethical or sociological perspective.

Topics of interest include, but are not limited to:
  • Dataset Collection and Preparation:
    • New tools and systems for capturing educational data
    • Modeling representations of learners from data
    • Building representations of domain knowledge from data
    • Integrating data from multiple (and diverse) data sources
    • Creating datasets that allow to explore ethical dimensions
    • Designing collection protocols tailored to responsible knowledge discovery
  • Techniques and Models:
    • Multimodal / semantic approaches for learners' behavior modeling or personalization
    • Adaptive question-answering and dialogue or automatically generating test questions
    • Personalized support tools and systems for communities of learners
    • Natural language processing applied on exam data in order to assign a grade to them
    • Temporal, behavioral, and physiological analysis of learners' behavior
    • Student engagement assessment via machine-learning techniques
    • Systems that detect and/or adapt the platform to emotional states of learners
    • Techniques to provide automated proctoring support during online examinations
    • Tools able to predict the learner's success or failure along the educational path
    • Developing fair and explainable models for different kinds of stakeholders
    • Developing privacy-protecting algorithms for learners' data processing
  • Evaluation Protocol and Metric Formulation:
    • Performing auditing studies with respect to bias and fairness
    • Defining objective metrics for knowledge discovery in education
    • Formulating bias-aware protocols to evaluate existing algorithms
    • Evaluating existing mitigation strategies in unexplored domains
    • Comparative studies of existing evaluation protocols and strategies
    • Analyzing efficiency and scalability issues of debiasing methods
    • Replicating previous studies with different samples, domains and/or contexts
  • Case Study Exploration:
    • Educational games
    • Learning management systems
    • Interactive simulations
    • Intelligent tutoring
    • Language assessment
Photo by Julia M Cameron from Pexels

Submission and publication

All contributions will be reviewed by at least three members of the Program Committee. All papers should be anonymized (double-blind review process). We strongly encourage making code and data available anonymously (e.g., in an anonymous GitHub repository via Anonymous GitHub. Moreover they should be written in English and be in LNCS format. Author instructions, style files, and the copyright form can be downloaded here: +

The following kinds of submissions will be considered:

  • Full papers between 12 and 16 pages
  • Reproducibility papers between 12 and 16 pages
  • Short papers between 6 and 11 pages
  • Position papers between 4 and 5 pages

It is planned that accepted papers will be published after the workshop by Springer in a volume of Lecture Notes in Computer Science (LNCS). Conditions for inclusion in the post-proceedings are that at least one of the co-authors has presented the paper at the workshop and the overall length of the paper should be no less than 4 pages. Pre-proceedings will be available online before the workshop. We also allow accepted papers to be presented without publication in the conference proceedings, if the authors choose to do so. Some of the full paper submissions may be accepted as short papers after review by the Program Committee. A special issue of a relevant international journal with extended versions of selected papers is under consideration.

The submission link is:

Important dates

Jun 25 2023

Submission deadline (ext)

Jul 17 2023

Accept/Reject Notification

Jul 31 2023

Camera-ready deadline

Sep 18 2023


Program Co-Chairs


Mirko Marras

University of Cagliari


Paola Mejia

École Polytechnique Fédérale de Lausanne


Agathe Merceron

Berliner Hochschule für Technik


Anna Monreale

University of Pisa


Daniela Rotelli

University of Pisa

Program Committee

  • Mario Allegra, CNR, Italy
  • Julien Broisin, Université Paul Sabatier, France
  • Armelle Brun, Université de Lorraine, France
  • Guanliang Chen, Monash University, Australia
  • Filippo Chiarello, University of Pisa, Italy
  • Irene-Angelica Chounta, University of Duisburg-Essen, Germany
  • Danilo Dessì, GESIS - Leibniz-Institut für Sozialwissenschaften, Germany
  • Manuel Gentile, CNR, Italy
  • Martin Hlosta, Swiss Distance University of Applied Sciences, Switzerland
  • Andrea Kienle, Fachhochschule Dortmund, Germany
  • Christopher Krauss, Fraunhofer FOKUS, Germany
  • Sebastien Lallé, Sorbonne Université, France
  • Marie Lefèvre, Lyon University, France
  • Vanda Luengo, Sorbonne Université, France
  • Donatella Merlini, University of Florence, Italy
  • Mathieu Muratet, Sorbonne Université, France
  • Tanya Nazaretsky, Weizmann Institute of Science, Israel
  • Benjamin Paasen, DFKI, Germany
  • Chiara Panciroli, University of Bologna, Italy
  • María-Jesus Rodriguez-Triana, Tallinn University, Estonia
  • Petra Sauer, Berlin University of Applied Sciences, Germany
  • Clara Schumacher, Humboldt Universität zu Berlin, Germany
  • Filippo Sciarrone, Università Mercatorum, Italy
  • Niels Seidel, FernUniversität in Hagen, Germany
  • Lele Sha, Monash University, Australia
  • Amel Yessad, Sorbonne Université, France
  • Andrea Zanellati, University of Bologna, Italy


14:00 - 14:10Welcome, General Overview, Supporting project PNRR-SoBigData.it presentation.
14:10 - 14:20Introduction
14:20 - 14:40Introduction to KDD for students' time management
14:40 - 15:45Hands-on practice (by SoBigData RI Jupyter Hub) held by Daniela Rotelli & Paola Mejia
15:45 - 16:00Open issues and research challenges + Final discussion
16:00 - 16:30Break
16:30 - 18:15Papers Presentations

Accepted Papers

  • Consolidation and Transmission of Multiple xAPI Data Sources from Virtual Learning Environments to Different Learning Record Stores
  • PICA: A Data-driven Synthesis of Peer Instruction and Continuous Assessment
  • A 2-step methodology for XAI in education
  • A Fair Post-Processing Method based on the MADD Metric for Predictive Student Models
  • Towards Personalized Educational Materials: Mapping Student Knowledge through Natural Language Processing
  • Distractor generation for multiple-choice questions with predictive prompting and large language models
  • The ChatGPT and Education Tweets Dataset


The event will take place at the ECML-PKDD 2023 Conference at Officine Grandi Riparazioni (OGR) Turin, Italy


The registration to the workshop is managed by the ECML-PLDD main conference at


The RKDE event was organised as part of the SoBigData.it project (Prot. IR0000013 - Call n. 3264 of 12/28/2021) initiatives aimed at training new users and communities in the usage of the research infrastructure (SoBigData.eu).

SoBigData.it receives funding from European Union - NextGenerationEU - National Recovery and Resilience Plan (Piano Nazionale di Ripresa e Resilienza, PNRR) - Project: “SoBigData.it - Strengthening the Italian RI for Social Mining and Big Data Analytics” - Prot. IR0000013 - Avviso n. 3264 del 28/12/2021.


All inquiries should be sent to:

Email: rkde.ecmlpkdd2023@gmail.com