Confidentiality-Preserving Retrieval-Augmented Generation over Educational Resources

Autori

  • Ludovico Boratto University of Cagliari
  • Francesco Congiu University of Cagliari
  • Gianni Fenu University of Cagliari
  • Mirko Marras University of Cagliari

Parole chiave:

educational content retrieval, learning management, information retrieval, large language models, retrieval-augmented generation, digital learning

Abstract

The evolution of Large Language Models (LLMs) has profoundly transformed access to educational resources, enabling interaction through natural language and fostering new learning modalities. However, the direct use of such models in educational settings raises critical issues related to response verification and the management of sensitive content. In this context, the Retrieval-Augmented Generation (RAG) paradigm represents a promising solution, as it combines the generative capabilities of LLMs with document retrieval from pre-processed knowledge bases, ensuring transparency and source traceability. This work presents a RAG framework designed for education, which integrates retrieval, fusion, and access-control modules to generate reliable responses that comply with users’ authorization levels. Experimental results show that the system enhances the clarity and relevance of the retrieved resources while preserving both equity and confidentiality.

Riferimenti bibliografici

European Commission. (2022). Ethics guidelines for trustworthy AI. Publications Office of the European Union. https://digital-strategy.ec.europa.eu/en/library/ethics-guidelines-trustworthy-ai

Floridi, L., & Cowls, J. (2021). A unified framework of five principles for AI in society. Harvard Data Sci-ence Review, 3(1). https://doi.org/10.1162/99608f92.8cd550d1

Guu, K., Lee, K., Tung, Z., Pasupat, P., & Chang, M.-W. (2020). Retrieval-augmented language model pre-training. In Proceedings of the 37th International Conference on Machine Learning (ICML 2020) (pp. 3929–3938). PMLR. http://proceedings.mlr.press/v119/guu20a.html

Holmes, W., Bialik, M., & Fadel, C. (2019). Artificial intelligence in education: Promises and implications for teaching and learning. Center for Curriculum Redesign. https://curriculumredesign.org/our-work/artificial-intelligence-in-education/

Holmes, W., & Tuomi, I. (2022). State of the art and practice in AI in education. European Journal of Edu-cation, 57(4), 542–570. https://doi.org/10.1111/ejed.12533

Hwang, G.-J., Xie, H., Wah, B. W., & Gašević, D. (2020). Vision, challenges, roles and research issues of Artificial Intelligence in Education. Computers and Education: Artificial Intelligence, 1, 100001. https://doi.org/10.1016/j.caeai.2020.100001

Izacard, G., & Grave, E. (2021). Leveraging passage retrieval with generative models for open-domain question answering. In Proceedings of the 16th Conference of the European Chapter of the ACL (EACL 2021) (pp. 874–880). https://aclanthology.org/2021.eacl-main.74

Lewis, P., Perez, E., Piktus, A., Petroni, F., Karpukhin, V., Goyal, N., Küttler, H., Lewis, M., Yih, W.-T., Rocktäschel, T., Riedel, S., & Kiela, D. (2020). Retrieval-augmented generation for knowledge-intensive NLP tasks. In Advances in Neural Information Processing Systems 33 (NeurIPS 2020). https://proceedings.neurips.cc/paper/2020/hash/6b493230205f780e1bc26945df7481e5-Abstract.html

Luckin, R., Holmes, W., Griffiths, M., & Forcier, L. B. (2016). Intelligence Unleashed: An Argument for AI in Education. Pearson. https://discovery.ucl.ac.uk/1475756/

Maragliano, R. (2019). Educazione digitale e intelligenza artificiale. FrancoAngeli. https://www.francoangeli.it/Libro/Educazione-degli-adulti,-sviluppo-di-comunità,-pedagogica-critica-Angela-Zucconi-e-il-Progetto-Pilota-Abruzzo?Id=25378

Nussbaum, Z., Morris, J. X., Duderstadt, B., & Mulyar, A. (2024). Nomic Embed: Training a reproducible long-context text embedder. arXiv. https://arxiv.org/abs/2402.01613

Selwyn, N. (2019). Should robots replace teachers? AI and the Future of Education. (1st ed.) Polity Press.

Thakur, N., Reimers, N., Rücklé, A., Srivastava, A., & Gurevych, I. (2021). BEIR: A heterogeneous bench-mark for zero-shot evaluation of information retrieval models. NeurIPS Datasets and Benchmarks Track (Round 2). https://openreview.net/forum?id=wCu6T5xFjeJ

Zawacki-Richter, O., Marín, V. I., Bond, M., & Gouverneur, F. (2019). Systematic review of research on ar-tificial intelligence applications in higher education. International Journal of Educational Technolo-gy in Higher Education, 16, 39. https://doi.org/10.1186/s41239-019-0171-0

##submission.downloads##

Pubblicato

2025-11-21

Come citare

Boratto, L., Congiu, F., Fenu, G., & Marras, M. (2025). Confidentiality-Preserving Retrieval-Augmented Generation over Educational Resources. Journal of Inclusive Methodology and Technology in Learning and Teaching, 5(3). Recuperato da https://www.inclusiveteaching.it/index.php/inclusiveteaching/article/view/447