E-learning platforms allow users with different skills to explore large collections of electronic documents and annotate them with notes and highlights. Generating summaries of these document collections is potentially useful for gaining insights into teaching materials. However, most existing summarizers are general-purpose. Thus, they do not consider neither annotations nor user skills during the document summarization process. This paper studies the application of a state-of-the-art summarization system, namely the Itemset-based Summarizer (ItemSum), in an e-learning context. The summarizer produces an ordered sequence of key phrases extracted from a teaching document. The aim of this work is threefold: (i) Evaluate the usefulness of the generated summaries for supporting individual and collective learning activities in a real context, (ii) understand to what extent document highlights, annotations, and user skill levels can be used to drive the summarization process, and (iii) generate multiple summaries of the same document tailored to users with different skill levels. To accomplish Task (i), a crowd-sourcing experience of evaluation of the generated summaries was conducted by involving the students of a B.S. course given by a technical university. The results show that the automatically generated summaries reect, to a large extent, the students' expectations. Hence, they can be useful for supporting learning activities in university-level Computer Science courses. To address Task (ii), three extended versions of the ItemSum summarizer, driven by highlights, annotations, and user skill levels, respectively, have been proposed and their performance improvements with respect to the baseline version have been validated on benchmark documents. Finally, to accomplish Task (iii) multiple summaries of the same benchmark documents have been generated by considering only the annotations made by the users with a different skill level. The results conrm that the summary content reects the level of expertise of the targeted users.

Learning from summaries: supporting e-learning activities by means of document summarization / Baralis, ELENA MARIA; Cagliero, Luca. - In: IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING. - ISSN 2168-6750. - STAMPA. - 4:3(2016), pp. 416-428. [10.1109/TETC.2015.2493338]

Learning from summaries: supporting e-learning activities by means of document summarization

BARALIS, ELENA MARIA;CAGLIERO, LUCA
2016

Abstract

E-learning platforms allow users with different skills to explore large collections of electronic documents and annotate them with notes and highlights. Generating summaries of these document collections is potentially useful for gaining insights into teaching materials. However, most existing summarizers are general-purpose. Thus, they do not consider neither annotations nor user skills during the document summarization process. This paper studies the application of a state-of-the-art summarization system, namely the Itemset-based Summarizer (ItemSum), in an e-learning context. The summarizer produces an ordered sequence of key phrases extracted from a teaching document. The aim of this work is threefold: (i) Evaluate the usefulness of the generated summaries for supporting individual and collective learning activities in a real context, (ii) understand to what extent document highlights, annotations, and user skill levels can be used to drive the summarization process, and (iii) generate multiple summaries of the same document tailored to users with different skill levels. To accomplish Task (i), a crowd-sourcing experience of evaluation of the generated summaries was conducted by involving the students of a B.S. course given by a technical university. The results show that the automatically generated summaries reect, to a large extent, the students' expectations. Hence, they can be useful for supporting learning activities in university-level Computer Science courses. To address Task (ii), three extended versions of the ItemSum summarizer, driven by highlights, annotations, and user skill levels, respectively, have been proposed and their performance improvements with respect to the baseline version have been validated on benchmark documents. Finally, to accomplish Task (iii) multiple summaries of the same benchmark documents have been generated by considering only the annotations made by the users with a different skill level. The results conrm that the summary content reects the level of expertise of the targeted users.
File in questo prodotto:
File Dimensione Formato  
TECT2015_Draft.pdf

accesso aperto

Descrizione: Draft versione finale autore (PDF)
Tipologia: 2. Post-print / Author's Accepted Manuscript
Licenza: PUBBLICO - Tutti i diritti riservati
Dimensione 353.69 kB
Formato Adobe PDF
353.69 kB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11583/2643232
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo