Current trends and advances in extractive text summarization : A comprehensive review

Journal article


Azam, Maryam, Khalid, Shah, Almutairi, Sulaiman, Khattak, Hasan Ali, Namoun, Abdallah, Ali, Amjad and Syed Muhammad Bilal, Hafiz. (2025). Current trends and advances in extractive text summarization : A comprehensive review. IEEE Access. 13, pp. 28150-28166. https://doi.org/10.1109/ACCESS.2025.3538886
AuthorsAzam, Maryam, Khalid, Shah, Almutairi, Sulaiman, Khattak, Hasan Ali, Namoun, Abdallah, Ali, Amjad and Syed Muhammad Bilal, Hafiz
Abstract

Given the rapid increase of textual data in various fields, text summarization has become essential for efficient information handling. Over recent decades, numerous methods have been proposed to enhance summarization processes, and various review papers and books have been published to encapsulate these methodologies and discuss their implications. However, existing reviews often fail to provide a comprehensive retrospective of recent advancements, particularly concerning detailed architectural frameworks, the field’s current state, evaluation methodologies, and unresolved challenges. This paper addresses this gap by presenting a detailed analysis of the extractive approaches, encompassing their inherent strengths, limitations, and underlying mechanisms. We present a detailed, multi-layered architectural framework designed to advance and develop summarization models, thereby supporting researchers in their endeavors. The text summarization framework consists mainly of text preprocessing, feature extraction, sentence scoring, use of a base model, sentence selection and output summary, and post-processing. Furthermore, this review of 145 research articles categorizes domain-specific summarization techniques, focusing on unique challenges and tailored strategies for news, scientific articles, and social media. These techniques include statistical, fuzzy logic, rule, optimization, graph, clustering-based, machine learning, and deep learning. We emphasize the impact of evaluation metrics and benchmark datasets in performance assessment, providing a detailed analysis of the commonly utilized datasets and metrics (mainly ROUGE-1, ROUGE-2, ROUGE-L, and ROUGE-S) in the current literature. This review article is a valuable resource for advancing text summarization techniques in natural language processing and machine learning by identifying future research directions and open challenges. Notable challenges include expanding summarization for complex tasks, multiple documents, multimodal user input, multi-format and multilingual data, refining the stopping criteria, and improving the evaluation metrics.

Keywordssurvey; text summarization; transformer-based models; domain-specific summarization; generic architecture; datasets and evaluation measures
Year2025
JournalIEEE Access
Journal citation13, pp. 28150-28166
PublisherIEEE Computer Society
ISSN2169-3536
Digital Object Identifier (DOI)https://doi.org/10.1109/ACCESS.2025.3538886
Scopus EID2-s2.0-85217561285
Open accessPublished as ‘gold’ (paid) open access
Page range28150-28166
FunderQassim University
Publisher's version
License
File Access Level
Open
Output statusPublished
Publication dates
Online04 Feb 2025
Publication process dates
Deposited22 Apr 2025
Grant IDQU-APC-2025
Additional information

© 2025 The Authors. This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/.

Permalink -

https://acuresearchbank.acu.edu.au/item/91q62/current-trends-and-advances-in-extractive-text-summarization-a-comprehensive-review

Download files


Publisher's version
OA_Azam_2025_Current_trends_and_advances_in_extractive.pdf
License: CC BY 4.0
File access level: Open

  • 5
    total views
  • 3
    total downloads
  • 5
    views this month
  • 3
    downloads this month
These values are for the period from 19th October 2020, when this repository was created.

Export as