Abstract
Extractive document summarization is a vital technique for condensing large volumes of text while retaining key information. This research introduces a dynamic feature space mapping approach to enhance extractive document summarization, aiming to succinctly encapsulate key information from extensive text volumes. The proposed method involves extracting various document properties like term frequency, sentence length, and position to comprehensively describe content. By employing a mapping function, these features are projected into a dynamic feature space, enhancing summarization efficiency and feature clarity. Clustering similar phrases in this space facilitates easier sentence grouping, aiding summary creation. Leveraging TF-IDF vectorization, the most representative phrases are chosen from each cluster based on importance and diversity. This process culminates in generating a high-quality document summary quickly and systematically. The dynamic mapping method streamlines sentence grouping, systematically capturing essential document attributes. This approach addresses challenges in extractive summarization, contributing significantly to automated text summarization. Its applicability spans domains requiring rapid extraction of information from vast textual data.
References
Ghodratnama, S., Beheshti, A., Zakershahrak, M. and Sobhanmanesh, F.,“Extractive document summarization based on dynamic feature space mapping”. IEEE Access, 8, pp.139084-139095, 2020.
Yadav, A.K., Singh, A., Dhiman, M., Vineet, Kaundal, R., Verma, A. and Yadav, D. “Extractive text summarization using deep learning approach”. International Journal of Information Technology, 14(5), pp.2407-2415, 2022.
Hayatin, N., Ghufron, K.M. and Wicaksono, G.W. “Summarization of COVID-19 news documents deep learning-based using transformer architecture”. TELKOMNIKA (Telecommunication Computing Electronics and Control), 19(3), pp.754-761,2021.
Zhang, M., Zhou, G., Yu, W., Huang, N. and Liu, W., “A comprehensive survey of abstractive text summarization based on deep learning”. Computational intelligence and neuroscience, 2022.1-21
Gambhir, M. and Gupta, V. “Deep learning-based extractive text summarization with word-level attention mechanism”. Multimedia Tools and Applications, 81(15), pp.20829-20852,2022
Gidiotis, A. and Tsoumakas, G. “A divide-and-conquer approach to the summarization of long documents”. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 28, pp.3029-3040, 2020.
Dilawari, A., Khan, M.U.G., Saleem, S. and Shaikh, F.S. “Neural Attention Model for Abstractive Text Summarization Using Linguistic Feature Space”. IEEE Access, 11, pp.23557-23564.,2023.
Jang, M. and Kang, P. “Learning-free unsupervised extractive summarization model”. IEEE Access, 9, pp.14358-14368, 2021.
Zhu, H., Dong, L., Wei, F., Qin, B. and Liu, T. “Transforming wikipedia into augmented data for query-focused summarization”. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 30, pp.2357-2367, 2022.
Yadav, D., Katna, R., Yadav, A.K. and Morato, J., “Feature Based Automatic Text Summarization Methods: A Comprehensive State-of-the-Art Survey”. IEEE Access, 10, pp.133981-134003, 2022.
Mao, Q., Li, J., Peng, H., He, S., Wang, L., Philip, S.Y. and Wang, Z. “Fact-driven abstractive summarization by utilizing multi-granular multi-relational knowledge”. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 30, pp.1665-1678, 2022.
Alqaisi, R., Ghanem, W. and Qaroush, A. “Extractive multi-document Arabic text summarization using evolutionary multi-objective optimization with K-medoid clustering”. IEEE Access, 8, pp.228206-228224, 2020.
Liu, W., Gao, Y., Li, J. and Yang, Y. “A combined extractive with abstractive model for summarization”. IEEE Access, 9, pp.43970-43980, 2021.
Jalil, Z., Nasir, J.A. and Nasir, M. “Extractive Multi-Document Summarization: A Review of Progress in the Last Decade”. IEEE Access, 9, pp.130928-130946, 2021.
Saeed, M.Y., Awais, M., Talib, R. and Younas, M. “Unstructured text documents summarization with multi-stage clustering”. IEEE Access, 8, pp.212838-212854, 2020.
