A new graph based text segmentation using Wikipedia for automatic text summarization

Mohsen Pourvali; Ph.D. Mohammad Saniee Abadeh

doi:10.14569/IJACSA.2012.030105

DOI: 10.14569/IJACSA.2012.030105

PDF

A new graph based text segmentation using Wikipedia for automatic text summarization

Author 1: Mohsen Pourvali

Author 2: Ph.D. Mohammad Saniee Abadeh

International Journal of Advanced Computer Science and Applications(IJACSA), Volume 3 Issue 1, 2012.

Abstract and Keywords
How to Cite this Article
{} BibTeX Source

Abstract: The technology of automatic document summarization is maturing and may provide a solution to the information overload problem. Nowadays, document summarization plays an important role in information retrieval. With a large volume of documents, presenting the user with a summary of each document greatly facilitates the task of finding the desired documents. Document summarization is a process of automatically creating a compressed version of a given document that provides useful information to users, and multi-document summarization is to produce a summary delivering the majority of information content from a set of documents about an explicit or implicit main topic. According to the input text, in this paper we use the knowledge base of Wikipedia and the words of the main text to create independent graphs. We will then determine the important of graphs. Then we are specified importance of graph and sentences that have topics with high importance. Finally, we extract sentences with high importance. The experimental results on an open benchmark datasets from DUC01 and DUC02 show that our proposed approach can improve the performance compared to state-of-the-art summarization approaches.

Keywords: text Summarization; Data Mining; Word Sense Disambiguation.

Mohsen Pourvali and Ph.D. Mohammad Saniee Abadeh, “A new graph based text segmentation using Wikipedia for automatic text summarization” International Journal of Advanced Computer Science and Applications(IJACSA), 3(1), 2012. http://dx.doi.org/10.14569/IJACSA.2012.030105

@article{Pourvali2012,
title = {A new graph based text segmentation using Wikipedia for automatic text summarization},
journal = {International Journal of Advanced Computer Science and Applications},
doi = {10.14569/IJACSA.2012.030105},
url = {http://dx.doi.org/10.14569/IJACSA.2012.030105},
year = {2012},
publisher = {The Science and Information Organization},
volume = {3},
number = {1},
author = {Mohsen Pourvali and Ph.D. Mohammad Saniee Abadeh}
}

Copyright Statement: This is an open access article licensed under a Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, even commercially as long as the original work is properly cited.

A new graph based text segmentation using Wikipedia for automatic text summarization

Upcoming Conferences