Loading…
Improving Unstructured Text Summarization Using An Ensemble Approach
Due to the explosive amounts of text data being created and organizations increased desire to leverage their data corpora, especially with the availability of Big Data platforms, there is not usually enough time to read and understand each document and make decisions based on document contents. Henc...
Saved in:
Published in: | GSTF International journal on computing 2014-10, Vol.4 (1), p.33 |
---|---|
Main Authors: | , |
Format: | Article |
Language: | English |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | Due to the explosive amounts of text data being created and organizations increased desire to leverage their data corpora, especially with the availability of Big Data platforms, there is not usually enough time to read and understand each document and make decisions based on document contents. Hence, there is a great demand for summarizing text documents to provide a precise substitute for the original documents. In this article, the authors have presented an ensemble approach that combines several of the well-researched text summarization techniques to produce heifer document summaries than individual techniques. An experiment that uses the ensemble approach was designed and results were evaluated. For the purpose of this experiment, the ensemble combined the cosine similarity, enhanced latent semantic analysis using SYD, and maximal marginal relevance measure algorithms. The ensemble was applied on two datasets and the results were found to be promising when compared to the manual summaries developed by human evaluators. |
---|---|
ISSN: | 2010-2283 2251-3043 |