Loading…
Hot topic extraction based on frequency, position, scattering and topical weight for time sliced news documents
Internet based news documents are the basic information transmission media. In such a case detecting hot topics and tracking the event development is most important. However, it is almost impossible to view all the generated topics, due to its large amount of size. Therefore it is necessary to rank...
Saved in:
Main Authors: | , |
---|---|
Format: | Conference Proceeding |
Language: | English |
Subjects: | |
Online Access: | Request full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | Internet based news documents are the basic information transmission media. In such a case detecting hot topics and tracking the event development is most important. However, it is almost impossible to view all the generated topics, due to its large amount of size. Therefore it is necessary to rank the topics. The topic ranking should be done on the importance basis. But this importance is determined by how frequently a topic appears and this importance varies in different time slots. For extracting hot topics, most of the text mining approaches with vector space model need to determine the weighting of the feature terms. Existing traditional algorithms can't achieve high accuracy for retrieving hot terms, because they have not considered position, scattering and topicality. This paper presents an innovative and effective hot term extraction by considering position, scattering and topicality of terms along with frequency. |
---|---|
DOI: | 10.1109/ICACT.2013.6710495 |