Loading…
HCMX: AN EFFICIENT HYBRID CLUSTERING APPROACH FOR MULTI-VERSION XML DOCUMENTS
In order to retrieve useful information from large number of growing XML documents on the web, effective management of XML document is essential. One solution is to cluster XML documents to find knowledge that promote effective information management and maintenance. But in the real world XML docume...
Saved in:
Published in: | Journal of Theoretical and Applied Information Technology 2015-12, Vol.82 (1), p.137-137 |
---|---|
Main Authors: | , |
Format: | Article |
Language: | English |
Subjects: | |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | In order to retrieve useful information from large number of growing XML documents on the web, effective management of XML document is essential. One solution is to cluster XML documents to find knowledge that promote effective information management and maintenance. But in the real world XML documents are dynamic in nature. In contrast to static XML documents, changes from one version of XML document to another version cannot be predicted. So clustering technique of static XML documents cannot be used to cluster multiple versions of XML documents. In case of multiversion XML documents, preliminary clustering solution is not become valid after document versions appear. XML documents are self descriptive in nature, which results in large document size. To find new clustering solution after change, comparisions between all documents is not viable solution. In this paper we have proposed hybrid clustering approach to cluster multiversion XML documents. This approach improves speed of clustering by limiting the growing size of XML documents by using homo-morphic compression scheme and using distance information from preliminary clustering solution with the changes recorded in compressed delta |
---|---|
ISSN: | 1817-3195 |