Loading…
Ontology-based soft computing and machine learning model for efficient retrieval
Unstructured and unorganized data always degrade the performance of search techniques and produce irrelevant results in response to the query as well as decrease the speed of retrieval results. Ontology in semantic web (SW) provides an adequate solution to represent the knowledge, because of its bac...
Saved in:
Published in: | Knowledge and information systems 2024-02, Vol.66 (2), p.1371-1402 |
---|---|
Main Authors: | , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | |
---|---|
cites | cdi_FETCH-LOGICAL-c270t-8acc3a5f80c963f05164e8e5abfd1a56c608ca45c750ff48fcf1a4964917d31c3 |
container_end_page | 1402 |
container_issue | 2 |
container_start_page | 1371 |
container_title | Knowledge and information systems |
container_volume | 66 |
creator | Anand, Sanjay Kumar Kumar, Suresh |
description | Unstructured and unorganized data always degrade the performance of search techniques and produce irrelevant results in response to the query as well as decrease the speed of retrieval results. Ontology in semantic web (SW) provides an adequate solution to represent the knowledge, because of its backbone knowledge of an application or domain. But, domain ontology has three basic problems while retrieving useful knowledge from a domain ontology: (a) structuring/arrangement, (b) unnecessary knowledge reduction, selection and extraction, and (c) speeding up the retrieval process. To resolve these problems, we proposed multi-level k-mean clustering approach with rough set and Bayesian network model for ontology (MLK-rBO). The proposed model works in four different phases—clustering, knowledge discovery, building a probabilistic network, and model evaluation. The model ensembles three different techniques, namely clustering, rough set (RS), and Bayesian network (BN). Finally, the proposed model is tested with statistical parameters and compared with other models, namely decision tree (DT), random forest (RF), and support vector machine (SVM) to evaluate performance. By analyzing experimental results, we observed that the MLK-rBO gives better accuracy: 98.36% for survey data (fever) and 86% for Wine quality data than available models. |
doi_str_mv | 10.1007/s10115-023-01990-8 |
format | article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2915809203</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2915809203</sourcerecordid><originalsourceid>FETCH-LOGICAL-c270t-8acc3a5f80c963f05164e8e5abfd1a56c608ca45c750ff48fcf1a4964917d31c3</originalsourceid><addsrcrecordid>eNp9kE1LAzEQhoMoWKt_wFPAc3Rms9lNjlL8gkI96Dmk2aRu2U1qshX6713dgjdPMwzP-w48hFwj3CJAfZcREAWDgjNApYDJEzKDAhXjiNXpcUde1-fkIuctANYV4oy8rsIQu7g5sLXJrqE5-oHa2O_2Qxs21ISG9sZ-tMHRzpkUfo59bFxHfUzUed_a1oWBJjek1n2Z7pKcedNld3Wcc_L--PC2eGbL1dPL4n7JbFHDwKSxlhvhJVhVcQ8Cq9JJJ8zaN2hEZSuQ1pTC1gK8L6W3Hk2pqlJh3XC0fE5upt5dip97lwe9jfsUxpe6UCgkqAL4SBUTZVPMOTmvd6ntTTpoBP1jTk_m9GhO_5rTcgzxKZRHOGxc-qv-J_UNsnZxlg</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2915809203</pqid></control><display><type>article</type><title>Ontology-based soft computing and machine learning model for efficient retrieval</title><source>Springer Nature</source><creator>Anand, Sanjay Kumar ; Kumar, Suresh</creator><creatorcontrib>Anand, Sanjay Kumar ; Kumar, Suresh</creatorcontrib><description>Unstructured and unorganized data always degrade the performance of search techniques and produce irrelevant results in response to the query as well as decrease the speed of retrieval results. Ontology in semantic web (SW) provides an adequate solution to represent the knowledge, because of its backbone knowledge of an application or domain. But, domain ontology has three basic problems while retrieving useful knowledge from a domain ontology: (a) structuring/arrangement, (b) unnecessary knowledge reduction, selection and extraction, and (c) speeding up the retrieval process. To resolve these problems, we proposed multi-level k-mean clustering approach with rough set and Bayesian network model for ontology (MLK-rBO). The proposed model works in four different phases—clustering, knowledge discovery, building a probabilistic network, and model evaluation. The model ensembles three different techniques, namely clustering, rough set (RS), and Bayesian network (BN). Finally, the proposed model is tested with statistical parameters and compared with other models, namely decision tree (DT), random forest (RF), and support vector machine (SVM) to evaluate performance. By analyzing experimental results, we observed that the MLK-rBO gives better accuracy: 98.36% for survey data (fever) and 86% for Wine quality data than available models.</description><identifier>ISSN: 0219-1377</identifier><identifier>EISSN: 0219-3116</identifier><identifier>DOI: 10.1007/s10115-023-01990-8</identifier><language>eng</language><publisher>London: Springer London</publisher><subject>Bayesian analysis ; Clustering ; Computer Science ; Data Mining and Knowledge Discovery ; Database Management ; Decision trees ; Information retrieval ; Information Storage and Retrieval ; Information Systems and Communication Service ; Information Systems Applications (incl.Internet) ; IT in Business ; Knowledge representation ; Machine learning ; Ontology ; Performance degradation ; Performance evaluation ; Regular Paper ; Semantic web ; Soft computing ; Statistical analysis ; Support vector machines ; Uncertainty ; Unstructured data</subject><ispartof>Knowledge and information systems, 2024-02, Vol.66 (2), p.1371-1402</ispartof><rights>The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2023. Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c270t-8acc3a5f80c963f05164e8e5abfd1a56c608ca45c750ff48fcf1a4964917d31c3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27923,27924</link.rule.ids></links><search><creatorcontrib>Anand, Sanjay Kumar</creatorcontrib><creatorcontrib>Kumar, Suresh</creatorcontrib><title>Ontology-based soft computing and machine learning model for efficient retrieval</title><title>Knowledge and information systems</title><addtitle>Knowl Inf Syst</addtitle><description>Unstructured and unorganized data always degrade the performance of search techniques and produce irrelevant results in response to the query as well as decrease the speed of retrieval results. Ontology in semantic web (SW) provides an adequate solution to represent the knowledge, because of its backbone knowledge of an application or domain. But, domain ontology has three basic problems while retrieving useful knowledge from a domain ontology: (a) structuring/arrangement, (b) unnecessary knowledge reduction, selection and extraction, and (c) speeding up the retrieval process. To resolve these problems, we proposed multi-level k-mean clustering approach with rough set and Bayesian network model for ontology (MLK-rBO). The proposed model works in four different phases—clustering, knowledge discovery, building a probabilistic network, and model evaluation. The model ensembles three different techniques, namely clustering, rough set (RS), and Bayesian network (BN). Finally, the proposed model is tested with statistical parameters and compared with other models, namely decision tree (DT), random forest (RF), and support vector machine (SVM) to evaluate performance. By analyzing experimental results, we observed that the MLK-rBO gives better accuracy: 98.36% for survey data (fever) and 86% for Wine quality data than available models.</description><subject>Bayesian analysis</subject><subject>Clustering</subject><subject>Computer Science</subject><subject>Data Mining and Knowledge Discovery</subject><subject>Database Management</subject><subject>Decision trees</subject><subject>Information retrieval</subject><subject>Information Storage and Retrieval</subject><subject>Information Systems and Communication Service</subject><subject>Information Systems Applications (incl.Internet)</subject><subject>IT in Business</subject><subject>Knowledge representation</subject><subject>Machine learning</subject><subject>Ontology</subject><subject>Performance degradation</subject><subject>Performance evaluation</subject><subject>Regular Paper</subject><subject>Semantic web</subject><subject>Soft computing</subject><subject>Statistical analysis</subject><subject>Support vector machines</subject><subject>Uncertainty</subject><subject>Unstructured data</subject><issn>0219-1377</issn><issn>0219-3116</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNp9kE1LAzEQhoMoWKt_wFPAc3Rms9lNjlL8gkI96Dmk2aRu2U1qshX6713dgjdPMwzP-w48hFwj3CJAfZcREAWDgjNApYDJEzKDAhXjiNXpcUde1-fkIuctANYV4oy8rsIQu7g5sLXJrqE5-oHa2O_2Qxs21ISG9sZ-tMHRzpkUfo59bFxHfUzUed_a1oWBJjek1n2Z7pKcedNld3Wcc_L--PC2eGbL1dPL4n7JbFHDwKSxlhvhJVhVcQ8Cq9JJJ8zaN2hEZSuQ1pTC1gK8L6W3Hk2pqlJh3XC0fE5upt5dip97lwe9jfsUxpe6UCgkqAL4SBUTZVPMOTmvd6ntTTpoBP1jTk_m9GhO_5rTcgzxKZRHOGxc-qv-J_UNsnZxlg</recordid><startdate>20240201</startdate><enddate>20240201</enddate><creator>Anand, Sanjay Kumar</creator><creator>Kumar, Suresh</creator><general>Springer London</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20240201</creationdate><title>Ontology-based soft computing and machine learning model for efficient retrieval</title><author>Anand, Sanjay Kumar ; Kumar, Suresh</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c270t-8acc3a5f80c963f05164e8e5abfd1a56c608ca45c750ff48fcf1a4964917d31c3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Bayesian analysis</topic><topic>Clustering</topic><topic>Computer Science</topic><topic>Data Mining and Knowledge Discovery</topic><topic>Database Management</topic><topic>Decision trees</topic><topic>Information retrieval</topic><topic>Information Storage and Retrieval</topic><topic>Information Systems and Communication Service</topic><topic>Information Systems Applications (incl.Internet)</topic><topic>IT in Business</topic><topic>Knowledge representation</topic><topic>Machine learning</topic><topic>Ontology</topic><topic>Performance degradation</topic><topic>Performance evaluation</topic><topic>Regular Paper</topic><topic>Semantic web</topic><topic>Soft computing</topic><topic>Statistical analysis</topic><topic>Support vector machines</topic><topic>Uncertainty</topic><topic>Unstructured data</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Anand, Sanjay Kumar</creatorcontrib><creatorcontrib>Kumar, Suresh</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Knowledge and information systems</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Anand, Sanjay Kumar</au><au>Kumar, Suresh</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Ontology-based soft computing and machine learning model for efficient retrieval</atitle><jtitle>Knowledge and information systems</jtitle><stitle>Knowl Inf Syst</stitle><date>2024-02-01</date><risdate>2024</risdate><volume>66</volume><issue>2</issue><spage>1371</spage><epage>1402</epage><pages>1371-1402</pages><issn>0219-1377</issn><eissn>0219-3116</eissn><abstract>Unstructured and unorganized data always degrade the performance of search techniques and produce irrelevant results in response to the query as well as decrease the speed of retrieval results. Ontology in semantic web (SW) provides an adequate solution to represent the knowledge, because of its backbone knowledge of an application or domain. But, domain ontology has three basic problems while retrieving useful knowledge from a domain ontology: (a) structuring/arrangement, (b) unnecessary knowledge reduction, selection and extraction, and (c) speeding up the retrieval process. To resolve these problems, we proposed multi-level k-mean clustering approach with rough set and Bayesian network model for ontology (MLK-rBO). The proposed model works in four different phases—clustering, knowledge discovery, building a probabilistic network, and model evaluation. The model ensembles three different techniques, namely clustering, rough set (RS), and Bayesian network (BN). Finally, the proposed model is tested with statistical parameters and compared with other models, namely decision tree (DT), random forest (RF), and support vector machine (SVM) to evaluate performance. By analyzing experimental results, we observed that the MLK-rBO gives better accuracy: 98.36% for survey data (fever) and 86% for Wine quality data than available models.</abstract><cop>London</cop><pub>Springer London</pub><doi>10.1007/s10115-023-01990-8</doi><tpages>32</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0219-1377 |
ispartof | Knowledge and information systems, 2024-02, Vol.66 (2), p.1371-1402 |
issn | 0219-1377 0219-3116 |
language | eng |
recordid | cdi_proquest_journals_2915809203 |
source | Springer Nature |
subjects | Bayesian analysis Clustering Computer Science Data Mining and Knowledge Discovery Database Management Decision trees Information retrieval Information Storage and Retrieval Information Systems and Communication Service Information Systems Applications (incl.Internet) IT in Business Knowledge representation Machine learning Ontology Performance degradation Performance evaluation Regular Paper Semantic web Soft computing Statistical analysis Support vector machines Uncertainty Unstructured data |
title | Ontology-based soft computing and machine learning model for efficient retrieval |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-08T17%3A26%3A42IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Ontology-based%20soft%20computing%20and%20machine%20learning%20model%20for%20efficient%20retrieval&rft.jtitle=Knowledge%20and%20information%20systems&rft.au=Anand,%20Sanjay%20Kumar&rft.date=2024-02-01&rft.volume=66&rft.issue=2&rft.spage=1371&rft.epage=1402&rft.pages=1371-1402&rft.issn=0219-1377&rft.eissn=0219-3116&rft_id=info:doi/10.1007/s10115-023-01990-8&rft_dat=%3Cproquest_cross%3E2915809203%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c270t-8acc3a5f80c963f05164e8e5abfd1a56c608ca45c750ff48fcf1a4964917d31c3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2915809203&rft_id=info:pmid/&rfr_iscdi=true |