Loading…

From Linguistic to Conceptual: A Framework Based on a Pipeline for Building Ontologies from Texts

This paper presents a novel approach to extract information for building ontologies for an extensive range of applications from corpora. Our goal is to propose a method that is independent of domains and based on a distributional analysis of semantic units to bring out all the candidate’s informativ...

Full description

Saved in:
Bibliographic Details
Published in:Journal of advanced computational intelligence and intelligent informatics 2016-11, Vol.20 (6), p.941-960
Main Authors: Benafia, Ali, Mazouzi, Smaine, Maamri, Ramdane, Sahnoun, Zaidi, Benafia, Sara
Format: Article
Language:English
Citations: Items that this one cites
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:This paper presents a novel approach to extract information for building ontologies for an extensive range of applications from corpora. Our goal is to propose a method that is independent of domains and based on a distributional analysis of semantic units to bring out all the candidate’s informative elements (concepts, entities, semantic relations, named entities etc.). This method is based on a pipeline of four main stages allows for the extraction of information from unstructured text in the form of a suite of decomposable representations (sentences in triplets, ‘argumental structure’ etc.) until a consistent final ontology is obtained. We applied the defined pipeline a repeated sampling of 100 articles randomly drawn from a text corpus (‘Le Monde’ of annual version ‘2013’). The evaluation results of the trial implementation of our system level of accuracy to be up to 74%. The results obtained indicate that the proposed methodology is quite generic and can be easily adapted to any new domain.
ISSN:1343-0130
1883-8014
DOI:10.20965/jaciii.2016.p0941