Loading…
A hybrid strategy to extract metadata from scholarly articles by utilizing support vector machine and heuristics
The immense growth in online research publications has attracted the research community to extract valuable information from scientific resources by exploring online digital libraries and publishers’ websites. The metadata stored in a machine comprehendible form can facilitate a precise search to en...
Saved in:
Published in: | Scientometrics 2023-08, Vol.128 (8), p.4349-4382 |
---|---|
Main Authors: | , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | The immense growth in online research publications has attracted the research community to extract valuable information from scientific resources by exploring online digital libraries and publishers’ websites. The metadata stored in a machine comprehendible form can facilitate a precise search to enlist most related articles by applying semantic queries to the document’s metadata and the structural elements. The online search engines and digital libraries offer only keyword-based search on full-body text, which creates excessive results. The research community in recent years has adopted different approaches to extract structural information from research documents. We have distributed the content of an article into two logical layouts and metadata levels. This strategy has given our technique an advantage over the state-of-the-art (SOTA) extracting metadata with diversified publication styles. The experimental results have revealed that the proposed approach has shown a significant gain in performance of 20.26% to 27.14%. |
---|---|
ISSN: | 0138-9130 1588-2861 |
DOI: | 10.1007/s11192-023-04774-7 |