Research on data consistency detection method based on interactive matching under sampling background

Bibliographic Details
Published in: Knowledge-Based Systems, 2022-11, Vol. 255, p. 109695, Article 109695
Main Authors: Li, Fachao; Ma, Shijie; Feng, Yazhou; Jin, Chenxia
Format: Article
Language: English
Description
Summary: Multisource data are common in the era of big data. Detecting the consistency of multisource data is a basic problem in decision-making and has received wide attention in academic and applied fields. In this paper, consistency detection between big datasets is conducted as follows: (1) With if-then classification rules as the core concern of the dataset, decision trees as the means of acquiring the rules, and the interactive matching of classification rules as the carrier, a measurement model of interactive matching under the background of random sampling (SB-IMM for short) is established. (2) The performance of the SB-IMM is analyzed by combining the law of large numbers with several common UCI datasets. (3) The application of the SB-IMM in public policy-making is discussed, taking the consistency between the middle and east data of the CHFS dataset as an example. Theoretical analysis and experimental results show that the SB-IMM has good structural characteristics and interpretability, can provide theoretical support for the processing of big data, and has wide prospects for application.
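Illustrative sketch (not part of the record): the summary describes drawing random samples from two datasets, learning if-then classification rules with decision trees, and matching each dataset's rules against the other. The record does not give the SB-IMM definition, so the Python below is only a rough stand-in under stated assumptions: a depth-1 decision tree (a single threshold rule) replaces a full decision tree, the score is the plain average of the two cross-agreement rates, and all function names and the toy data are hypothetical.

```python
import random
from collections import Counter


def fit_stump(rows):
    """Learn one if-then rule (a depth-1 tree): if x[f] <= t then a else b.

    rows is a list of (features, label) pairs. This is an assumed
    simplification, not the rule-acquisition procedure of the paper.
    """
    best = None
    for f in range(len(rows[0][0])):
        for t in sorted({x[f] for x, _ in rows}):
            left = Counter(y for x, y in rows if x[f] <= t)
            right = Counter(y for x, y in rows if x[f] > t)
            left_label = left.most_common(1)[0][0]
            # If nothing falls to the right, reuse the left label.
            right_label = right.most_common(1)[0][0] if right else left_label
            errors = sum(
                1 for x, y in rows
                if (left_label if x[f] <= t else right_label) != y
            )
            if best is None or errors < best[0]:
                best = (errors, f, t, left_label, right_label)
    return best[1:]


def predict(rule, x):
    """Apply the if-then rule to one feature tuple."""
    f, t, left_label, right_label = rule
    return left_label if x[f] <= t else right_label


def match_rate(rule, rows):
    """Fraction of rows whose label agrees with the rule's prediction."""
    return sum(predict(rule, x) == y for x, y in rows) / len(rows)


def interactive_match(data_a, data_b, sample_size, rng):
    """Assumed consistency score in [0, 1] from two-way rule matching."""
    sample_a = rng.sample(data_a, sample_size)
    sample_b = rng.sample(data_b, sample_size)
    rule_a = fit_stump(sample_a)  # rule learned from the sample of A
    rule_b = fit_stump(sample_b)  # rule learned from the sample of B
    # Cross-apply: A's rule on B's sample and B's rule on A's sample.
    return 0.5 * (match_rate(rule_a, sample_b) + match_rate(rule_b, sample_a))


# Synthetic toy data (hypothetical): one numeric feature, binary label.
same_a = [((v,), int(v >= 50)) for v in range(100)]
same_b = [((v,), int(v >= 50)) for v in range(100)]
flipped = [((v,), int(v < 50)) for v in range(100)]

consistent_score = interactive_match(same_a, same_b, 40, random.Random(0))
inconsistent_score = interactive_match(same_a, flipped, 40, random.Random(0))
```

In this toy setting, two datasets that follow the same labelling rule should score close to 1 and a dataset with inverted labels close to 0. Larger samples should make the score vary less from draw to draw, which is the kind of behavior the summary analyzes with the law of large numbers.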
ISSN: 0950-7051, 1872-7409
DOI: 10.1016/j.knosys.2022.109695