Loading…

Early Integration Testing for Entity Reconciliation in the Context of Heterogeneous Data Sources

Entity reconciliation (ER) aims to combine data from different sources for a unified vision. The management of large volumes of data has given rise to significant challenges to the ER problem due to facts such as data becoming more unstructured, unclean, and incomplete or the existence of many datas...

Full description

Saved in:
Bibliographic Details
Published in:IEEE transactions on reliability 2018-06, Vol.67 (2), p.538-556
Main Authors: Blanco, Raquel, Enriquez, Jose G., Dominguez-Mayo, Francisco J., Escalona, M. J., Tuya, Javier
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c302t-3a60be0a6f62f0a6eddfb542d398cb797e39fefa354ac120007e0d19ba689583
cites cdi_FETCH-LOGICAL-c302t-3a60be0a6f62f0a6eddfb542d398cb797e39fefa354ac120007e0d19ba689583
container_end_page 556
container_issue 2
container_start_page 538
container_title IEEE transactions on reliability
container_volume 67
creator Blanco, Raquel
Enriquez, Jose G.
Dominguez-Mayo, Francisco J.
Escalona, M. J.
Tuya, Javier
description Entity reconciliation (ER) aims to combine data from different sources for a unified vision. The management of large volumes of data has given rise to significant challenges to the ER problem due to facts such as data becoming more unstructured, unclean, and incomplete or the existence of many datasets that store information about the same topic. Testing the applications that implement the ER problem is crucial to ensure both the correctness of the reconciliation process and the quality of the reconciled data. This paper presents an approach based on model-driven engineering that allows the creation of test models for the early integration testing of ER applications, contributing in three main aspects: the description of the elements of the proposed framework, the definition of the testing model, and the validation of the proposal through two real-world case studies. This validation verifies that the early integration testing of the ER application is capable of detecting a series of deficiencies, which a priori are not known and that will help to improve the final result that the ER application offers.
doi_str_mv 10.1109/TR.2018.2809866
format article
fullrecord <record><control><sourceid>crossref_ieee_</sourceid><recordid>TN_cdi_ieee_primary_8320955</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>8320955</ieee_id><sourcerecordid>10_1109_TR_2018_2809866</sourcerecordid><originalsourceid>FETCH-LOGICAL-c302t-3a60be0a6f62f0a6eddfb542d398cb797e39fefa354ac120007e0d19ba689583</originalsourceid><addsrcrecordid>eNo9kMFKAzEURYMoWKtrF27yA9O-JJOZZCm12kJBqLMfM5mXGqkTSSLYv3dKi6vL5d3zFoeQewYzxkDPm-2MA1MzrkCrqrogEyalKljN2SWZwHgqtOT6mtyk9DnWstRqQt6XJu4PdD1k3EWTfRhogyn7YUddiHQ5ZJ8PdIs2DNbv_WnhB5o_kC7CSP1mGhxdYcYYdjhg-En0yWRD38JPtJhuyZUz-4R355yS5nnZLFbF5vVlvXjcFFYAz4UwFXQIpnIVd2Ng37tOlrwXWtmu1jUK7dAZIUtjGQeAGqFnujOV0lKJKZmf3toYUoro2u_ov0w8tAzao5-22bZHP-3Zz0g8nAiPiP9rJThoKcUf289i9w</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Early Integration Testing for Entity Reconciliation in the Context of Heterogeneous Data Sources</title><source>IEEE Xplore (Online service)</source><creator>Blanco, Raquel ; Enriquez, Jose G. ; Dominguez-Mayo, Francisco J. ; Escalona, M. J. ; Tuya, Javier</creator><creatorcontrib>Blanco, Raquel ; Enriquez, Jose G. ; Dominguez-Mayo, Francisco J. ; Escalona, M. J. ; Tuya, Javier</creatorcontrib><description>Entity reconciliation (ER) aims to combine data from different sources for a unified vision. The management of large volumes of data has given rise to significant challenges to the ER problem due to facts such as data becoming more unstructured, unclean, and incomplete or the existence of many datasets that store information about the same topic. Testing the applications that implement the ER problem is crucial to ensure both the correctness of the reconciliation process and the quality of the reconciled data. This paper presents an approach based on model-driven engineering that allows the creation of test models for the early integration testing of ER applications, contributing in three main aspects: the description of the elements of the proposed framework, the definition of the testing model, and the validation of the proposal through two real-world case studies. This validation verifies that the early integration testing of the ER application is capable of detecting a series of deficiencies, which a priori are not known and that will help to improve the final result that the ER application offers.</description><identifier>ISSN: 0018-9529</identifier><identifier>EISSN: 1558-1721</identifier><identifier>DOI: 10.1109/TR.2018.2809866</identifier><identifier>CODEN: IERQAD</identifier><language>eng</language><publisher>IEEE</publisher><subject>Big Data ; Business ; Data models ; Early testing ; entity reconciliation ; Erbium ; heterogeneous data sources ; Information and communication technology ; model-driven engineering ; Software ; software testing ; specification-based testing ; Testing</subject><ispartof>IEEE transactions on reliability, 2018-06, Vol.67 (2), p.538-556</ispartof><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c302t-3a60be0a6f62f0a6eddfb542d398cb797e39fefa354ac120007e0d19ba689583</citedby><cites>FETCH-LOGICAL-c302t-3a60be0a6f62f0a6eddfb542d398cb797e39fefa354ac120007e0d19ba689583</cites><orcidid>0000-0003-2855-0760 ; 0000-0002-1091-934X ; 0000-0002-2631-5890</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/8320955$$EHTML$$P50$$Gieee$$Hfree_for_read</linktohtml><link.rule.ids>314,780,784,27924,27925,54796</link.rule.ids></links><search><creatorcontrib>Blanco, Raquel</creatorcontrib><creatorcontrib>Enriquez, Jose G.</creatorcontrib><creatorcontrib>Dominguez-Mayo, Francisco J.</creatorcontrib><creatorcontrib>Escalona, M. J.</creatorcontrib><creatorcontrib>Tuya, Javier</creatorcontrib><title>Early Integration Testing for Entity Reconciliation in the Context of Heterogeneous Data Sources</title><title>IEEE transactions on reliability</title><addtitle>TR</addtitle><description>Entity reconciliation (ER) aims to combine data from different sources for a unified vision. The management of large volumes of data has given rise to significant challenges to the ER problem due to facts such as data becoming more unstructured, unclean, and incomplete or the existence of many datasets that store information about the same topic. Testing the applications that implement the ER problem is crucial to ensure both the correctness of the reconciliation process and the quality of the reconciled data. This paper presents an approach based on model-driven engineering that allows the creation of test models for the early integration testing of ER applications, contributing in three main aspects: the description of the elements of the proposed framework, the definition of the testing model, and the validation of the proposal through two real-world case studies. This validation verifies that the early integration testing of the ER application is capable of detecting a series of deficiencies, which a priori are not known and that will help to improve the final result that the ER application offers.</description><subject>Big Data</subject><subject>Business</subject><subject>Data models</subject><subject>Early testing</subject><subject>entity reconciliation</subject><subject>Erbium</subject><subject>heterogeneous data sources</subject><subject>Information and communication technology</subject><subject>model-driven engineering</subject><subject>Software</subject><subject>software testing</subject><subject>specification-based testing</subject><subject>Testing</subject><issn>0018-9529</issn><issn>1558-1721</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2018</creationdate><recordtype>article</recordtype><sourceid>ESBDL</sourceid><recordid>eNo9kMFKAzEURYMoWKtrF27yA9O-JJOZZCm12kJBqLMfM5mXGqkTSSLYv3dKi6vL5d3zFoeQewYzxkDPm-2MA1MzrkCrqrogEyalKljN2SWZwHgqtOT6mtyk9DnWstRqQt6XJu4PdD1k3EWTfRhogyn7YUddiHQ5ZJ8PdIs2DNbv_WnhB5o_kC7CSP1mGhxdYcYYdjhg-En0yWRD38JPtJhuyZUz-4R355yS5nnZLFbF5vVlvXjcFFYAz4UwFXQIpnIVd2Ng37tOlrwXWtmu1jUK7dAZIUtjGQeAGqFnujOV0lKJKZmf3toYUoro2u_ov0w8tAzao5-22bZHP-3Zz0g8nAiPiP9rJThoKcUf289i9w</recordid><startdate>201806</startdate><enddate>201806</enddate><creator>Blanco, Raquel</creator><creator>Enriquez, Jose G.</creator><creator>Dominguez-Mayo, Francisco J.</creator><creator>Escalona, M. J.</creator><creator>Tuya, Javier</creator><general>IEEE</general><scope>97E</scope><scope>ESBDL</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><orcidid>https://orcid.org/0000-0003-2855-0760</orcidid><orcidid>https://orcid.org/0000-0002-1091-934X</orcidid><orcidid>https://orcid.org/0000-0002-2631-5890</orcidid></search><sort><creationdate>201806</creationdate><title>Early Integration Testing for Entity Reconciliation in the Context of Heterogeneous Data Sources</title><author>Blanco, Raquel ; Enriquez, Jose G. ; Dominguez-Mayo, Francisco J. ; Escalona, M. J. ; Tuya, Javier</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c302t-3a60be0a6f62f0a6eddfb542d398cb797e39fefa354ac120007e0d19ba689583</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2018</creationdate><topic>Big Data</topic><topic>Business</topic><topic>Data models</topic><topic>Early testing</topic><topic>entity reconciliation</topic><topic>Erbium</topic><topic>heterogeneous data sources</topic><topic>Information and communication technology</topic><topic>model-driven engineering</topic><topic>Software</topic><topic>software testing</topic><topic>specification-based testing</topic><topic>Testing</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Blanco, Raquel</creatorcontrib><creatorcontrib>Enriquez, Jose G.</creatorcontrib><creatorcontrib>Dominguez-Mayo, Francisco J.</creatorcontrib><creatorcontrib>Escalona, M. J.</creatorcontrib><creatorcontrib>Tuya, Javier</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE Xplore Open Access Journals</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE/IET Electronic Library (IEL)</collection><collection>CrossRef</collection><jtitle>IEEE transactions on reliability</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Blanco, Raquel</au><au>Enriquez, Jose G.</au><au>Dominguez-Mayo, Francisco J.</au><au>Escalona, M. J.</au><au>Tuya, Javier</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Early Integration Testing for Entity Reconciliation in the Context of Heterogeneous Data Sources</atitle><jtitle>IEEE transactions on reliability</jtitle><stitle>TR</stitle><date>2018-06</date><risdate>2018</risdate><volume>67</volume><issue>2</issue><spage>538</spage><epage>556</epage><pages>538-556</pages><issn>0018-9529</issn><eissn>1558-1721</eissn><coden>IERQAD</coden><abstract>Entity reconciliation (ER) aims to combine data from different sources for a unified vision. The management of large volumes of data has given rise to significant challenges to the ER problem due to facts such as data becoming more unstructured, unclean, and incomplete or the existence of many datasets that store information about the same topic. Testing the applications that implement the ER problem is crucial to ensure both the correctness of the reconciliation process and the quality of the reconciled data. This paper presents an approach based on model-driven engineering that allows the creation of test models for the early integration testing of ER applications, contributing in three main aspects: the description of the elements of the proposed framework, the definition of the testing model, and the validation of the proposal through two real-world case studies. This validation verifies that the early integration testing of the ER application is capable of detecting a series of deficiencies, which a priori are not known and that will help to improve the final result that the ER application offers.</abstract><pub>IEEE</pub><doi>10.1109/TR.2018.2809866</doi><tpages>19</tpages><orcidid>https://orcid.org/0000-0003-2855-0760</orcidid><orcidid>https://orcid.org/0000-0002-1091-934X</orcidid><orcidid>https://orcid.org/0000-0002-2631-5890</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 0018-9529
ispartof IEEE transactions on reliability, 2018-06, Vol.67 (2), p.538-556
issn 0018-9529
1558-1721
language eng
recordid cdi_ieee_primary_8320955
source IEEE Xplore (Online service)
subjects Big Data
Business
Data models
Early testing
entity reconciliation
Erbium
heterogeneous data sources
Information and communication technology
model-driven engineering
Software
software testing
specification-based testing
Testing
title Early Integration Testing for Entity Reconciliation in the Context of Heterogeneous Data Sources
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-25T22%3A02%3A37IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-crossref_ieee_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Early%20Integration%20Testing%20for%20Entity%20Reconciliation%20in%20the%20Context%20of%20Heterogeneous%20Data%20Sources&rft.jtitle=IEEE%20transactions%20on%20reliability&rft.au=Blanco,%20Raquel&rft.date=2018-06&rft.volume=67&rft.issue=2&rft.spage=538&rft.epage=556&rft.pages=538-556&rft.issn=0018-9529&rft.eissn=1558-1721&rft.coden=IERQAD&rft_id=info:doi/10.1109/TR.2018.2809866&rft_dat=%3Ccrossref_ieee_%3E10_1109_TR_2018_2809866%3C/crossref_ieee_%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c302t-3a60be0a6f62f0a6eddfb542d398cb797e39fefa354ac120007e0d19ba689583%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=8320955&rfr_iscdi=true