Early Integration Testing for Entity Reconciliation in the Context of Heterogeneous Data Sources
Entity reconciliation (ER) aims to combine data from different sources for a unified vision. The management of large volumes of data has given rise to significant challenges to the ER problem due to facts such as data becoming more unstructured, unclean, and incomplete or the existence of many datas...
Published in: | IEEE transactions on reliability 2018-06, Vol.67 (2), p.538-556 |
---|---|
Main Authors: | Blanco, Raquel ; Enriquez, Jose G. ; Dominguez-Mayo, Francisco J. ; Escalona, M. J. ; Tuya, Javier |
Format: | Article |
Language: | English |
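Subjects: | Big Data ; Business ; Data models ; Early testing ; entity reconciliation ; Erbium ; heterogeneous data sources ; Information and communication technology ; model-driven engineering ; Software ; software testing ; specification-based testing ; Testing |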
cited_by | cdi_FETCH-LOGICAL-c302t-3a60be0a6f62f0a6eddfb542d398cb797e39fefa354ac120007e0d19ba689583 |
---|---|
cites | cdi_FETCH-LOGICAL-c302t-3a60be0a6f62f0a6eddfb542d398cb797e39fefa354ac120007e0d19ba689583 |
container_end_page | 556 |
container_issue | 2 |
container_start_page | 538 |
container_title | IEEE transactions on reliability |
container_volume | 67 |
creator | Blanco, Raquel ; Enriquez, Jose G. ; Dominguez-Mayo, Francisco J. ; Escalona, M. J. ; Tuya, Javier |
description | Entity reconciliation (ER) aims to combine data from different sources for a unified vision. The management of large volumes of data has given rise to significant challenges to the ER problem due to facts such as data becoming more unstructured, unclean, and incomplete or the existence of many datasets that store information about the same topic. Testing the applications that implement the ER problem is crucial to ensure both the correctness of the reconciliation process and the quality of the reconciled data. This paper presents an approach based on model-driven engineering that allows the creation of test models for the early integration testing of ER applications, contributing in three main aspects: the description of the elements of the proposed framework, the definition of the testing model, and the validation of the proposal through two real-world case studies. This validation verifies that the early integration testing of the ER application is capable of detecting a series of deficiencies, which a priori are not known and that will help to improve the final result that the ER application offers. |
doi_str_mv | 10.1109/TR.2018.2809866 |
format | article |
publisher | IEEE |
eissn | 1558-1721 |
coden | IERQAD |
orcid | https://orcid.org/0000-0003-2855-0760 ; https://orcid.org/0000-0002-1091-934X ; https://orcid.org/0000-0002-2631-5890 |
tpages | 19 |
toplevel | peer_reviewed ; online_resources |
oa | free_for_read |
linktohtml | https://ieeexplore.ieee.org/document/8320955 |
collection | IEEE All-Society Periodicals Package (ASPP) 2005-present ; IEEE Xplore Open Access Journals ; IEEE All-Society Periodicals Package (ASPP) 1998-Present ; IEEE/IET Electronic Library (IEL) ; CrossRef |
fulltext | fulltext |
identifier | ISSN: 0018-9529 |
ispartof | IEEE transactions on reliability, 2018-06, Vol.67 (2), p.538-556 |
issn | 0018-9529 1558-1721 |
language | eng |
recordid | cdi_ieee_primary_8320955 |
source | IEEE Xplore (Online service) |
subjects | Big Data ; Business ; Data models ; Early testing ; entity reconciliation ; Erbium ; heterogeneous data sources ; Information and communication technology ; model-driven engineering ; Software ; software testing ; specification-based testing ; Testing |
title | Early Integration Testing for Entity Reconciliation in the Context of Heterogeneous Data Sources |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-25T22%3A02%3A37IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-crossref_ieee_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Early%20Integration%20Testing%20for%20Entity%20Reconciliation%20in%20the%20Context%20of%20Heterogeneous%20Data%20Sources&rft.jtitle=IEEE%20transactions%20on%20reliability&rft.au=Blanco,%20Raquel&rft.date=2018-06&rft.volume=67&rft.issue=2&rft.spage=538&rft.epage=556&rft.pages=538-556&rft.issn=0018-9529&rft.eissn=1558-1721&rft.coden=IERQAD&rft_id=info:doi/10.1109/TR.2018.2809866&rft_dat=%3Ccrossref_ieee_%3E10_1109_TR_2018_2809866%3C/crossref_ieee_%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c302t-3a60be0a6f62f0a6eddfb542d398cb797e39fefa354ac120007e0d19ba689583%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=8320955&rfr_iscdi=true |