Creating a large database test bed with typographical errors for record linkage evaluation
Evaluation of record linkage algorithms requires a large database test bed that is representative of the real-world data. We created such a large database that reflects the demographic distribution of a typical population and contains typographical errors commonly made during data entry. This database can be used with high confidence as a test bed to evaluate various record linkage algorithms.
Published in: | AMIA ... Annual Symposium proceedings 2008-11, p.1153-1153 |
---|---|
Main Authors: | Theera-Ampornpunt, Nawanan; Kijsanayotin, Boonchai; Speedie, Stuart M |
Format: | Article |
Language: | English |
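Subjects: | Databases, Factual; Forms and Records Control; Information Storage and Retrieval - methods; Medical History Taking - methods; Medical Record Linkage; Medical Records Systems, Computerized; Minnesota; Terminology as Topic; Word Processing |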
cited_by | |
---|---|
cites | |
container_end_page | 1153 |
container_issue | |
container_start_page | 1153 |
container_title | AMIA ... Annual Symposium proceedings |
container_volume | |
creator | Theera-Ampornpunt, Nawanan; Kijsanayotin, Boonchai; Speedie, Stuart M |
description | Evaluation of record linkage algorithms requires a large database test bed that is representative of the real-world data. We created such a large database that reflects the demographic distribution of a typical population and contains typographical errors commonly made during data entry. This database can be used with high confidence as a test bed to evaluate various record linkage algorithms. |
format | article |
fulltext | fulltext |
identifier | EISSN: 1559-4076 |
ispartof | AMIA ... Annual Symposium proceedings, 2008-11, p.1153-1153 |
issn | 1559-4076 |
language | eng |
recordid | cdi_proquest_miscellaneous_733872816 |
source | PubMed Central |
subjects | Databases, Factual; Forms and Records Control; Information Storage and Retrieval - methods; Medical History Taking - methods; Medical Record Linkage; Medical Records Systems, Computerized; Minnesota; Terminology as Topic; Word Processing |
title | Creating a large database test bed with typographical errors for record linkage evaluation |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-05T13%3A40%3A41IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Creating%20a%20large%20database%20test%20bed%20with%20typographical%20errors%20for%20record%20linkage%20evaluation&rft.jtitle=AMIA%20...%20Annual%20Symposium%20proceedings&rft.au=Theera-Ampornpunt,%20Nawanan&rft.date=2008-11-06&rft.spage=1153&rft.epage=1153&rft.pages=1153-1153&rft.eissn=1559-4076&rft_id=info:doi/&rft_dat=%3Cproquest_pubme%3E733872816%3C/proquest_pubme%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-p125t-3e8f088bd7702260c6d1195e728c8fe074001a9ee3de91d43beaaba1a1e122fb3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=733872816&rft_id=info:pmid/18998880&rfr_iscdi=true |