Loading…

Examining the quality of record linkage process using nationwide Brazilian administrative databases to build a large birth cohort

Research using linked routine population-based data collected for non-research purposes has increased in recent years because they are a rich and detailed source of information. The objective of this study is to present an approach to prepare and link data from administrative sources in a middle-inc...

Full description

Saved in:
Bibliographic Details
Published in:BMC medical informatics and decision making 2020-07, Vol.20 (1), p.173-173, Article 173
Main Authors: Almeida, Daniela, Gorender, David, Ichihara, Maria Yury, Sena, Samila, Menezes, Luan, Barbosa, George C G, Fiaccone, Rosimeire L, Paixão, Enny S, Pita, Robespierre, Barreto, Mauricio L
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c563t-7e2ee5c7ca7e33b38cee1edcdceaf5529f645783263fb8fadc0419a1443986b73
cites cdi_FETCH-LOGICAL-c563t-7e2ee5c7ca7e33b38cee1edcdceaf5529f645783263fb8fadc0419a1443986b73
container_end_page 173
container_issue 1
container_start_page 173
container_title BMC medical informatics and decision making
container_volume 20
creator Almeida, Daniela
Gorender, David
Ichihara, Maria Yury
Sena, Samila
Menezes, Luan
Barbosa, George C G
Fiaccone, Rosimeire L
Paixão, Enny S
Pita, Robespierre
Barreto, Mauricio L
description Research using linked routine population-based data collected for non-research purposes has increased in recent years because they are a rich and detailed source of information. The objective of this study is to present an approach to prepare and link data from administrative sources in a middle-income country, to estimate its quality and to identify potential sources of bias by comparing linked and non-linked individuals. We linked two administrative datasets with data covering the period 2001 to 2015, using maternal attributes (name, age, date of birth, and municipally of residence) from Brazil: live birth information system and the 100 Million Brazilian Cohort (created using administrative records from over 114 million individuals whose families applied for social assistance via the Unified Register for Social Programmes) implementing an in house developed linkage tool CIDACS-RL. We then estimated the proportion of highly probably link and examined the characteristics of missed-matches to identify any potential source of bias. A total of 27,699,891 live births were submited to linkage with maternal information recorded in the baseline of the 100 Million Brazilian Cohort dataset of those, 16,447,414 (59.4%) children were found registered in the 100 Million Brazilian Cohort dataset. The proportion of highly probably link ranged from 39.3% in 2001 to 82.1% in 2014. A substantial improvement in the linkage after the introduction of maternal date of birth attribute, in 2011, was observed. Our analyses indicated a slightly higher proportion of missing data among missed matches and a higher proportion of people living in an urban area and self-declared as Caucasian among linked pairs when compared with non-linked sets. We demonstrated that CIDACS-RL is capable of performing high quality linkage even with a limited number of common attributes, using indexation as a blocking strategy in larg e routine databases from a middle-income country. However, residual records occurred more among people under worse living conditions. The results presented in this study reinforce the need of evaluating linkage quality and when necessary to take linkage error into account for the analyses of any generated dataset.
doi_str_mv 10.1186/s12911-020-01192-0
format article
fullrecord <record><control><sourceid>gale_doaj_</sourceid><recordid>TN_cdi_doaj_primary_oai_doaj_org_article_59f6898563b64d20b86f548e69f56288</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><galeid>A631903088</galeid><doaj_id>oai_doaj_org_article_59f6898563b64d20b86f548e69f56288</doaj_id><sourcerecordid>A631903088</sourcerecordid><originalsourceid>FETCH-LOGICAL-c563t-7e2ee5c7ca7e33b38cee1edcdceaf5529f645783263fb8fadc0419a1443986b73</originalsourceid><addsrcrecordid>eNptkk1v1DAQhiMEoqXwBzggS1y4pPgjdpwLUqkKVKrEBc7WxBnvesnaWzsplBv_HO9uKV2EfLA1884zM9ZbVS8ZPWVMq7eZ8Y6xmnJaU8Y6XtNH1TFrWl6rrmkfP3gfVc9yXlHKWi3k0-pI8JYxKfhx9eviB6x98GFBpiWS6xlGP92S6EhCG9NARh--wQLJJkWLOZM5b7UBJh_Ddz8geZ_gpx89BALDlpSnVJI3SAaYoIeMmUyR9LMfBwJkhFRgvU_Tkti4jGl6Xj1xMGZ8cXefVF8_XHw5_1Rfff54eX52VVupxFS3yBGlbS20KEQvtEVkONjBIjgpeedUI8t6XAnXaweDpQ3rgDWN6LTqW3FSXe65Q4SV2SS_hnRrInizC8S0MJAmb0c0ssB0p0vfXjUDp71WTjYaVeek4loX1rs9azP36zIEhrL0eAA9zAS_NIt4Y1qhuVZNAby5A6R4PWOezNpni-MIAeOcDW94K7nUDS3S1_9IV3FOoXxVUQnJBG0V_6taQFnABxdLX7uFmjMlWEcF3c19-h9VOQOuvY0BnS_xgwK-L7Ap5pzQ3e_IqNma0OxNaIoJzc6EZjvxq4e_c1_yx3XiN1iO2IQ</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2435130762</pqid></control><display><type>article</type><title>Examining the quality of record linkage process using nationwide Brazilian administrative databases to build a large birth cohort</title><source>Publicly Available Content Database</source><source>PubMed Central (PMC)</source><creator>Almeida, Daniela ; Gorender, David ; Ichihara, Maria Yury ; Sena, Samila ; Menezes, Luan ; Barbosa, George C G ; Fiaccone, Rosimeire L ; Paixão, Enny S ; Pita, Robespierre ; Barreto, Mauricio L</creator><creatorcontrib>Almeida, Daniela ; Gorender, David ; Ichihara, Maria Yury ; Sena, Samila ; Menezes, Luan ; Barbosa, George C G ; Fiaccone, Rosimeire L ; Paixão, Enny S ; Pita, Robespierre ; Barreto, Mauricio L</creatorcontrib><description>Research using linked routine population-based data collected for non-research purposes has increased in recent years because they are a rich and detailed source of information. The objective of this study is to present an approach to prepare and link data from administrative sources in a middle-income country, to estimate its quality and to identify potential sources of bias by comparing linked and non-linked individuals. We linked two administrative datasets with data covering the period 2001 to 2015, using maternal attributes (name, age, date of birth, and municipally of residence) from Brazil: live birth information system and the 100 Million Brazilian Cohort (created using administrative records from over 114 million individuals whose families applied for social assistance via the Unified Register for Social Programmes) implementing an in house developed linkage tool CIDACS-RL. We then estimated the proportion of highly probably link and examined the characteristics of missed-matches to identify any potential source of bias. A total of 27,699,891 live births were submited to linkage with maternal information recorded in the baseline of the 100 Million Brazilian Cohort dataset of those, 16,447,414 (59.4%) children were found registered in the 100 Million Brazilian Cohort dataset. The proportion of highly probably link ranged from 39.3% in 2001 to 82.1% in 2014. A substantial improvement in the linkage after the introduction of maternal date of birth attribute, in 2011, was observed. Our analyses indicated a slightly higher proportion of missing data among missed matches and a higher proportion of people living in an urban area and self-declared as Caucasian among linked pairs when compared with non-linked sets. We demonstrated that CIDACS-RL is capable of performing high quality linkage even with a limited number of common attributes, using indexation as a blocking strategy in larg e routine databases from a middle-income country. However, residual records occurred more among people under worse living conditions. The results presented in this study reinforce the need of evaluating linkage quality and when necessary to take linkage error into account for the analyses of any generated dataset.</description><identifier>ISSN: 1472-6947</identifier><identifier>EISSN: 1472-6947</identifier><identifier>DOI: 10.1186/s12911-020-01192-0</identifier><identifier>PMID: 32711532</identifier><language>eng</language><publisher>England: BioMed Central Ltd</publisher><subject>Accuracy ; Analysis ; Bias ; Births ; Brazil ; Cohort Studies ; Databases, Factual ; Datasets ; Error analysis ; Families &amp; family life ; Female ; Health informatics ; Humans ; Identification ; Income ; Information sources ; Information systems ; Linked Data ; Living conditions ; Male ; Medical Record Linkage ; Missing data ; Names ; Parturition ; Pregnancy ; Public assistance ; Urban areas ; Variables</subject><ispartof>BMC medical informatics and decision making, 2020-07, Vol.20 (1), p.173-173, Article 173</ispartof><rights>COPYRIGHT 2020 BioMed Central Ltd.</rights><rights>2020. This work is licensed under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><rights>The Author(s) 2020</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c563t-7e2ee5c7ca7e33b38cee1edcdceaf5529f645783263fb8fadc0419a1443986b73</citedby><cites>FETCH-LOGICAL-c563t-7e2ee5c7ca7e33b38cee1edcdceaf5529f645783263fb8fadc0419a1443986b73</cites><orcidid>0000-0002-4797-908X</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC7382864/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.proquest.com/docview/2435130762?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>230,314,727,780,784,885,25753,27924,27925,37012,37013,44590,53791,53793</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/32711532$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Almeida, Daniela</creatorcontrib><creatorcontrib>Gorender, David</creatorcontrib><creatorcontrib>Ichihara, Maria Yury</creatorcontrib><creatorcontrib>Sena, Samila</creatorcontrib><creatorcontrib>Menezes, Luan</creatorcontrib><creatorcontrib>Barbosa, George C G</creatorcontrib><creatorcontrib>Fiaccone, Rosimeire L</creatorcontrib><creatorcontrib>Paixão, Enny S</creatorcontrib><creatorcontrib>Pita, Robespierre</creatorcontrib><creatorcontrib>Barreto, Mauricio L</creatorcontrib><title>Examining the quality of record linkage process using nationwide Brazilian administrative databases to build a large birth cohort</title><title>BMC medical informatics and decision making</title><addtitle>BMC Med Inform Decis Mak</addtitle><description>Research using linked routine population-based data collected for non-research purposes has increased in recent years because they are a rich and detailed source of information. The objective of this study is to present an approach to prepare and link data from administrative sources in a middle-income country, to estimate its quality and to identify potential sources of bias by comparing linked and non-linked individuals. We linked two administrative datasets with data covering the period 2001 to 2015, using maternal attributes (name, age, date of birth, and municipally of residence) from Brazil: live birth information system and the 100 Million Brazilian Cohort (created using administrative records from over 114 million individuals whose families applied for social assistance via the Unified Register for Social Programmes) implementing an in house developed linkage tool CIDACS-RL. We then estimated the proportion of highly probably link and examined the characteristics of missed-matches to identify any potential source of bias. A total of 27,699,891 live births were submited to linkage with maternal information recorded in the baseline of the 100 Million Brazilian Cohort dataset of those, 16,447,414 (59.4%) children were found registered in the 100 Million Brazilian Cohort dataset. The proportion of highly probably link ranged from 39.3% in 2001 to 82.1% in 2014. A substantial improvement in the linkage after the introduction of maternal date of birth attribute, in 2011, was observed. Our analyses indicated a slightly higher proportion of missing data among missed matches and a higher proportion of people living in an urban area and self-declared as Caucasian among linked pairs when compared with non-linked sets. We demonstrated that CIDACS-RL is capable of performing high quality linkage even with a limited number of common attributes, using indexation as a blocking strategy in larg e routine databases from a middle-income country. However, residual records occurred more among people under worse living conditions. The results presented in this study reinforce the need of evaluating linkage quality and when necessary to take linkage error into account for the analyses of any generated dataset.</description><subject>Accuracy</subject><subject>Analysis</subject><subject>Bias</subject><subject>Births</subject><subject>Brazil</subject><subject>Cohort Studies</subject><subject>Databases, Factual</subject><subject>Datasets</subject><subject>Error analysis</subject><subject>Families &amp; family life</subject><subject>Female</subject><subject>Health informatics</subject><subject>Humans</subject><subject>Identification</subject><subject>Income</subject><subject>Information sources</subject><subject>Information systems</subject><subject>Linked Data</subject><subject>Living conditions</subject><subject>Male</subject><subject>Medical Record Linkage</subject><subject>Missing data</subject><subject>Names</subject><subject>Parturition</subject><subject>Pregnancy</subject><subject>Public assistance</subject><subject>Urban areas</subject><subject>Variables</subject><issn>1472-6947</issn><issn>1472-6947</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid>PIMPY</sourceid><sourceid>DOA</sourceid><recordid>eNptkk1v1DAQhiMEoqXwBzggS1y4pPgjdpwLUqkKVKrEBc7WxBnvesnaWzsplBv_HO9uKV2EfLA1884zM9ZbVS8ZPWVMq7eZ8Y6xmnJaU8Y6XtNH1TFrWl6rrmkfP3gfVc9yXlHKWi3k0-pI8JYxKfhx9eviB6x98GFBpiWS6xlGP92S6EhCG9NARh--wQLJJkWLOZM5b7UBJh_Ddz8geZ_gpx89BALDlpSnVJI3SAaYoIeMmUyR9LMfBwJkhFRgvU_Tkti4jGl6Xj1xMGZ8cXefVF8_XHw5_1Rfff54eX52VVupxFS3yBGlbS20KEQvtEVkONjBIjgpeedUI8t6XAnXaweDpQ3rgDWN6LTqW3FSXe65Q4SV2SS_hnRrInizC8S0MJAmb0c0ssB0p0vfXjUDp71WTjYaVeek4loX1rs9azP36zIEhrL0eAA9zAS_NIt4Y1qhuVZNAby5A6R4PWOezNpni-MIAeOcDW94K7nUDS3S1_9IV3FOoXxVUQnJBG0V_6taQFnABxdLX7uFmjMlWEcF3c19-h9VOQOuvY0BnS_xgwK-L7Ap5pzQ3e_IqNma0OxNaIoJzc6EZjvxq4e_c1_yx3XiN1iO2IQ</recordid><startdate>20200725</startdate><enddate>20200725</enddate><creator>Almeida, Daniela</creator><creator>Gorender, David</creator><creator>Ichihara, Maria Yury</creator><creator>Sena, Samila</creator><creator>Menezes, Luan</creator><creator>Barbosa, George C G</creator><creator>Fiaccone, Rosimeire L</creator><creator>Paixão, Enny S</creator><creator>Pita, Robespierre</creator><creator>Barreto, Mauricio L</creator><general>BioMed Central Ltd</general><general>BioMed Central</general><general>BMC</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7QO</scope><scope>7SC</scope><scope>7X7</scope><scope>7XB</scope><scope>88C</scope><scope>88E</scope><scope>8AL</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FH</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>BHPHI</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FR3</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>K9.</scope><scope>L7M</scope><scope>LK8</scope><scope>L~C</scope><scope>L~D</scope><scope>M0N</scope><scope>M0S</scope><scope>M0T</scope><scope>M1P</scope><scope>M7P</scope><scope>P5Z</scope><scope>P62</scope><scope>P64</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>Q9U</scope><scope>7X8</scope><scope>5PM</scope><scope>DOA</scope><orcidid>https://orcid.org/0000-0002-4797-908X</orcidid></search><sort><creationdate>20200725</creationdate><title>Examining the quality of record linkage process using nationwide Brazilian administrative databases to build a large birth cohort</title><author>Almeida, Daniela ; Gorender, David ; Ichihara, Maria Yury ; Sena, Samila ; Menezes, Luan ; Barbosa, George C G ; Fiaccone, Rosimeire L ; Paixão, Enny S ; Pita, Robespierre ; Barreto, Mauricio L</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c563t-7e2ee5c7ca7e33b38cee1edcdceaf5529f645783263fb8fadc0419a1443986b73</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Accuracy</topic><topic>Analysis</topic><topic>Bias</topic><topic>Births</topic><topic>Brazil</topic><topic>Cohort Studies</topic><topic>Databases, Factual</topic><topic>Datasets</topic><topic>Error analysis</topic><topic>Families &amp; family life</topic><topic>Female</topic><topic>Health informatics</topic><topic>Humans</topic><topic>Identification</topic><topic>Income</topic><topic>Information sources</topic><topic>Information systems</topic><topic>Linked Data</topic><topic>Living conditions</topic><topic>Male</topic><topic>Medical Record Linkage</topic><topic>Missing data</topic><topic>Names</topic><topic>Parturition</topic><topic>Pregnancy</topic><topic>Public assistance</topic><topic>Urban areas</topic><topic>Variables</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Almeida, Daniela</creatorcontrib><creatorcontrib>Gorender, David</creatorcontrib><creatorcontrib>Ichihara, Maria Yury</creatorcontrib><creatorcontrib>Sena, Samila</creatorcontrib><creatorcontrib>Menezes, Luan</creatorcontrib><creatorcontrib>Barbosa, George C G</creatorcontrib><creatorcontrib>Fiaccone, Rosimeire L</creatorcontrib><creatorcontrib>Paixão, Enny S</creatorcontrib><creatorcontrib>Pita, Robespierre</creatorcontrib><creatorcontrib>Barreto, Mauricio L</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Biotechnology Research Abstracts</collection><collection>Computer and Information Systems Abstracts</collection><collection>ProQuest Health &amp; Medical Collection</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Healthcare Administration Database (Alumni)</collection><collection>Medical Database (Alumni Edition)</collection><collection>Computing Database (Alumni Edition)</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central</collection><collection>Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science Collection</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central</collection><collection>Engineering Research Database</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer Science Database</collection><collection>ProQuest Health &amp; Medical Complete (Alumni)</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>ProQuest Biological Science Collection</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>Computing Database</collection><collection>Health &amp; Medical Collection (Alumni Edition)</collection><collection>Healthcare Administration Database</collection><collection>Medical Database</collection><collection>Biological Science Database</collection><collection>Advanced Technologies &amp; Aerospace Database</collection><collection>ProQuest Advanced Technologies &amp; Aerospace Collection</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>ProQuest Central Basic</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><collection>Directory of Open Access Journals</collection><jtitle>BMC medical informatics and decision making</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Almeida, Daniela</au><au>Gorender, David</au><au>Ichihara, Maria Yury</au><au>Sena, Samila</au><au>Menezes, Luan</au><au>Barbosa, George C G</au><au>Fiaccone, Rosimeire L</au><au>Paixão, Enny S</au><au>Pita, Robespierre</au><au>Barreto, Mauricio L</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Examining the quality of record linkage process using nationwide Brazilian administrative databases to build a large birth cohort</atitle><jtitle>BMC medical informatics and decision making</jtitle><addtitle>BMC Med Inform Decis Mak</addtitle><date>2020-07-25</date><risdate>2020</risdate><volume>20</volume><issue>1</issue><spage>173</spage><epage>173</epage><pages>173-173</pages><artnum>173</artnum><issn>1472-6947</issn><eissn>1472-6947</eissn><abstract>Research using linked routine population-based data collected for non-research purposes has increased in recent years because they are a rich and detailed source of information. The objective of this study is to present an approach to prepare and link data from administrative sources in a middle-income country, to estimate its quality and to identify potential sources of bias by comparing linked and non-linked individuals. We linked two administrative datasets with data covering the period 2001 to 2015, using maternal attributes (name, age, date of birth, and municipally of residence) from Brazil: live birth information system and the 100 Million Brazilian Cohort (created using administrative records from over 114 million individuals whose families applied for social assistance via the Unified Register for Social Programmes) implementing an in house developed linkage tool CIDACS-RL. We then estimated the proportion of highly probably link and examined the characteristics of missed-matches to identify any potential source of bias. A total of 27,699,891 live births were submited to linkage with maternal information recorded in the baseline of the 100 Million Brazilian Cohort dataset of those, 16,447,414 (59.4%) children were found registered in the 100 Million Brazilian Cohort dataset. The proportion of highly probably link ranged from 39.3% in 2001 to 82.1% in 2014. A substantial improvement in the linkage after the introduction of maternal date of birth attribute, in 2011, was observed. Our analyses indicated a slightly higher proportion of missing data among missed matches and a higher proportion of people living in an urban area and self-declared as Caucasian among linked pairs when compared with non-linked sets. We demonstrated that CIDACS-RL is capable of performing high quality linkage even with a limited number of common attributes, using indexation as a blocking strategy in larg e routine databases from a middle-income country. However, residual records occurred more among people under worse living conditions. The results presented in this study reinforce the need of evaluating linkage quality and when necessary to take linkage error into account for the analyses of any generated dataset.</abstract><cop>England</cop><pub>BioMed Central Ltd</pub><pmid>32711532</pmid><doi>10.1186/s12911-020-01192-0</doi><tpages>1</tpages><orcidid>https://orcid.org/0000-0002-4797-908X</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1472-6947
ispartof BMC medical informatics and decision making, 2020-07, Vol.20 (1), p.173-173, Article 173
issn 1472-6947
1472-6947
language eng
recordid cdi_doaj_primary_oai_doaj_org_article_59f6898563b64d20b86f548e69f56288
source Publicly Available Content Database; PubMed Central (PMC)
subjects Accuracy
Analysis
Bias
Births
Brazil
Cohort Studies
Databases, Factual
Datasets
Error analysis
Families & family life
Female
Health informatics
Humans
Identification
Income
Information sources
Information systems
Linked Data
Living conditions
Male
Medical Record Linkage
Missing data
Names
Parturition
Pregnancy
Public assistance
Urban areas
Variables
title Examining the quality of record linkage process using nationwide Brazilian administrative databases to build a large birth cohort
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-04T22%3A20%3A16IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-gale_doaj_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Examining%20the%20quality%20of%20record%20linkage%20process%20using%20nationwide%20Brazilian%20administrative%20databases%20to%20build%20a%20large%20birth%20cohort&rft.jtitle=BMC%20medical%20informatics%20and%20decision%20making&rft.au=Almeida,%20Daniela&rft.date=2020-07-25&rft.volume=20&rft.issue=1&rft.spage=173&rft.epage=173&rft.pages=173-173&rft.artnum=173&rft.issn=1472-6947&rft.eissn=1472-6947&rft_id=info:doi/10.1186/s12911-020-01192-0&rft_dat=%3Cgale_doaj_%3EA631903088%3C/gale_doaj_%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c563t-7e2ee5c7ca7e33b38cee1edcdceaf5529f645783263fb8fadc0419a1443986b73%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2435130762&rft_id=info:pmid/32711532&rft_galeid=A631903088&rfr_iscdi=true