Loading…
Examining the quality of record linkage process using nationwide Brazilian administrative databases to build a large birth cohort
Research using linked routine population-based data collected for non-research purposes has increased in recent years because they are a rich and detailed source of information. The objective of this study is to present an approach to prepare and link data from administrative sources in a middle-inc...
Saved in:
Published in: | BMC medical informatics and decision making 2020-07, Vol.20 (1), p.173-173, Article 173 |
---|---|
Main Authors: | , , , , , , , , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | cdi_FETCH-LOGICAL-c563t-7e2ee5c7ca7e33b38cee1edcdceaf5529f645783263fb8fadc0419a1443986b73 |
---|---|
cites | cdi_FETCH-LOGICAL-c563t-7e2ee5c7ca7e33b38cee1edcdceaf5529f645783263fb8fadc0419a1443986b73 |
container_end_page | 173 |
container_issue | 1 |
container_start_page | 173 |
container_title | BMC medical informatics and decision making |
container_volume | 20 |
creator | Almeida, Daniela Gorender, David Ichihara, Maria Yury Sena, Samila Menezes, Luan Barbosa, George C G Fiaccone, Rosimeire L Paixão, Enny S Pita, Robespierre Barreto, Mauricio L |
description | Research using linked routine population-based data collected for non-research purposes has increased in recent years because they are a rich and detailed source of information. The objective of this study is to present an approach to prepare and link data from administrative sources in a middle-income country, to estimate its quality and to identify potential sources of bias by comparing linked and non-linked individuals.
We linked two administrative datasets with data covering the period 2001 to 2015, using maternal attributes (name, age, date of birth, and municipally of residence) from Brazil: live birth information system and the 100 Million Brazilian Cohort (created using administrative records from over 114 million individuals whose families applied for social assistance via the Unified Register for Social Programmes) implementing an in house developed linkage tool CIDACS-RL. We then estimated the proportion of highly probably link and examined the characteristics of missed-matches to identify any potential source of bias.
A total of 27,699,891 live births were submited to linkage with maternal information recorded in the baseline of the 100 Million Brazilian Cohort dataset of those, 16,447,414 (59.4%) children were found registered in the 100 Million Brazilian Cohort dataset. The proportion of highly probably link ranged from 39.3% in 2001 to 82.1% in 2014. A substantial improvement in the linkage after the introduction of maternal date of birth attribute, in 2011, was observed. Our analyses indicated a slightly higher proportion of missing data among missed matches and a higher proportion of people living in an urban area and self-declared as Caucasian among linked pairs when compared with non-linked sets.
We demonstrated that CIDACS-RL is capable of performing high quality linkage even with a limited number of common attributes, using indexation as a blocking strategy in larg e routine databases from a middle-income country. However, residual records occurred more among people under worse living conditions. The results presented in this study reinforce the need of evaluating linkage quality and when necessary to take linkage error into account for the analyses of any generated dataset. |
doi_str_mv | 10.1186/s12911-020-01192-0 |
format | article |
fullrecord | <record><control><sourceid>gale_doaj_</sourceid><recordid>TN_cdi_doaj_primary_oai_doaj_org_article_59f6898563b64d20b86f548e69f56288</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><galeid>A631903088</galeid><doaj_id>oai_doaj_org_article_59f6898563b64d20b86f548e69f56288</doaj_id><sourcerecordid>A631903088</sourcerecordid><originalsourceid>FETCH-LOGICAL-c563t-7e2ee5c7ca7e33b38cee1edcdceaf5529f645783263fb8fadc0419a1443986b73</originalsourceid><addsrcrecordid>eNptkk1v1DAQhiMEoqXwBzggS1y4pPgjdpwLUqkKVKrEBc7WxBnvesnaWzsplBv_HO9uKV2EfLA1884zM9ZbVS8ZPWVMq7eZ8Y6xmnJaU8Y6XtNH1TFrWl6rrmkfP3gfVc9yXlHKWi3k0-pI8JYxKfhx9eviB6x98GFBpiWS6xlGP92S6EhCG9NARh--wQLJJkWLOZM5b7UBJh_Ddz8geZ_gpx89BALDlpSnVJI3SAaYoIeMmUyR9LMfBwJkhFRgvU_Tkti4jGl6Xj1xMGZ8cXefVF8_XHw5_1Rfff54eX52VVupxFS3yBGlbS20KEQvtEVkONjBIjgpeedUI8t6XAnXaweDpQ3rgDWN6LTqW3FSXe65Q4SV2SS_hnRrInizC8S0MJAmb0c0ssB0p0vfXjUDp71WTjYaVeek4loX1rs9azP36zIEhrL0eAA9zAS_NIt4Y1qhuVZNAby5A6R4PWOezNpni-MIAeOcDW94K7nUDS3S1_9IV3FOoXxVUQnJBG0V_6taQFnABxdLX7uFmjMlWEcF3c19-h9VOQOuvY0BnS_xgwK-L7Ap5pzQ3e_IqNma0OxNaIoJzc6EZjvxq4e_c1_yx3XiN1iO2IQ</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2435130762</pqid></control><display><type>article</type><title>Examining the quality of record linkage process using nationwide Brazilian administrative databases to build a large birth cohort</title><source>Publicly Available Content Database</source><source>PubMed Central (PMC)</source><creator>Almeida, Daniela ; Gorender, David ; Ichihara, Maria Yury ; Sena, Samila ; Menezes, Luan ; Barbosa, George C G ; Fiaccone, Rosimeire L ; Paixão, Enny S ; Pita, Robespierre ; Barreto, Mauricio L</creator><creatorcontrib>Almeida, Daniela ; Gorender, David ; Ichihara, Maria Yury ; Sena, Samila ; Menezes, Luan ; Barbosa, George C G ; Fiaccone, Rosimeire L ; Paixão, Enny S ; Pita, Robespierre ; Barreto, Mauricio L</creatorcontrib><description>Research using linked routine population-based data collected for non-research purposes has increased in recent years because they are a rich and detailed source of information. The objective of this study is to present an approach to prepare and link data from administrative sources in a middle-income country, to estimate its quality and to identify potential sources of bias by comparing linked and non-linked individuals.
We linked two administrative datasets with data covering the period 2001 to 2015, using maternal attributes (name, age, date of birth, and municipally of residence) from Brazil: live birth information system and the 100 Million Brazilian Cohort (created using administrative records from over 114 million individuals whose families applied for social assistance via the Unified Register for Social Programmes) implementing an in house developed linkage tool CIDACS-RL. We then estimated the proportion of highly probably link and examined the characteristics of missed-matches to identify any potential source of bias.
A total of 27,699,891 live births were submited to linkage with maternal information recorded in the baseline of the 100 Million Brazilian Cohort dataset of those, 16,447,414 (59.4%) children were found registered in the 100 Million Brazilian Cohort dataset. The proportion of highly probably link ranged from 39.3% in 2001 to 82.1% in 2014. A substantial improvement in the linkage after the introduction of maternal date of birth attribute, in 2011, was observed. Our analyses indicated a slightly higher proportion of missing data among missed matches and a higher proportion of people living in an urban area and self-declared as Caucasian among linked pairs when compared with non-linked sets.
We demonstrated that CIDACS-RL is capable of performing high quality linkage even with a limited number of common attributes, using indexation as a blocking strategy in larg e routine databases from a middle-income country. However, residual records occurred more among people under worse living conditions. The results presented in this study reinforce the need of evaluating linkage quality and when necessary to take linkage error into account for the analyses of any generated dataset.</description><identifier>ISSN: 1472-6947</identifier><identifier>EISSN: 1472-6947</identifier><identifier>DOI: 10.1186/s12911-020-01192-0</identifier><identifier>PMID: 32711532</identifier><language>eng</language><publisher>England: BioMed Central Ltd</publisher><subject>Accuracy ; Analysis ; Bias ; Births ; Brazil ; Cohort Studies ; Databases, Factual ; Datasets ; Error analysis ; Families & family life ; Female ; Health informatics ; Humans ; Identification ; Income ; Information sources ; Information systems ; Linked Data ; Living conditions ; Male ; Medical Record Linkage ; Missing data ; Names ; Parturition ; Pregnancy ; Public assistance ; Urban areas ; Variables</subject><ispartof>BMC medical informatics and decision making, 2020-07, Vol.20 (1), p.173-173, Article 173</ispartof><rights>COPYRIGHT 2020 BioMed Central Ltd.</rights><rights>2020. This work is licensed under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><rights>The Author(s) 2020</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c563t-7e2ee5c7ca7e33b38cee1edcdceaf5529f645783263fb8fadc0419a1443986b73</citedby><cites>FETCH-LOGICAL-c563t-7e2ee5c7ca7e33b38cee1edcdceaf5529f645783263fb8fadc0419a1443986b73</cites><orcidid>0000-0002-4797-908X</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC7382864/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.proquest.com/docview/2435130762?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>230,314,727,780,784,885,25753,27924,27925,37012,37013,44590,53791,53793</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/32711532$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Almeida, Daniela</creatorcontrib><creatorcontrib>Gorender, David</creatorcontrib><creatorcontrib>Ichihara, Maria Yury</creatorcontrib><creatorcontrib>Sena, Samila</creatorcontrib><creatorcontrib>Menezes, Luan</creatorcontrib><creatorcontrib>Barbosa, George C G</creatorcontrib><creatorcontrib>Fiaccone, Rosimeire L</creatorcontrib><creatorcontrib>Paixão, Enny S</creatorcontrib><creatorcontrib>Pita, Robespierre</creatorcontrib><creatorcontrib>Barreto, Mauricio L</creatorcontrib><title>Examining the quality of record linkage process using nationwide Brazilian administrative databases to build a large birth cohort</title><title>BMC medical informatics and decision making</title><addtitle>BMC Med Inform Decis Mak</addtitle><description>Research using linked routine population-based data collected for non-research purposes has increased in recent years because they are a rich and detailed source of information. The objective of this study is to present an approach to prepare and link data from administrative sources in a middle-income country, to estimate its quality and to identify potential sources of bias by comparing linked and non-linked individuals.
We linked two administrative datasets with data covering the period 2001 to 2015, using maternal attributes (name, age, date of birth, and municipally of residence) from Brazil: live birth information system and the 100 Million Brazilian Cohort (created using administrative records from over 114 million individuals whose families applied for social assistance via the Unified Register for Social Programmes) implementing an in house developed linkage tool CIDACS-RL. We then estimated the proportion of highly probably link and examined the characteristics of missed-matches to identify any potential source of bias.
A total of 27,699,891 live births were submited to linkage with maternal information recorded in the baseline of the 100 Million Brazilian Cohort dataset of those, 16,447,414 (59.4%) children were found registered in the 100 Million Brazilian Cohort dataset. The proportion of highly probably link ranged from 39.3% in 2001 to 82.1% in 2014. A substantial improvement in the linkage after the introduction of maternal date of birth attribute, in 2011, was observed. Our analyses indicated a slightly higher proportion of missing data among missed matches and a higher proportion of people living in an urban area and self-declared as Caucasian among linked pairs when compared with non-linked sets.
We demonstrated that CIDACS-RL is capable of performing high quality linkage even with a limited number of common attributes, using indexation as a blocking strategy in larg e routine databases from a middle-income country. However, residual records occurred more among people under worse living conditions. The results presented in this study reinforce the need of evaluating linkage quality and when necessary to take linkage error into account for the analyses of any generated dataset.</description><subject>Accuracy</subject><subject>Analysis</subject><subject>Bias</subject><subject>Births</subject><subject>Brazil</subject><subject>Cohort Studies</subject><subject>Databases, Factual</subject><subject>Datasets</subject><subject>Error analysis</subject><subject>Families & family life</subject><subject>Female</subject><subject>Health informatics</subject><subject>Humans</subject><subject>Identification</subject><subject>Income</subject><subject>Information sources</subject><subject>Information systems</subject><subject>Linked Data</subject><subject>Living conditions</subject><subject>Male</subject><subject>Medical Record Linkage</subject><subject>Missing data</subject><subject>Names</subject><subject>Parturition</subject><subject>Pregnancy</subject><subject>Public assistance</subject><subject>Urban areas</subject><subject>Variables</subject><issn>1472-6947</issn><issn>1472-6947</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid>PIMPY</sourceid><sourceid>DOA</sourceid><recordid>eNptkk1v1DAQhiMEoqXwBzggS1y4pPgjdpwLUqkKVKrEBc7WxBnvesnaWzsplBv_HO9uKV2EfLA1884zM9ZbVS8ZPWVMq7eZ8Y6xmnJaU8Y6XtNH1TFrWl6rrmkfP3gfVc9yXlHKWi3k0-pI8JYxKfhx9eviB6x98GFBpiWS6xlGP92S6EhCG9NARh--wQLJJkWLOZM5b7UBJh_Ddz8geZ_gpx89BALDlpSnVJI3SAaYoIeMmUyR9LMfBwJkhFRgvU_Tkti4jGl6Xj1xMGZ8cXefVF8_XHw5_1Rfff54eX52VVupxFS3yBGlbS20KEQvtEVkONjBIjgpeedUI8t6XAnXaweDpQ3rgDWN6LTqW3FSXe65Q4SV2SS_hnRrInizC8S0MJAmb0c0ssB0p0vfXjUDp71WTjYaVeek4loX1rs9azP36zIEhrL0eAA9zAS_NIt4Y1qhuVZNAby5A6R4PWOezNpni-MIAeOcDW94K7nUDS3S1_9IV3FOoXxVUQnJBG0V_6taQFnABxdLX7uFmjMlWEcF3c19-h9VOQOuvY0BnS_xgwK-L7Ap5pzQ3e_IqNma0OxNaIoJzc6EZjvxq4e_c1_yx3XiN1iO2IQ</recordid><startdate>20200725</startdate><enddate>20200725</enddate><creator>Almeida, Daniela</creator><creator>Gorender, David</creator><creator>Ichihara, Maria Yury</creator><creator>Sena, Samila</creator><creator>Menezes, Luan</creator><creator>Barbosa, George C G</creator><creator>Fiaccone, Rosimeire L</creator><creator>Paixão, Enny S</creator><creator>Pita, Robespierre</creator><creator>Barreto, Mauricio L</creator><general>BioMed Central Ltd</general><general>BioMed Central</general><general>BMC</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7QO</scope><scope>7SC</scope><scope>7X7</scope><scope>7XB</scope><scope>88C</scope><scope>88E</scope><scope>8AL</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FH</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>BHPHI</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FR3</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>K9.</scope><scope>L7M</scope><scope>LK8</scope><scope>L~C</scope><scope>L~D</scope><scope>M0N</scope><scope>M0S</scope><scope>M0T</scope><scope>M1P</scope><scope>M7P</scope><scope>P5Z</scope><scope>P62</scope><scope>P64</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>Q9U</scope><scope>7X8</scope><scope>5PM</scope><scope>DOA</scope><orcidid>https://orcid.org/0000-0002-4797-908X</orcidid></search><sort><creationdate>20200725</creationdate><title>Examining the quality of record linkage process using nationwide Brazilian administrative databases to build a large birth cohort</title><author>Almeida, Daniela ; Gorender, David ; Ichihara, Maria Yury ; Sena, Samila ; Menezes, Luan ; Barbosa, George C G ; Fiaccone, Rosimeire L ; Paixão, Enny S ; Pita, Robespierre ; Barreto, Mauricio L</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c563t-7e2ee5c7ca7e33b38cee1edcdceaf5529f645783263fb8fadc0419a1443986b73</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Accuracy</topic><topic>Analysis</topic><topic>Bias</topic><topic>Births</topic><topic>Brazil</topic><topic>Cohort Studies</topic><topic>Databases, Factual</topic><topic>Datasets</topic><topic>Error analysis</topic><topic>Families & family life</topic><topic>Female</topic><topic>Health informatics</topic><topic>Humans</topic><topic>Identification</topic><topic>Income</topic><topic>Information sources</topic><topic>Information systems</topic><topic>Linked Data</topic><topic>Living conditions</topic><topic>Male</topic><topic>Medical Record Linkage</topic><topic>Missing data</topic><topic>Names</topic><topic>Parturition</topic><topic>Pregnancy</topic><topic>Public assistance</topic><topic>Urban areas</topic><topic>Variables</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Almeida, Daniela</creatorcontrib><creatorcontrib>Gorender, David</creatorcontrib><creatorcontrib>Ichihara, Maria Yury</creatorcontrib><creatorcontrib>Sena, Samila</creatorcontrib><creatorcontrib>Menezes, Luan</creatorcontrib><creatorcontrib>Barbosa, George C G</creatorcontrib><creatorcontrib>Fiaccone, Rosimeire L</creatorcontrib><creatorcontrib>Paixão, Enny S</creatorcontrib><creatorcontrib>Pita, Robespierre</creatorcontrib><creatorcontrib>Barreto, Mauricio L</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Biotechnology Research Abstracts</collection><collection>Computer and Information Systems Abstracts</collection><collection>ProQuest Health & Medical Collection</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Healthcare Administration Database (Alumni)</collection><collection>Medical Database (Alumni Edition)</collection><collection>Computing Database (Alumni Edition)</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central</collection><collection>Advanced Technologies & Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science Collection</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central</collection><collection>Engineering Research Database</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer Science Database</collection><collection>ProQuest Health & Medical Complete (Alumni)</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>ProQuest Biological Science Collection</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>Computing Database</collection><collection>Health & Medical Collection (Alumni Edition)</collection><collection>Healthcare Administration Database</collection><collection>Medical Database</collection><collection>Biological Science Database</collection><collection>Advanced Technologies & Aerospace Database</collection><collection>ProQuest Advanced Technologies & Aerospace Collection</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>ProQuest Central Basic</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><collection>Directory of Open Access Journals</collection><jtitle>BMC medical informatics and decision making</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Almeida, Daniela</au><au>Gorender, David</au><au>Ichihara, Maria Yury</au><au>Sena, Samila</au><au>Menezes, Luan</au><au>Barbosa, George C G</au><au>Fiaccone, Rosimeire L</au><au>Paixão, Enny S</au><au>Pita, Robespierre</au><au>Barreto, Mauricio L</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Examining the quality of record linkage process using nationwide Brazilian administrative databases to build a large birth cohort</atitle><jtitle>BMC medical informatics and decision making</jtitle><addtitle>BMC Med Inform Decis Mak</addtitle><date>2020-07-25</date><risdate>2020</risdate><volume>20</volume><issue>1</issue><spage>173</spage><epage>173</epage><pages>173-173</pages><artnum>173</artnum><issn>1472-6947</issn><eissn>1472-6947</eissn><abstract>Research using linked routine population-based data collected for non-research purposes has increased in recent years because they are a rich and detailed source of information. The objective of this study is to present an approach to prepare and link data from administrative sources in a middle-income country, to estimate its quality and to identify potential sources of bias by comparing linked and non-linked individuals.
We linked two administrative datasets with data covering the period 2001 to 2015, using maternal attributes (name, age, date of birth, and municipally of residence) from Brazil: live birth information system and the 100 Million Brazilian Cohort (created using administrative records from over 114 million individuals whose families applied for social assistance via the Unified Register for Social Programmes) implementing an in house developed linkage tool CIDACS-RL. We then estimated the proportion of highly probably link and examined the characteristics of missed-matches to identify any potential source of bias.
A total of 27,699,891 live births were submited to linkage with maternal information recorded in the baseline of the 100 Million Brazilian Cohort dataset of those, 16,447,414 (59.4%) children were found registered in the 100 Million Brazilian Cohort dataset. The proportion of highly probably link ranged from 39.3% in 2001 to 82.1% in 2014. A substantial improvement in the linkage after the introduction of maternal date of birth attribute, in 2011, was observed. Our analyses indicated a slightly higher proportion of missing data among missed matches and a higher proportion of people living in an urban area and self-declared as Caucasian among linked pairs when compared with non-linked sets.
We demonstrated that CIDACS-RL is capable of performing high quality linkage even with a limited number of common attributes, using indexation as a blocking strategy in larg e routine databases from a middle-income country. However, residual records occurred more among people under worse living conditions. The results presented in this study reinforce the need of evaluating linkage quality and when necessary to take linkage error into account for the analyses of any generated dataset.</abstract><cop>England</cop><pub>BioMed Central Ltd</pub><pmid>32711532</pmid><doi>10.1186/s12911-020-01192-0</doi><tpages>1</tpages><orcidid>https://orcid.org/0000-0002-4797-908X</orcidid><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1472-6947 |
ispartof | BMC medical informatics and decision making, 2020-07, Vol.20 (1), p.173-173, Article 173 |
issn | 1472-6947 1472-6947 |
language | eng |
recordid | cdi_doaj_primary_oai_doaj_org_article_59f6898563b64d20b86f548e69f56288 |
source | Publicly Available Content Database; PubMed Central (PMC) |
subjects | Accuracy Analysis Bias Births Brazil Cohort Studies Databases, Factual Datasets Error analysis Families & family life Female Health informatics Humans Identification Income Information sources Information systems Linked Data Living conditions Male Medical Record Linkage Missing data Names Parturition Pregnancy Public assistance Urban areas Variables |
title | Examining the quality of record linkage process using nationwide Brazilian administrative databases to build a large birth cohort |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-04T22%3A20%3A16IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-gale_doaj_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Examining%20the%20quality%20of%20record%20linkage%20process%20using%20nationwide%20Brazilian%20administrative%20databases%20to%20build%20a%20large%20birth%20cohort&rft.jtitle=BMC%20medical%20informatics%20and%20decision%20making&rft.au=Almeida,%20Daniela&rft.date=2020-07-25&rft.volume=20&rft.issue=1&rft.spage=173&rft.epage=173&rft.pages=173-173&rft.artnum=173&rft.issn=1472-6947&rft.eissn=1472-6947&rft_id=info:doi/10.1186/s12911-020-01192-0&rft_dat=%3Cgale_doaj_%3EA631903088%3C/gale_doaj_%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c563t-7e2ee5c7ca7e33b38cee1edcdceaf5529f645783263fb8fadc0419a1443986b73%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2435130762&rft_id=info:pmid/32711532&rft_galeid=A631903088&rfr_iscdi=true |