Loading…

Spanish personal name variations in national and international biomedical databases: implications for information retrieval and bibliometric studies

The study sought to investigate how Spanish names are handled by national and international databases and to identify mistakes that can undermine the usefulness of these databases for locating and retrieving works by Spanish authors. The authors sampled 172 articles published by authors from the Uni...

Full description

Saved in:
Bibliographic Details
Published in:Journal of the Medical Library Association 2002-10, Vol.90 (4), p.411-430
Main Authors: RUIZ-PEREZ, R, DELGADO LOPEZ-COZAR, E, JIMENEZ-CONTRERAS, E
Format: Article
Language:English
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites
container_end_page 430
container_issue 4
container_start_page 411
container_title Journal of the Medical Library Association
container_volume 90
creator RUIZ-PEREZ, R
DELGADO LOPEZ-COZAR, E
JIMENEZ-CONTRERAS, E
description The study sought to investigate how Spanish names are handled by national and international databases and to identify mistakes that can undermine the usefulness of these databases for locating and retrieving works by Spanish authors. The authors sampled 172 articles published by authors from the University of Granada Medical School between 1987 and 1996 and analyzed the variations in how each of their names was indexed in Science Citation Index (SCI), MEDLINE, and Indice Medico Español (IME). The number and types of variants that appeared for each author's name were recorded and compared across databases to identify inconsistencies in indexing practices. We analyzed the relationship between variability (number of variants of an author's name) and productivity (number of items the name was associated with as an author), the consequences for retrieval of information, and the most frequent indexing structures used for Spanish names. The proportion of authors who appeared under more then one name was 48.1% in SCI, 50.7% in MEDLINE, and 69.0% in IME. Productivity correlated directly with variability: more than 50% of the authors listed on five to ten items appeared under more than one name in any given database, and close to 100% of the authors listed on more than ten items appeared under two or more variants. Productivity correlated inversely with retrievability: as the number of variants for a name increased, the number of items retrieved under each variant decreased. For the most highly productive authors, the number of items retrieved under each variant tended toward one. The most frequent indexing methods varied between databases. In MEDLINE and IME, names were indexed correctly as "first surname second surname, first name initial middle name initial" (if present) in 41.7% and 49.5% of the records, respectively. However, in SCI, the most frequent method was "first surname, first name initial second name initial" (48.0% of the records) and first surname and second surname run together, first name initial (18.3%). Retrievability on the basis of author's name was poor in all three databases. Each database uses accurate indexing methods, but these methods fail to result in consistency or coherence for specific entries. The likely causes of inconsistency are: (1) use by authors of variants of their names during their publication careers, (2) lack of authority control in all three databases, (3) the use of an inappropriate indexing method for Spanish names in SC
format article
fullrecord <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_128958</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>57554970</sourcerecordid><originalsourceid>FETCH-LOGICAL-p322t-712c663a5d6bc1de172982b2ea7305f19316cabbb4bb477dea9458339fa1d2453</originalsourceid><addsrcrecordid>eNqFUU1LxDAQLaK46-pfkF70VmiSpmkFD7L4BQse1HOZfNSNtGlN2gX_hz_YWXdd9SSE5OXNmzczyV40JZwXSZmxcn-NWZ7wlKeT6CiE1zQlohDpYTQhlJUFzYpp9PHYg7NhGffGh85BEztoTbwCb2GwnQuxdUitIcbAabwPxu8YabvWaKsQahhAQjDhIrZt3yC3Mag7j0m4t19E7M3grVlt7aSVzdoDORWHYdTWhOPooIYmmJPtOYueb66f5nfJ4uH2fn61SHpG6ZAIQlWeM-A6l4poQwTFqSQ1IFjKa1IykiuQUma4hNAGyowXjJU1EE0zzmbR5ca3HyVOoYwbPDRV720L_r3qwFZ_I84uq5duVRFalOg0i863-b57G00YqtYGZZoGnOnGUAmaEyzzv5ALzrNSpCg8_d3RrpXvD0PB2VYAAR-99uCUDT86lHGKRp92iqWr</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>57554970</pqid></control><display><type>article</type><title>Spanish personal name variations in national and international biomedical databases: implications for information retrieval and bibliometric studies</title><source>Library &amp; Information Science Abstracts (LISA)</source><source>NCBI_PubMed Central(免费)</source><creator>RUIZ-PEREZ, R ; DELGADO LOPEZ-COZAR, E ; JIMENEZ-CONTRERAS, E</creator><creatorcontrib>RUIZ-PEREZ, R ; DELGADO LOPEZ-COZAR, E ; JIMENEZ-CONTRERAS, E</creatorcontrib><description>The study sought to investigate how Spanish names are handled by national and international databases and to identify mistakes that can undermine the usefulness of these databases for locating and retrieving works by Spanish authors. The authors sampled 172 articles published by authors from the University of Granada Medical School between 1987 and 1996 and analyzed the variations in how each of their names was indexed in Science Citation Index (SCI), MEDLINE, and Indice Medico Español (IME). The number and types of variants that appeared for each author's name were recorded and compared across databases to identify inconsistencies in indexing practices. We analyzed the relationship between variability (number of variants of an author's name) and productivity (number of items the name was associated with as an author), the consequences for retrieval of information, and the most frequent indexing structures used for Spanish names. The proportion of authors who appeared under more then one name was 48.1% in SCI, 50.7% in MEDLINE, and 69.0% in IME. Productivity correlated directly with variability: more than 50% of the authors listed on five to ten items appeared under more than one name in any given database, and close to 100% of the authors listed on more than ten items appeared under two or more variants. Productivity correlated inversely with retrievability: as the number of variants for a name increased, the number of items retrieved under each variant decreased. For the most highly productive authors, the number of items retrieved under each variant tended toward one. The most frequent indexing methods varied between databases. In MEDLINE and IME, names were indexed correctly as "first surname second surname, first name initial middle name initial" (if present) in 41.7% and 49.5% of the records, respectively. However, in SCI, the most frequent method was "first surname, first name initial second name initial" (48.0% of the records) and first surname and second surname run together, first name initial (18.3%). Retrievability on the basis of author's name was poor in all three databases. Each database uses accurate indexing methods, but these methods fail to result in consistency or coherence for specific entries. The likely causes of inconsistency are: (1) use by authors of variants of their names during their publication careers, (2) lack of authority control in all three databases, (3) the use of an inappropriate indexing method for Spanish names in SCI, (4) authors' inconsistent behaviors, and (5) possible editorial interventions by some journals. We offer some suggestions as to how to avert the proliferation of author name variants in the databases.</description><identifier>ISSN: 1536-5050</identifier><identifier>EISSN: 1558-9439</identifier><identifier>PMID: 12398248</identifier><language>eng</language><publisher>Chicago, IL: Medical Library Association</publisher><subject>Bibliographic catalogs and databases ; Bibliometrics ; Databases, Bibliographic - standards ; Exact sciences and technology ; Humans ; Indice Medico Espanol ; Information and communication sciences ; Information science. Documentation ; Information Storage and Retrieval - standards ; Language ; Linguistics ; Medicine ; MEDLINE ; Names ; Online databases ; Personal names ; Quality Control ; Science Citation Index ; Sciences and techniques of general use ; Search strategies ; Searching ; Secondary and tertiary information and documents ; Spanish personal names ; Specialized information sources ; Specialized information sources (paper or electronic format) ; Variations</subject><ispartof>Journal of the Medical Library Association, 2002-10, Vol.90 (4), p.411-430</ispartof><rights>2003 INIST-CNRS</rights><rights>Copyright © 2002, Medical Library Association 2002</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC128958/pdf/$$EPDF$$P50$$Gpubmedcentral$$H</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC128958/$$EHTML$$P50$$Gpubmedcentral$$H</linktohtml><link.rule.ids>230,314,727,780,784,885,34136,53791,53793</link.rule.ids><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&amp;idt=13985270$$DView record in Pascal Francis$$Hfree_for_read</backlink><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/12398248$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>RUIZ-PEREZ, R</creatorcontrib><creatorcontrib>DELGADO LOPEZ-COZAR, E</creatorcontrib><creatorcontrib>JIMENEZ-CONTRERAS, E</creatorcontrib><title>Spanish personal name variations in national and international biomedical databases: implications for information retrieval and bibliometric studies</title><title>Journal of the Medical Library Association</title><addtitle>J Med Libr Assoc</addtitle><description>The study sought to investigate how Spanish names are handled by national and international databases and to identify mistakes that can undermine the usefulness of these databases for locating and retrieving works by Spanish authors. The authors sampled 172 articles published by authors from the University of Granada Medical School between 1987 and 1996 and analyzed the variations in how each of their names was indexed in Science Citation Index (SCI), MEDLINE, and Indice Medico Español (IME). The number and types of variants that appeared for each author's name were recorded and compared across databases to identify inconsistencies in indexing practices. We analyzed the relationship between variability (number of variants of an author's name) and productivity (number of items the name was associated with as an author), the consequences for retrieval of information, and the most frequent indexing structures used for Spanish names. The proportion of authors who appeared under more then one name was 48.1% in SCI, 50.7% in MEDLINE, and 69.0% in IME. Productivity correlated directly with variability: more than 50% of the authors listed on five to ten items appeared under more than one name in any given database, and close to 100% of the authors listed on more than ten items appeared under two or more variants. Productivity correlated inversely with retrievability: as the number of variants for a name increased, the number of items retrieved under each variant decreased. For the most highly productive authors, the number of items retrieved under each variant tended toward one. The most frequent indexing methods varied between databases. In MEDLINE and IME, names were indexed correctly as "first surname second surname, first name initial middle name initial" (if present) in 41.7% and 49.5% of the records, respectively. However, in SCI, the most frequent method was "first surname, first name initial second name initial" (48.0% of the records) and first surname and second surname run together, first name initial (18.3%). Retrievability on the basis of author's name was poor in all three databases. Each database uses accurate indexing methods, but these methods fail to result in consistency or coherence for specific entries. The likely causes of inconsistency are: (1) use by authors of variants of their names during their publication careers, (2) lack of authority control in all three databases, (3) the use of an inappropriate indexing method for Spanish names in SCI, (4) authors' inconsistent behaviors, and (5) possible editorial interventions by some journals. We offer some suggestions as to how to avert the proliferation of author name variants in the databases.</description><subject>Bibliographic catalogs and databases</subject><subject>Bibliometrics</subject><subject>Databases, Bibliographic - standards</subject><subject>Exact sciences and technology</subject><subject>Humans</subject><subject>Indice Medico Espanol</subject><subject>Information and communication sciences</subject><subject>Information science. Documentation</subject><subject>Information Storage and Retrieval - standards</subject><subject>Language</subject><subject>Linguistics</subject><subject>Medicine</subject><subject>MEDLINE</subject><subject>Names</subject><subject>Online databases</subject><subject>Personal names</subject><subject>Quality Control</subject><subject>Science Citation Index</subject><subject>Sciences and techniques of general use</subject><subject>Search strategies</subject><subject>Searching</subject><subject>Secondary and tertiary information and documents</subject><subject>Spanish personal names</subject><subject>Specialized information sources</subject><subject>Specialized information sources (paper or electronic format)</subject><subject>Variations</subject><issn>1536-5050</issn><issn>1558-9439</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2002</creationdate><recordtype>article</recordtype><sourceid>F2A</sourceid><recordid>eNqFUU1LxDAQLaK46-pfkF70VmiSpmkFD7L4BQse1HOZfNSNtGlN2gX_hz_YWXdd9SSE5OXNmzczyV40JZwXSZmxcn-NWZ7wlKeT6CiE1zQlohDpYTQhlJUFzYpp9PHYg7NhGffGh85BEztoTbwCb2GwnQuxdUitIcbAabwPxu8YabvWaKsQahhAQjDhIrZt3yC3Mag7j0m4t19E7M3grVlt7aSVzdoDORWHYdTWhOPooIYmmJPtOYueb66f5nfJ4uH2fn61SHpG6ZAIQlWeM-A6l4poQwTFqSQ1IFjKa1IykiuQUma4hNAGyowXjJU1EE0zzmbR5ca3HyVOoYwbPDRV720L_r3qwFZ_I84uq5duVRFalOg0i863-b57G00YqtYGZZoGnOnGUAmaEyzzv5ALzrNSpCg8_d3RrpXvD0PB2VYAAR-99uCUDT86lHGKRp92iqWr</recordid><startdate>20021001</startdate><enddate>20021001</enddate><creator>RUIZ-PEREZ, R</creator><creator>DELGADO LOPEZ-COZAR, E</creator><creator>JIMENEZ-CONTRERAS, E</creator><general>Medical Library Association</general><scope>IQODW</scope><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>E3H</scope><scope>F2A</scope><scope>7X8</scope><scope>5PM</scope></search><sort><creationdate>20021001</creationdate><title>Spanish personal name variations in national and international biomedical databases: implications for information retrieval and bibliometric studies</title><author>RUIZ-PEREZ, R ; DELGADO LOPEZ-COZAR, E ; JIMENEZ-CONTRERAS, E</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-p322t-712c663a5d6bc1de172982b2ea7305f19316cabbb4bb477dea9458339fa1d2453</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2002</creationdate><topic>Bibliographic catalogs and databases</topic><topic>Bibliometrics</topic><topic>Databases, Bibliographic - standards</topic><topic>Exact sciences and technology</topic><topic>Humans</topic><topic>Indice Medico Espanol</topic><topic>Information and communication sciences</topic><topic>Information science. Documentation</topic><topic>Information Storage and Retrieval - standards</topic><topic>Language</topic><topic>Linguistics</topic><topic>Medicine</topic><topic>MEDLINE</topic><topic>Names</topic><topic>Online databases</topic><topic>Personal names</topic><topic>Quality Control</topic><topic>Science Citation Index</topic><topic>Sciences and techniques of general use</topic><topic>Search strategies</topic><topic>Searching</topic><topic>Secondary and tertiary information and documents</topic><topic>Spanish personal names</topic><topic>Specialized information sources</topic><topic>Specialized information sources (paper or electronic format)</topic><topic>Variations</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>RUIZ-PEREZ, R</creatorcontrib><creatorcontrib>DELGADO LOPEZ-COZAR, E</creatorcontrib><creatorcontrib>JIMENEZ-CONTRERAS, E</creatorcontrib><collection>Pascal-Francis</collection><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>Library &amp; Information Sciences Abstracts (LISA)</collection><collection>Library &amp; Information Science Abstracts (LISA)</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Journal of the Medical Library Association</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>RUIZ-PEREZ, R</au><au>DELGADO LOPEZ-COZAR, E</au><au>JIMENEZ-CONTRERAS, E</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Spanish personal name variations in national and international biomedical databases: implications for information retrieval and bibliometric studies</atitle><jtitle>Journal of the Medical Library Association</jtitle><addtitle>J Med Libr Assoc</addtitle><date>2002-10-01</date><risdate>2002</risdate><volume>90</volume><issue>4</issue><spage>411</spage><epage>430</epage><pages>411-430</pages><issn>1536-5050</issn><eissn>1558-9439</eissn><abstract>The study sought to investigate how Spanish names are handled by national and international databases and to identify mistakes that can undermine the usefulness of these databases for locating and retrieving works by Spanish authors. The authors sampled 172 articles published by authors from the University of Granada Medical School between 1987 and 1996 and analyzed the variations in how each of their names was indexed in Science Citation Index (SCI), MEDLINE, and Indice Medico Español (IME). The number and types of variants that appeared for each author's name were recorded and compared across databases to identify inconsistencies in indexing practices. We analyzed the relationship between variability (number of variants of an author's name) and productivity (number of items the name was associated with as an author), the consequences for retrieval of information, and the most frequent indexing structures used for Spanish names. The proportion of authors who appeared under more then one name was 48.1% in SCI, 50.7% in MEDLINE, and 69.0% in IME. Productivity correlated directly with variability: more than 50% of the authors listed on five to ten items appeared under more than one name in any given database, and close to 100% of the authors listed on more than ten items appeared under two or more variants. Productivity correlated inversely with retrievability: as the number of variants for a name increased, the number of items retrieved under each variant decreased. For the most highly productive authors, the number of items retrieved under each variant tended toward one. The most frequent indexing methods varied between databases. In MEDLINE and IME, names were indexed correctly as "first surname second surname, first name initial middle name initial" (if present) in 41.7% and 49.5% of the records, respectively. However, in SCI, the most frequent method was "first surname, first name initial second name initial" (48.0% of the records) and first surname and second surname run together, first name initial (18.3%). Retrievability on the basis of author's name was poor in all three databases. Each database uses accurate indexing methods, but these methods fail to result in consistency or coherence for specific entries. The likely causes of inconsistency are: (1) use by authors of variants of their names during their publication careers, (2) lack of authority control in all three databases, (3) the use of an inappropriate indexing method for Spanish names in SCI, (4) authors' inconsistent behaviors, and (5) possible editorial interventions by some journals. We offer some suggestions as to how to avert the proliferation of author name variants in the databases.</abstract><cop>Chicago, IL</cop><pub>Medical Library Association</pub><pmid>12398248</pmid><tpages>20</tpages></addata></record>
fulltext fulltext
identifier ISSN: 1536-5050
ispartof Journal of the Medical Library Association, 2002-10, Vol.90 (4), p.411-430
issn 1536-5050
1558-9439
language eng
recordid cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_128958
source Library & Information Science Abstracts (LISA); NCBI_PubMed Central(免费)
subjects Bibliographic catalogs and databases
Bibliometrics
Databases, Bibliographic - standards
Exact sciences and technology
Humans
Indice Medico Espanol
Information and communication sciences
Information science. Documentation
Information Storage and Retrieval - standards
Language
Linguistics
Medicine
MEDLINE
Names
Online databases
Personal names
Quality Control
Science Citation Index
Sciences and techniques of general use
Search strategies
Searching
Secondary and tertiary information and documents
Spanish personal names
Specialized information sources
Specialized information sources (paper or electronic format)
Variations
title Spanish personal name variations in national and international biomedical databases: implications for information retrieval and bibliometric studies
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-06T05%3A12%3A53IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Spanish%20personal%20name%20variations%20in%20national%20and%20international%20biomedical%20databases:%20implications%20for%20information%20retrieval%20and%20bibliometric%20studies&rft.jtitle=Journal%20of%20the%20Medical%20Library%20Association&rft.au=RUIZ-PEREZ,%20R&rft.date=2002-10-01&rft.volume=90&rft.issue=4&rft.spage=411&rft.epage=430&rft.pages=411-430&rft.issn=1536-5050&rft.eissn=1558-9439&rft_id=info:doi/&rft_dat=%3Cproquest_pubme%3E57554970%3C/proquest_pubme%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-p322t-712c663a5d6bc1de172982b2ea7305f19316cabbb4bb477dea9458339fa1d2453%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=57554970&rft_id=info:pmid/12398248&rfr_iscdi=true