Loading…

The Field-Dependent Nature of PageRank Values in Citation Networks

The value of scientific research can be easier to assess at the collective level than at the level of individual contributions. Several journal-level and article-level metrics aim to measure the importance of journals or individual manuscripts. However, many are citation-based and citation practices...

Full description

Saved in:
Bibliographic Details
Published in:bioRxiv 2023-01
Main Authors: Heil, Benjamin J, Greene, Casey S
Format: Article
Language:English
Online Access:Request full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites
container_end_page
container_issue
container_start_page
container_title bioRxiv
container_volume
creator Heil, Benjamin J
Greene, Casey S
description The value of scientific research can be easier to assess at the collective level than at the level of individual contributions. Several journal-level and article-level metrics aim to measure the importance of journals or individual manuscripts. However, many are citation-based and citation practices vary between fields. To account for these differences, scientists have devised normalization schemes to make metrics more comparable across fields. We use PageRank as an example metric and examine the extent to which field-specific citation norms drive estimated importance differences. In doing so, we recapitulate differences in journal and article PageRanks between fields. We also find that manuscripts shared between fields have different PageRanks depending on which field's citation network the metric is calculated in. We implement a degree-preserving graph shuffling algorithm to generate a null distribution of similar networks and find differences more likely attributed to field-specific preferences than citation norms. Our results suggest that while differences exist between fields' metric distributions, applying metrics in a field-aware manner rather than using normalized global metrics avoids losing important information about article preferences. They also imply that assigning a single importance value to a manuscript may not be a useful construct, as the importance of each manuscript varies by the reader's field.Competing Interest StatementThe authors have declared no competing interest.Footnotes* https://github.com/greenelab/indices
doi_str_mv 10.1101/2023.01.05.522943
format article
fullrecord <record><control><sourceid>proquest_COVID</sourceid><recordid>TN_cdi_proquest_journals_2761423125</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2761423125</sourcerecordid><originalsourceid>FETCH-LOGICAL-p1823-181bc9bfc78fd1dfc5aaf8bbff7d64b5aebf62d2a40874e7d952449b57d7b0d03</originalsourceid><addsrcrecordid>eNotjrFOwzAUAL0woMIHsFliTvB7tuNkhEABqSoIFdbKjp8hNLJD4ojfp1KZbrs7xq5AlAACblCgLAWUQpcasVHynN3tvoivexp8cU8jRU8x863Ny0Q8Bf5qP-nNxgP_sMNCM-8jb_tsc58i31L-TdNhvmBnwQ4zXf5zxd7XD7v2qdi8PD63t5tihBplATW4rnGhM3Xw4EOnrQ21cyEYXymnLblQoUerRG0UGd9oVKpx2njjhBdyxa5P3nFKP8eZvP9OyxSPyT2aChRKQC3_AOEORag</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2761423125</pqid></control><display><type>article</type><title>The Field-Dependent Nature of PageRank Values in Citation Networks</title><source>Coronavirus Research Database</source><creator>Heil, Benjamin J ; Greene, Casey S</creator><creatorcontrib>Heil, Benjamin J ; Greene, Casey S</creatorcontrib><description>The value of scientific research can be easier to assess at the collective level than at the level of individual contributions. Several journal-level and article-level metrics aim to measure the importance of journals or individual manuscripts. However, many are citation-based and citation practices vary between fields. To account for these differences, scientists have devised normalization schemes to make metrics more comparable across fields. We use PageRank as an example metric and examine the extent to which field-specific citation norms drive estimated importance differences. In doing so, we recapitulate differences in journal and article PageRanks between fields. We also find that manuscripts shared between fields have different PageRanks depending on which field's citation network the metric is calculated in. We implement a degree-preserving graph shuffling algorithm to generate a null distribution of similar networks and find differences more likely attributed to field-specific preferences than citation norms. Our results suggest that while differences exist between fields' metric distributions, applying metrics in a field-aware manner rather than using normalized global metrics avoids losing important information about article preferences. They also imply that assigning a single importance value to a manuscript may not be a useful construct, as the importance of each manuscript varies by the reader's field.Competing Interest StatementThe authors have declared no competing interest.Footnotes* https://github.com/greenelab/indices</description><identifier>DOI: 10.1101/2023.01.05.522943</identifier><language>eng</language><publisher>Cold Spring Harbor: Cold Spring Harbor Laboratory Press</publisher><ispartof>bioRxiv, 2023-01</ispartof><rights>2023. This article is published under http://creativecommons.org/licenses/by/4.0/ (“the License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.proquest.com/docview/2761423125?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>780,784,27925,38516,43895</link.rule.ids><linktorsrc>$$Uhttps://www.proquest.com/docview/2761423125?pq-origsite=primo$$EView_record_in_ProQuest$$FView_record_in_$$GProQuest$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>Heil, Benjamin J</creatorcontrib><creatorcontrib>Greene, Casey S</creatorcontrib><title>The Field-Dependent Nature of PageRank Values in Citation Networks</title><title>bioRxiv</title><description>The value of scientific research can be easier to assess at the collective level than at the level of individual contributions. Several journal-level and article-level metrics aim to measure the importance of journals or individual manuscripts. However, many are citation-based and citation practices vary between fields. To account for these differences, scientists have devised normalization schemes to make metrics more comparable across fields. We use PageRank as an example metric and examine the extent to which field-specific citation norms drive estimated importance differences. In doing so, we recapitulate differences in journal and article PageRanks between fields. We also find that manuscripts shared between fields have different PageRanks depending on which field's citation network the metric is calculated in. We implement a degree-preserving graph shuffling algorithm to generate a null distribution of similar networks and find differences more likely attributed to field-specific preferences than citation norms. Our results suggest that while differences exist between fields' metric distributions, applying metrics in a field-aware manner rather than using normalized global metrics avoids losing important information about article preferences. They also imply that assigning a single importance value to a manuscript may not be a useful construct, as the importance of each manuscript varies by the reader's field.Competing Interest StatementThe authors have declared no competing interest.Footnotes* https://github.com/greenelab/indices</description><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>COVID</sourceid><sourceid>PIMPY</sourceid><recordid>eNotjrFOwzAUAL0woMIHsFliTvB7tuNkhEABqSoIFdbKjp8hNLJD4ojfp1KZbrs7xq5AlAACblCgLAWUQpcasVHynN3tvoivexp8cU8jRU8x863Ny0Q8Bf5qP-nNxgP_sMNCM-8jb_tsc58i31L-TdNhvmBnwQ4zXf5zxd7XD7v2qdi8PD63t5tihBplATW4rnGhM3Xw4EOnrQ21cyEYXymnLblQoUerRG0UGd9oVKpx2njjhBdyxa5P3nFKP8eZvP9OyxSPyT2aChRKQC3_AOEORag</recordid><startdate>20230106</startdate><enddate>20230106</enddate><creator>Heil, Benjamin J</creator><creator>Greene, Casey S</creator><general>Cold Spring Harbor Laboratory Press</general><scope>8FE</scope><scope>8FH</scope><scope>AAFGM</scope><scope>AAMXL</scope><scope>ABOIG</scope><scope>ABUWG</scope><scope>ADZZV</scope><scope>AFKRA</scope><scope>AFLLJ</scope><scope>AFOLM</scope><scope>AGAJT</scope><scope>AQTIP</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BHPHI</scope><scope>CCPQU</scope><scope>COVID</scope><scope>DWQXO</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>LK8</scope><scope>M7P</scope><scope>PIMPY</scope><scope>PQCXX</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope></search><sort><creationdate>20230106</creationdate><title>The Field-Dependent Nature of PageRank Values in Citation Networks</title><author>Heil, Benjamin J ; Greene, Casey S</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-p1823-181bc9bfc78fd1dfc5aaf8bbff7d64b5aebf62d2a40874e7d952449b57d7b0d03</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><toplevel>online_resources</toplevel><creatorcontrib>Heil, Benjamin J</creatorcontrib><creatorcontrib>Greene, Casey S</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science Collection</collection><collection>ProQuest Central</collection><collection>ProQuest Natural Science Collection</collection><collection>ProQuest One Community College</collection><collection>Coronavirus Research Database</collection><collection>ProQuest Central</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>Biological Sciences</collection><collection>Biological Science Database</collection><collection>Publicly Available Content (ProQuest)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><jtitle>bioRxiv</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Heil, Benjamin J</au><au>Greene, Casey S</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>The Field-Dependent Nature of PageRank Values in Citation Networks</atitle><jtitle>bioRxiv</jtitle><date>2023-01-06</date><risdate>2023</risdate><abstract>The value of scientific research can be easier to assess at the collective level than at the level of individual contributions. Several journal-level and article-level metrics aim to measure the importance of journals or individual manuscripts. However, many are citation-based and citation practices vary between fields. To account for these differences, scientists have devised normalization schemes to make metrics more comparable across fields. We use PageRank as an example metric and examine the extent to which field-specific citation norms drive estimated importance differences. In doing so, we recapitulate differences in journal and article PageRanks between fields. We also find that manuscripts shared between fields have different PageRanks depending on which field's citation network the metric is calculated in. We implement a degree-preserving graph shuffling algorithm to generate a null distribution of similar networks and find differences more likely attributed to field-specific preferences than citation norms. Our results suggest that while differences exist between fields' metric distributions, applying metrics in a field-aware manner rather than using normalized global metrics avoids losing important information about article preferences. They also imply that assigning a single importance value to a manuscript may not be a useful construct, as the importance of each manuscript varies by the reader's field.Competing Interest StatementThe authors have declared no competing interest.Footnotes* https://github.com/greenelab/indices</abstract><cop>Cold Spring Harbor</cop><pub>Cold Spring Harbor Laboratory Press</pub><doi>10.1101/2023.01.05.522943</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier DOI: 10.1101/2023.01.05.522943
ispartof bioRxiv, 2023-01
issn
language eng
recordid cdi_proquest_journals_2761423125
source Coronavirus Research Database
title The Field-Dependent Nature of PageRank Values in Citation Networks
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-07T23%3A27%3A40IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_COVID&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=The%20Field-Dependent%20Nature%20of%20PageRank%20Values%20in%20Citation%20Networks&rft.jtitle=bioRxiv&rft.au=Heil,%20Benjamin%20J&rft.date=2023-01-06&rft_id=info:doi/10.1101/2023.01.05.522943&rft_dat=%3Cproquest_COVID%3E2761423125%3C/proquest_COVID%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-p1823-181bc9bfc78fd1dfc5aaf8bbff7d64b5aebf62d2a40874e7d952449b57d7b0d03%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2761423125&rft_id=info:pmid/&rfr_iscdi=true