Loading…
Complexities of leveraging user-generated book reviews for scholarly research: transiency, power dynamics, and cultural dependency
In the past two decades, digital libraries (DL) have increasingly supported computational studies of digitized books (Jett et al. The hathitrust research center extracted features dataset (2.0), 2020; Underwood, Distant horizons: digital evidence and literary change, University of Chicago Press, Chi...
Saved in:
Published in: | International journal on digital libraries 2024-06, Vol.25 (2), p.317-340 |
---|---|
Main Authors: | , , , , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | cdi_FETCH-LOGICAL-c319t-7e7688ccb97de3888d190a5be4d0cf85730a7b7824b02929c55349f942dce8043 |
---|---|
cites | cdi_FETCH-LOGICAL-c319t-7e7688ccb97de3888d190a5be4d0cf85730a7b7824b02929c55349f942dce8043 |
container_end_page | 340 |
container_issue | 2 |
container_start_page | 317 |
container_title | International journal on digital libraries |
container_volume | 25 |
creator | Hu, Yuerong LeBlanc, Zoe Diesner, Jana Underwood, Ted Layne-Worthey, Glen Downie, J. Stephen |
description | In the past two decades, digital libraries (DL) have increasingly supported computational studies of digitized books (Jett et al. The hathitrust research center extracted features dataset (2.0), 2020; Underwood, Distant horizons: digital evidence and literary change, University of Chicago Press, Chicago, 2019; Organisciak et al. J Assoc Inf Sci Technol 73:317–332, 2022; Michel et al. Science 331:176–182, 2011). Nonetheless, there remains a dearth of DL data provisions or infrastructures for research on book reception, and user-generated book reviews have opened up unprecedented research opportunities in this area. However, insufficient attention has been paid to real-world complexities and limitations of using these datasets in scholarly research, which may cause analytical oversights (Crawford and Finn, Geo J 80:491–502, 2015), methodological pitfalls (Olteanu et al. Front Big Data 2:13, 2019), and ethical concerns (Hu et al. Research with user-generated book review data: legal and ethical pitfalls and contextualized mitigations, Springer, Berlin, 2023; Diesner and Chin, Gratis, libre, or something else? regulations and misassumptions related to working with publicly available text data, 2016). In this paper, we present three case studies that contextually and empirically investigate book reviews for their temporal, cultural, and socio-participatory complexities: (1) a longitudinal analysis of a ranked book list across ten years and over one month; (2) a text classification of 20,000 sponsored and 20,000 non-sponsored books reviews; and (3) a comparative analysis of 537 book ratings from Anglophone and non-Anglophone readerships. Our work reflects on both (1) data curation challenges that researchers may encounter (e.g., platform providers’ lack of bibliographic control) when studying book reviews and (2) mitigations that researchers might adopt to address these challenges (e.g., how to align data from various platforms). Taken together, our findings illustrate some of the sociotechnical complexities of working with user-generated book reviews by revealing the transiency, power dynamics, and cultural dependency in these datasets. This paper explores some of the limitations and challenges of using user-generated book reviews for scholarship and calls for critical and contextualized usage of user-generated book reviews in future scholarly research. |
doi_str_mv | 10.1007/s00799-023-00376-z |
format | article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_3072355199</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3072355199</sourcerecordid><originalsourceid>FETCH-LOGICAL-c319t-7e7688ccb97de3888d190a5be4d0cf85730a7b7824b02929c55349f942dce8043</originalsourceid><addsrcrecordid>eNp9kDtPwzAUhSMEEuXxB5gssTZwbSfYZkMVLwmJBWbLsW9KILWDnbS0I7-cQJHYWO5L3zlXOll2QuGMAojzNBalcmA8B-DiIt_sZBNacJZTDrD7O5dA2X52kNIrAFBJxST7nIVF1-JH0zeYSKhJi0uMZt74ORkSxnyOftx7dKQK4Y1EXDa4SqQOkST7EloT2_V4TWiifbkkfTQ-Nejtekq6sMJI3NqbRWPTlBjviB3afoimJQ479O4bPMr2atMmPP7th9nzzfXT7C5_eLy9n1095JZT1ecCxYWU1lZKOORSSkcVmLLCwoGtZSk4GFEJyYoKmGLKliUvVK0K5ixKKPhhdrr17WJ4HzD1-jUM0Y8vNQfBeFlSpUaKbSkbQ0oRa93FZmHiWlPQ31nrbdZ6zFr_ZK03o4hvRWmE_Rzjn_U_qi-7mYTr</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3072355199</pqid></control><display><type>article</type><title>Complexities of leveraging user-generated book reviews for scholarly research: transiency, power dynamics, and cultural dependency</title><source>Springer Link</source><creator>Hu, Yuerong ; LeBlanc, Zoe ; Diesner, Jana ; Underwood, Ted ; Layne-Worthey, Glen ; Downie, J. Stephen</creator><creatorcontrib>Hu, Yuerong ; LeBlanc, Zoe ; Diesner, Jana ; Underwood, Ted ; Layne-Worthey, Glen ; Downie, J. Stephen</creatorcontrib><description>In the past two decades, digital libraries (DL) have increasingly supported computational studies of digitized books (Jett et al. The hathitrust research center extracted features dataset (2.0), 2020; Underwood, Distant horizons: digital evidence and literary change, University of Chicago Press, Chicago, 2019; Organisciak et al. J Assoc Inf Sci Technol 73:317–332, 2022; Michel et al. Science 331:176–182, 2011). Nonetheless, there remains a dearth of DL data provisions or infrastructures for research on book reception, and user-generated book reviews have opened up unprecedented research opportunities in this area. However, insufficient attention has been paid to real-world complexities and limitations of using these datasets in scholarly research, which may cause analytical oversights (Crawford and Finn, Geo J 80:491–502, 2015), methodological pitfalls (Olteanu et al. Front Big Data 2:13, 2019), and ethical concerns (Hu et al. Research with user-generated book review data: legal and ethical pitfalls and contextualized mitigations, Springer, Berlin, 2023; Diesner and Chin, Gratis, libre, or something else? regulations and misassumptions related to working with publicly available text data, 2016). In this paper, we present three case studies that contextually and empirically investigate book reviews for their temporal, cultural, and socio-participatory complexities: (1) a longitudinal analysis of a ranked book list across ten years and over one month; (2) a text classification of 20,000 sponsored and 20,000 non-sponsored books reviews; and (3) a comparative analysis of 537 book ratings from Anglophone and non-Anglophone readerships. Our work reflects on both (1) data curation challenges that researchers may encounter (e.g., platform providers’ lack of bibliographic control) when studying book reviews and (2) mitigations that researchers might adopt to address these challenges (e.g., how to align data from various platforms). Taken together, our findings illustrate some of the sociotechnical complexities of working with user-generated book reviews by revealing the transiency, power dynamics, and cultural dependency in these datasets. This paper explores some of the limitations and challenges of using user-generated book reviews for scholarship and calls for critical and contextualized usage of user-generated book reviews in future scholarly research.</description><identifier>ISSN: 1432-5012</identifier><identifier>EISSN: 1432-1300</identifier><identifier>DOI: 10.1007/s00799-023-00376-z</identifier><language>eng</language><publisher>Berlin/Heidelberg: Springer Berlin Heidelberg</publisher><subject>Anglophones ; Bibliographic control ; Big Data ; Computer Science ; Database Management ; Datasets ; E-books ; Ethics ; Information Systems and Communication Service ; Research facilities</subject><ispartof>International journal on digital libraries, 2024-06, Vol.25 (2), p.317-340</ispartof><rights>The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2023. Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c319t-7e7688ccb97de3888d190a5be4d0cf85730a7b7824b02929c55349f942dce8043</citedby><cites>FETCH-LOGICAL-c319t-7e7688ccb97de3888d190a5be4d0cf85730a7b7824b02929c55349f942dce8043</cites><orcidid>0000-0001-8375-9108 ; 0000-0001-9784-5090 ; 0000-0001-8183-7109 ; 0000-0001-8960-1846 ; 0000-0003-2785-0040 ; 0000-0003-2012-8805</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>313,314,780,784,792,27922,27924,27925</link.rule.ids></links><search><creatorcontrib>Hu, Yuerong</creatorcontrib><creatorcontrib>LeBlanc, Zoe</creatorcontrib><creatorcontrib>Diesner, Jana</creatorcontrib><creatorcontrib>Underwood, Ted</creatorcontrib><creatorcontrib>Layne-Worthey, Glen</creatorcontrib><creatorcontrib>Downie, J. Stephen</creatorcontrib><title>Complexities of leveraging user-generated book reviews for scholarly research: transiency, power dynamics, and cultural dependency</title><title>International journal on digital libraries</title><addtitle>Int J Digit Libr</addtitle><description>In the past two decades, digital libraries (DL) have increasingly supported computational studies of digitized books (Jett et al. The hathitrust research center extracted features dataset (2.0), 2020; Underwood, Distant horizons: digital evidence and literary change, University of Chicago Press, Chicago, 2019; Organisciak et al. J Assoc Inf Sci Technol 73:317–332, 2022; Michel et al. Science 331:176–182, 2011). Nonetheless, there remains a dearth of DL data provisions or infrastructures for research on book reception, and user-generated book reviews have opened up unprecedented research opportunities in this area. However, insufficient attention has been paid to real-world complexities and limitations of using these datasets in scholarly research, which may cause analytical oversights (Crawford and Finn, Geo J 80:491–502, 2015), methodological pitfalls (Olteanu et al. Front Big Data 2:13, 2019), and ethical concerns (Hu et al. Research with user-generated book review data: legal and ethical pitfalls and contextualized mitigations, Springer, Berlin, 2023; Diesner and Chin, Gratis, libre, or something else? regulations and misassumptions related to working with publicly available text data, 2016). In this paper, we present three case studies that contextually and empirically investigate book reviews for their temporal, cultural, and socio-participatory complexities: (1) a longitudinal analysis of a ranked book list across ten years and over one month; (2) a text classification of 20,000 sponsored and 20,000 non-sponsored books reviews; and (3) a comparative analysis of 537 book ratings from Anglophone and non-Anglophone readerships. Our work reflects on both (1) data curation challenges that researchers may encounter (e.g., platform providers’ lack of bibliographic control) when studying book reviews and (2) mitigations that researchers might adopt to address these challenges (e.g., how to align data from various platforms). Taken together, our findings illustrate some of the sociotechnical complexities of working with user-generated book reviews by revealing the transiency, power dynamics, and cultural dependency in these datasets. This paper explores some of the limitations and challenges of using user-generated book reviews for scholarship and calls for critical and contextualized usage of user-generated book reviews in future scholarly research.</description><subject>Anglophones</subject><subject>Bibliographic control</subject><subject>Big Data</subject><subject>Computer Science</subject><subject>Database Management</subject><subject>Datasets</subject><subject>E-books</subject><subject>Ethics</subject><subject>Information Systems and Communication Service</subject><subject>Research facilities</subject><issn>1432-5012</issn><issn>1432-1300</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNp9kDtPwzAUhSMEEuXxB5gssTZwbSfYZkMVLwmJBWbLsW9KILWDnbS0I7-cQJHYWO5L3zlXOll2QuGMAojzNBalcmA8B-DiIt_sZBNacJZTDrD7O5dA2X52kNIrAFBJxST7nIVF1-JH0zeYSKhJi0uMZt74ORkSxnyOftx7dKQK4Y1EXDa4SqQOkST7EloT2_V4TWiifbkkfTQ-Nejtekq6sMJI3NqbRWPTlBjviB3afoimJQ479O4bPMr2atMmPP7th9nzzfXT7C5_eLy9n1095JZT1ecCxYWU1lZKOORSSkcVmLLCwoGtZSk4GFEJyYoKmGLKliUvVK0K5ixKKPhhdrr17WJ4HzD1-jUM0Y8vNQfBeFlSpUaKbSkbQ0oRa93FZmHiWlPQ31nrbdZ6zFr_ZK03o4hvRWmE_Rzjn_U_qi-7mYTr</recordid><startdate>20240601</startdate><enddate>20240601</enddate><creator>Hu, Yuerong</creator><creator>LeBlanc, Zoe</creator><creator>Diesner, Jana</creator><creator>Underwood, Ted</creator><creator>Layne-Worthey, Glen</creator><creator>Downie, J. Stephen</creator><general>Springer Berlin Heidelberg</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><orcidid>https://orcid.org/0000-0001-8375-9108</orcidid><orcidid>https://orcid.org/0000-0001-9784-5090</orcidid><orcidid>https://orcid.org/0000-0001-8183-7109</orcidid><orcidid>https://orcid.org/0000-0001-8960-1846</orcidid><orcidid>https://orcid.org/0000-0003-2785-0040</orcidid><orcidid>https://orcid.org/0000-0003-2012-8805</orcidid></search><sort><creationdate>20240601</creationdate><title>Complexities of leveraging user-generated book reviews for scholarly research: transiency, power dynamics, and cultural dependency</title><author>Hu, Yuerong ; LeBlanc, Zoe ; Diesner, Jana ; Underwood, Ted ; Layne-Worthey, Glen ; Downie, J. Stephen</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c319t-7e7688ccb97de3888d190a5be4d0cf85730a7b7824b02929c55349f942dce8043</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Anglophones</topic><topic>Bibliographic control</topic><topic>Big Data</topic><topic>Computer Science</topic><topic>Database Management</topic><topic>Datasets</topic><topic>E-books</topic><topic>Ethics</topic><topic>Information Systems and Communication Service</topic><topic>Research facilities</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Hu, Yuerong</creatorcontrib><creatorcontrib>LeBlanc, Zoe</creatorcontrib><creatorcontrib>Diesner, Jana</creatorcontrib><creatorcontrib>Underwood, Ted</creatorcontrib><creatorcontrib>Layne-Worthey, Glen</creatorcontrib><creatorcontrib>Downie, J. Stephen</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>International journal on digital libraries</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Hu, Yuerong</au><au>LeBlanc, Zoe</au><au>Diesner, Jana</au><au>Underwood, Ted</au><au>Layne-Worthey, Glen</au><au>Downie, J. Stephen</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Complexities of leveraging user-generated book reviews for scholarly research: transiency, power dynamics, and cultural dependency</atitle><jtitle>International journal on digital libraries</jtitle><stitle>Int J Digit Libr</stitle><date>2024-06-01</date><risdate>2024</risdate><volume>25</volume><issue>2</issue><spage>317</spage><epage>340</epage><pages>317-340</pages><issn>1432-5012</issn><eissn>1432-1300</eissn><abstract>In the past two decades, digital libraries (DL) have increasingly supported computational studies of digitized books (Jett et al. The hathitrust research center extracted features dataset (2.0), 2020; Underwood, Distant horizons: digital evidence and literary change, University of Chicago Press, Chicago, 2019; Organisciak et al. J Assoc Inf Sci Technol 73:317–332, 2022; Michel et al. Science 331:176–182, 2011). Nonetheless, there remains a dearth of DL data provisions or infrastructures for research on book reception, and user-generated book reviews have opened up unprecedented research opportunities in this area. However, insufficient attention has been paid to real-world complexities and limitations of using these datasets in scholarly research, which may cause analytical oversights (Crawford and Finn, Geo J 80:491–502, 2015), methodological pitfalls (Olteanu et al. Front Big Data 2:13, 2019), and ethical concerns (Hu et al. Research with user-generated book review data: legal and ethical pitfalls and contextualized mitigations, Springer, Berlin, 2023; Diesner and Chin, Gratis, libre, or something else? regulations and misassumptions related to working with publicly available text data, 2016). In this paper, we present three case studies that contextually and empirically investigate book reviews for their temporal, cultural, and socio-participatory complexities: (1) a longitudinal analysis of a ranked book list across ten years and over one month; (2) a text classification of 20,000 sponsored and 20,000 non-sponsored books reviews; and (3) a comparative analysis of 537 book ratings from Anglophone and non-Anglophone readerships. Our work reflects on both (1) data curation challenges that researchers may encounter (e.g., platform providers’ lack of bibliographic control) when studying book reviews and (2) mitigations that researchers might adopt to address these challenges (e.g., how to align data from various platforms). Taken together, our findings illustrate some of the sociotechnical complexities of working with user-generated book reviews by revealing the transiency, power dynamics, and cultural dependency in these datasets. This paper explores some of the limitations and challenges of using user-generated book reviews for scholarship and calls for critical and contextualized usage of user-generated book reviews in future scholarly research.</abstract><cop>Berlin/Heidelberg</cop><pub>Springer Berlin Heidelberg</pub><doi>10.1007/s00799-023-00376-z</doi><tpages>24</tpages><orcidid>https://orcid.org/0000-0001-8375-9108</orcidid><orcidid>https://orcid.org/0000-0001-9784-5090</orcidid><orcidid>https://orcid.org/0000-0001-8183-7109</orcidid><orcidid>https://orcid.org/0000-0001-8960-1846</orcidid><orcidid>https://orcid.org/0000-0003-2785-0040</orcidid><orcidid>https://orcid.org/0000-0003-2012-8805</orcidid></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1432-5012 |
ispartof | International journal on digital libraries, 2024-06, Vol.25 (2), p.317-340 |
issn | 1432-5012 1432-1300 |
language | eng |
recordid | cdi_proquest_journals_3072355199 |
source | Springer Link |
subjects | Anglophones Bibliographic control Big Data Computer Science Database Management Datasets E-books Ethics Information Systems and Communication Service Research facilities |
title | Complexities of leveraging user-generated book reviews for scholarly research: transiency, power dynamics, and cultural dependency |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-29T16%3A12%3A29IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Complexities%20of%20leveraging%20user-generated%20book%20reviews%20for%20scholarly%20research:%20transiency,%20power%20dynamics,%20and%20cultural%20dependency&rft.jtitle=International%20journal%20on%20digital%20libraries&rft.au=Hu,%20Yuerong&rft.date=2024-06-01&rft.volume=25&rft.issue=2&rft.spage=317&rft.epage=340&rft.pages=317-340&rft.issn=1432-5012&rft.eissn=1432-1300&rft_id=info:doi/10.1007/s00799-023-00376-z&rft_dat=%3Cproquest_cross%3E3072355199%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c319t-7e7688ccb97de3888d190a5be4d0cf85730a7b7824b02929c55349f942dce8043%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=3072355199&rft_id=info:pmid/&rfr_iscdi=true |