Loading…

Predicting the popularity of books before publication using machine learning

The publishing industry is highly competitive, and predicting the popularity of books before their publication can significantly benefit publishers and authors alike. Publishers tend to put a lot of resources into all the books they choose to publish. If the book ends up being unsuccessful or does n...

Full description

Saved in:
Bibliographic Details
Main Authors: Sachdeva, Hansika, Puri, Ujjwal, Poornima, S.
Format: Conference Proceeding
Language:English
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites
container_end_page
container_issue 1
container_start_page
container_title
container_volume 3075
creator Sachdeva, Hansika
Puri, Ujjwal
Poornima, S.
description The publishing industry is highly competitive, and predicting the popularity of books before their publication can significantly benefit publishers and authors alike. Publishers tend to put a lot of resources into all the books they choose to publish. If the book ends up being unsuccessful or does not perform as well as it was supposed to, they lose the time and resources spent on that book. There is no measure that can validate the editors’ decisions about books and predict their popularity. In our paper, we propose a novel approach for predicting the popularity of books before publication using the text of books as the primary data source. We investigate the connection between two independent data sets and a book’s popularity. The first one is the various textual features of book manuscripts, and the second is the metadata of the book, like the topic, the genre, and the author. To validate the proposed approach, we have used a large data set of published books and their corresponding ratings and rankings.
doi_str_mv 10.1063/5.0217437
format conference_proceeding
fullrecord <record><control><sourceid>proquest_scita</sourceid><recordid>TN_cdi_proquest_journals_3085723965</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3085723965</sourcerecordid><originalsourceid>FETCH-LOGICAL-p637-45da744d340c432050663314674bd3381bbd613068c015d2560c9d4c11cc376c3</originalsourceid><addsrcrecordid>eNotkEtPwzAQhC0EEqVw4B9Y4oaUss76kRxRxUuqBIceuFmO7VKXNA52cuDfk9KeRtqZ3R19hNwyWDCQ-CAWUDLFUZ2RGROCFUoyeU5mADUvSo6fl-Qq5x1AWStVzcjqI3kX7BC6LzpsPe1jP7YmheGXxg1tYvzOtPGbmCZrbNpgzRBiR8d8WNgbuw2dp603qZsG1-RiY9rsb046J-vnp_XytVi9v7wtH1dFL1EVXDijOHfIwXIsQYCUiIxLxRuHWLGmcZIhyMoCE64UEmztuGXMWlTS4pzcHc_2Kf6MPg96F8fUTR81QiVUibUUU-r-mMo2DP-tdZ_C3qRfzUAfYGmhT7DwDwd5Wm8</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype><pqid>3085723965</pqid></control><display><type>conference_proceeding</type><title>Predicting the popularity of books before publication using machine learning</title><source>American Institute of Physics:Jisc Collections:Transitional Journals Agreement 2021-23 (Reading list)</source><creator>Sachdeva, Hansika ; Puri, Ujjwal ; Poornima, S.</creator><contributor>Godfrey Winster, S ; Pushpalatha, M ; Baskar, M ; Kishore Anthuvan Sahayaraj, K</contributor><creatorcontrib>Sachdeva, Hansika ; Puri, Ujjwal ; Poornima, S. ; Godfrey Winster, S ; Pushpalatha, M ; Baskar, M ; Kishore Anthuvan Sahayaraj, K</creatorcontrib><description>The publishing industry is highly competitive, and predicting the popularity of books before their publication can significantly benefit publishers and authors alike. Publishers tend to put a lot of resources into all the books they choose to publish. If the book ends up being unsuccessful or does not perform as well as it was supposed to, they lose the time and resources spent on that book. There is no measure that can validate the editors’ decisions about books and predict their popularity. In our paper, we propose a novel approach for predicting the popularity of books before publication using the text of books as the primary data source. We investigate the connection between two independent data sets and a book’s popularity. The first one is the various textual features of book manuscripts, and the second is the metadata of the book, like the topic, the genre, and the author. To validate the proposed approach, we have used a large data set of published books and their corresponding ratings and rankings.</description><identifier>ISSN: 0094-243X</identifier><identifier>EISSN: 1551-7616</identifier><identifier>DOI: 10.1063/5.0217437</identifier><identifier>CODEN: APCPCS</identifier><language>eng</language><publisher>Melville: American Institute of Physics</publisher><subject>Datasets ; Machine learning ; Performance prediction</subject><ispartof>AIP Conference Proceedings, 2024, Vol.3075 (1)</ispartof><rights>AIP Publishing LLC</rights><rights>2024 AIP Publishing LLC.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>309,310,314,780,784,789,790,23930,23931,25140,27924,27925</link.rule.ids></links><search><contributor>Godfrey Winster, S</contributor><contributor>Pushpalatha, M</contributor><contributor>Baskar, M</contributor><contributor>Kishore Anthuvan Sahayaraj, K</contributor><creatorcontrib>Sachdeva, Hansika</creatorcontrib><creatorcontrib>Puri, Ujjwal</creatorcontrib><creatorcontrib>Poornima, S.</creatorcontrib><title>Predicting the popularity of books before publication using machine learning</title><title>AIP Conference Proceedings</title><description>The publishing industry is highly competitive, and predicting the popularity of books before their publication can significantly benefit publishers and authors alike. Publishers tend to put a lot of resources into all the books they choose to publish. If the book ends up being unsuccessful or does not perform as well as it was supposed to, they lose the time and resources spent on that book. There is no measure that can validate the editors’ decisions about books and predict their popularity. In our paper, we propose a novel approach for predicting the popularity of books before publication using the text of books as the primary data source. We investigate the connection between two independent data sets and a book’s popularity. The first one is the various textual features of book manuscripts, and the second is the metadata of the book, like the topic, the genre, and the author. To validate the proposed approach, we have used a large data set of published books and their corresponding ratings and rankings.</description><subject>Datasets</subject><subject>Machine learning</subject><subject>Performance prediction</subject><issn>0094-243X</issn><issn>1551-7616</issn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2024</creationdate><recordtype>conference_proceeding</recordtype><recordid>eNotkEtPwzAQhC0EEqVw4B9Y4oaUss76kRxRxUuqBIceuFmO7VKXNA52cuDfk9KeRtqZ3R19hNwyWDCQ-CAWUDLFUZ2RGROCFUoyeU5mADUvSo6fl-Qq5x1AWStVzcjqI3kX7BC6LzpsPe1jP7YmheGXxg1tYvzOtPGbmCZrbNpgzRBiR8d8WNgbuw2dp603qZsG1-RiY9rsb046J-vnp_XytVi9v7wtH1dFL1EVXDijOHfIwXIsQYCUiIxLxRuHWLGmcZIhyMoCE64UEmztuGXMWlTS4pzcHc_2Kf6MPg96F8fUTR81QiVUibUUU-r-mMo2DP-tdZ_C3qRfzUAfYGmhT7DwDwd5Wm8</recordid><startdate>20240729</startdate><enddate>20240729</enddate><creator>Sachdeva, Hansika</creator><creator>Puri, Ujjwal</creator><creator>Poornima, S.</creator><general>American Institute of Physics</general><scope>8FD</scope><scope>H8D</scope><scope>L7M</scope></search><sort><creationdate>20240729</creationdate><title>Predicting the popularity of books before publication using machine learning</title><author>Sachdeva, Hansika ; Puri, Ujjwal ; Poornima, S.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-p637-45da744d340c432050663314674bd3381bbd613068c015d2560c9d4c11cc376c3</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Datasets</topic><topic>Machine learning</topic><topic>Performance prediction</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Sachdeva, Hansika</creatorcontrib><creatorcontrib>Puri, Ujjwal</creatorcontrib><creatorcontrib>Poornima, S.</creatorcontrib><collection>Technology Research Database</collection><collection>Aerospace Database</collection><collection>Advanced Technologies Database with Aerospace</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Sachdeva, Hansika</au><au>Puri, Ujjwal</au><au>Poornima, S.</au><au>Godfrey Winster, S</au><au>Pushpalatha, M</au><au>Baskar, M</au><au>Kishore Anthuvan Sahayaraj, K</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Predicting the popularity of books before publication using machine learning</atitle><btitle>AIP Conference Proceedings</btitle><date>2024-07-29</date><risdate>2024</risdate><volume>3075</volume><issue>1</issue><issn>0094-243X</issn><eissn>1551-7616</eissn><coden>APCPCS</coden><abstract>The publishing industry is highly competitive, and predicting the popularity of books before their publication can significantly benefit publishers and authors alike. Publishers tend to put a lot of resources into all the books they choose to publish. If the book ends up being unsuccessful or does not perform as well as it was supposed to, they lose the time and resources spent on that book. There is no measure that can validate the editors’ decisions about books and predict their popularity. In our paper, we propose a novel approach for predicting the popularity of books before publication using the text of books as the primary data source. We investigate the connection between two independent data sets and a book’s popularity. The first one is the various textual features of book manuscripts, and the second is the metadata of the book, like the topic, the genre, and the author. To validate the proposed approach, we have used a large data set of published books and their corresponding ratings and rankings.</abstract><cop>Melville</cop><pub>American Institute of Physics</pub><doi>10.1063/5.0217437</doi><tpages>19</tpages></addata></record>
fulltext fulltext
identifier ISSN: 0094-243X
ispartof AIP Conference Proceedings, 2024, Vol.3075 (1)
issn 0094-243X
1551-7616
language eng
recordid cdi_proquest_journals_3085723965
source American Institute of Physics:Jisc Collections:Transitional Journals Agreement 2021-23 (Reading list)
subjects Datasets
Machine learning
Performance prediction
title Predicting the popularity of books before publication using machine learning
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-29T05%3A07%3A33IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_scita&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Predicting%20the%20popularity%20of%20books%20before%20publication%20using%20machine%20learning&rft.btitle=AIP%20Conference%20Proceedings&rft.au=Sachdeva,%20Hansika&rft.date=2024-07-29&rft.volume=3075&rft.issue=1&rft.issn=0094-243X&rft.eissn=1551-7616&rft.coden=APCPCS&rft_id=info:doi/10.1063/5.0217437&rft_dat=%3Cproquest_scita%3E3085723965%3C/proquest_scita%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-p637-45da744d340c432050663314674bd3381bbd613068c015d2560c9d4c11cc376c3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=3085723965&rft_id=info:pmid/&rfr_iscdi=true