Loading…

Predicting the popularity of books before publication using machine learning

The publishing industry is highly competitive, and predicting the popularity of books before their publication can significantly benefit publishers and authors alike. Publishers tend to put a lot of resources into all the books they choose to publish. If the book ends up being unsuccessful or does n...

Full description

Saved in:

Bibliographic Details
Main Authors:	Sachdeva, Hansika, Puri, Ujjwal, Poornima, S.
Format:	Conference Proceeding
Language:	English
Subjects:	Datasets Machine learning Performance prediction
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

cited_by
cites
container_end_page
container_issue	1
container_start_page
container_title
container_volume	3075
creator	Sachdeva, Hansika Puri, Ujjwal Poornima, S.
description	The publishing industry is highly competitive, and predicting the popularity of books before their publication can significantly benefit publishers and authors alike. Publishers tend to put a lot of resources into all the books they choose to publish. If the book ends up being unsuccessful or does not perform as well as it was supposed to, they lose the time and resources spent on that book. There is no measure that can validate the editors’ decisions about books and predict their popularity. In our paper, we propose a novel approach for predicting the popularity of books before publication using the text of books as the primary data source. We investigate the connection between two independent data sets and a book’s popularity. The first one is the various textual features of book manuscripts, and the second is the metadata of the book, like the topic, the genre, and the author. To validate the proposed approach, we have used a large data set of published books and their corresponding ratings and rankings.
doi_str_mv	10.1063/5.0217437
format	conference_proceeding
fullrecord	<record><control><sourceid>proquest_scita</sourceid><recordid>TN_cdi_proquest_journals_3085723965</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3085723965</sourcerecordid><originalsourceid>FETCH-LOGICAL-p637-45da744d340c432050663314674bd3381bbd613068c015d2560c9d4c11cc376c3</originalsourceid><addsrcrecordid>eNotkEtPwzAQhC0EEqVw4B9Y4oaUss76kRxRxUuqBIceuFmO7VKXNA52cuDfk9KeRtqZ3R19hNwyWDCQ-CAWUDLFUZ2RGROCFUoyeU5mADUvSo6fl-Qq5x1AWStVzcjqI3kX7BC6LzpsPe1jP7YmheGXxg1tYvzOtPGbmCZrbNpgzRBiR8d8WNgbuw2dp603qZsG1-RiY9rsb046J-vnp_XytVi9v7wtH1dFL1EVXDijOHfIwXIsQYCUiIxLxRuHWLGmcZIhyMoCE64UEmztuGXMWlTS4pzcHc_2Kf6MPg96F8fUTR81QiVUibUUU-r-mMo2DP-tdZ_C3qRfzUAfYGmhT7DwDwd5Wm8</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype><pqid>3085723965</pqid></control><display><type>conference_proceeding</type><title>Predicting the popularity of books before publication using machine learning</title><source>American Institute of Physics:Jisc Collections:Transitional Journals Agreement 2021-23 (Reading list)</source><creator>Sachdeva, Hansika ; Puri, Ujjwal ; Poornima, S.</creator><contributor>Godfrey Winster, S ; Pushpalatha, M ; Baskar, M ; Kishore Anthuvan Sahayaraj, K</contributor><creatorcontrib>Sachdeva, Hansika ; Puri, Ujjwal ; Poornima, S. ; Godfrey Winster, S ; Pushpalatha, M ; Baskar, M ; Kishore Anthuvan Sahayaraj, K</creatorcontrib><description>The publishing industry is highly competitive, and predicting the popularity of books before their publication can significantly benefit publishers and authors alike. Publishers tend to put a lot of resources into all the books they choose to publish. If the book ends up being unsuccessful or does not perform as well as it was supposed to, they lose the time and resources spent on that book. There is no measure that can validate the editors’ decisions about books and predict their popularity. In our paper, we propose a novel approach for predicting the popularity of books before publication using the text of books as the primary data source. We investigate the connection between two independent data sets and a book’s popularity. The first one is the various textual features of book manuscripts, and the second is the metadata of the book, like the topic, the genre, and the author. To validate the proposed approach, we have used a large data set of published books and their corresponding ratings and rankings.</description><identifier>ISSN: 0094-243X</identifier><identifier>EISSN: 1551-7616</identifier><identifier>DOI: 10.1063/5.0217437</identifier><identifier>CODEN: APCPCS</identifier><language>eng</language><publisher>Melville: American Institute of Physics</publisher><subject>Datasets ; Machine learning ; Performance prediction</subject><ispartof>AIP Conference Proceedings, 2024, Vol.3075 (1)</ispartof><rights>AIP Publishing LLC</rights><rights>2024 AIP Publishing LLC.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>309,310,314,780,784,789,790,23930,23931,25140,27924,27925</link.rule.ids></links><search><contributor>Godfrey Winster, S</contributor><contributor>Pushpalatha, M</contributor><contributor>Baskar, M</contributor><contributor>Kishore Anthuvan Sahayaraj, K</contributor><creatorcontrib>Sachdeva, Hansika</creatorcontrib><creatorcontrib>Puri, Ujjwal</creatorcontrib><creatorcontrib>Poornima, S.</creatorcontrib><title>Predicting the popularity of books before publication using machine learning</title><title>AIP Conference Proceedings</title><description>The publishing industry is highly competitive, and predicting the popularity of books before their publication can significantly benefit publishers and authors alike. Publishers tend to put a lot of resources into all the books they choose to publish. If the book ends up being unsuccessful or does not perform as well as it was supposed to, they lose the time and resources spent on that book. There is no measure that can validate the editors’ decisions about books and predict their popularity. In our paper, we propose a novel approach for predicting the popularity of books before publication using the text of books as the primary data source. We investigate the connection between two independent data sets and a book’s popularity. The first one is the various textual features of book manuscripts, and the second is the metadata of the book, like the topic, the genre, and the author. To validate the proposed approach, we have used a large data set of published books and their corresponding ratings and rankings.</description><subject>Datasets</subject><subject>Machine learning</subject><subject>Performance prediction</subject><issn>0094-243X</issn><issn>1551-7616</issn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2024</creationdate><recordtype>conference_proceeding</recordtype><recordid>eNotkEtPwzAQhC0EEqVw4B9Y4oaUss76kRxRxUuqBIceuFmO7VKXNA52cuDfk9KeRtqZ3R19hNwyWDCQ-CAWUDLFUZ2RGROCFUoyeU5mADUvSo6fl-Qq5x1AWStVzcjqI3kX7BC6LzpsPe1jP7YmheGXxg1tYvzOtPGbmCZrbNpgzRBiR8d8WNgbuw2dp603qZsG1-RiY9rsb046J-vnp_XytVi9v7wtH1dFL1EVXDijOHfIwXIsQYCUiIxLxRuHWLGmcZIhyMoCE64UEmztuGXMWlTS4pzcHc_2Kf6MPg96F8fUTR81QiVUibUUU-r-mMo2DP-tdZ_C3qRfzUAfYGmhT7DwDwd5Wm8</recordid><startdate>20240729</startdate><enddate>20240729</enddate><creator>Sachdeva, Hansika</creator><creator>Puri, Ujjwal</creator><creator>Poornima, S.</creator><general>American Institute of Physics</general><scope>8FD</scope><scope>H8D</scope><scope>L7M</scope></search><sort><creationdate>20240729</creationdate><title>Predicting the popularity of books before publication using machine learning</title><author>Sachdeva, Hansika ; Puri, Ujjwal ; Poornima, S.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-p637-45da744d340c432050663314674bd3381bbd613068c015d2560c9d4c11cc376c3</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Datasets</topic><topic>Machine learning</topic><topic>Performance prediction</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Sachdeva, Hansika</creatorcontrib><creatorcontrib>Puri, Ujjwal</creatorcontrib><creatorcontrib>Poornima, S.</creatorcontrib><collection>Technology Research Database</collection><collection>Aerospace Database</collection><collection>Advanced Technologies Database with Aerospace</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Sachdeva, Hansika</au><au>Puri, Ujjwal</au><au>Poornima, S.</au><au>Godfrey Winster, S</au><au>Pushpalatha, M</au><au>Baskar, M</au><au>Kishore Anthuvan Sahayaraj, K</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Predicting the popularity of books before publication using machine learning</atitle><btitle>AIP Conference Proceedings</btitle><date>2024-07-29</date><risdate>2024</risdate><volume>3075</volume><issue>1</issue><issn>0094-243X</issn><eissn>1551-7616</eissn><coden>APCPCS</coden><abstract>The publishing industry is highly competitive, and predicting the popularity of books before their publication can significantly benefit publishers and authors alike. Publishers tend to put a lot of resources into all the books they choose to publish. If the book ends up being unsuccessful or does not perform as well as it was supposed to, they lose the time and resources spent on that book. There is no measure that can validate the editors’ decisions about books and predict their popularity. In our paper, we propose a novel approach for predicting the popularity of books before publication using the text of books as the primary data source. We investigate the connection between two independent data sets and a book’s popularity. The first one is the various textual features of book manuscripts, and the second is the metadata of the book, like the topic, the genre, and the author. To validate the proposed approach, we have used a large data set of published books and their corresponding ratings and rankings.</abstract><cop>Melville</cop><pub>American Institute of Physics</pub><doi>10.1063/5.0217437</doi><tpages>19</tpages></addata></record>
fulltext	fulltext
identifier	ISSN: 0094-243X
ispartof	AIP Conference Proceedings, 2024, Vol.3075 (1)
issn	0094-243X 1551-7616
language	eng
recordid	cdi_proquest_journals_3085723965
source	American Institute of Physics:Jisc Collections:Transitional Journals Agreement 2021-23 (Reading list)
subjects	Datasets Machine learning Performance prediction
title	Predicting the popularity of books before publication using machine learning
url	http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-29T05%3A07%3A33IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_scita&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Predicting%20the%20popularity%20of%20books%20before%20publication%20using%20machine%20learning&rft.btitle=AIP%20Conference%20Proceedings&rft.au=Sachdeva,%20Hansika&rft.date=2024-07-29&rft.volume=3075&rft.issue=1&rft.issn=0094-243X&rft.eissn=1551-7616&rft.coden=APCPCS&rft_id=info:doi/10.1063/5.0217437&rft_dat=%3Cproquest_scita%3E3085723965%3C/proquest_scita%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-p637-45da744d340c432050663314674bd3381bbd613068c015d2560c9d4c11cc376c3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=3085723965&rft_id=info:pmid/&rfr_iscdi=true