Loading…

A deep reinforcement learning based long-term recommender system

Recommender systems aim to maximize the overall accuracy for long-term recommendations. However, most of the existing recommendation models adopt a static view, and ignore the fact that recommendation is a dynamic sequential decision-making process. As a result, they fail to adapt to new situations...

Full description

Saved in:
Bibliographic Details
Published in:Knowledge-based systems 2021-02, Vol.213, p.106706, Article 106706
Main Authors: Huang, Liwei, Fu, Mingsheng, Li, Fan, Qu, Hong, Liu, Yangjun, Chen, Wenyu
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c334t-38d1b5dddf783efb799a2ff182b244e2517d9d47555a212829d031fb0a78bebe3
cites cdi_FETCH-LOGICAL-c334t-38d1b5dddf783efb799a2ff182b244e2517d9d47555a212829d031fb0a78bebe3
container_end_page
container_issue
container_start_page 106706
container_title Knowledge-based systems
container_volume 213
creator Huang, Liwei
Fu, Mingsheng
Li, Fan
Qu, Hong
Liu, Yangjun
Chen, Wenyu
description Recommender systems aim to maximize the overall accuracy for long-term recommendations. However, most of the existing recommendation models adopt a static view, and ignore the fact that recommendation is a dynamic sequential decision-making process. As a result, they fail to adapt to new situations and suffer from the cold-start problem. Although sequential recommendation methods have been gaining attention recently, the objective of long-term recommendation still has not been explicitly addressed since these methods are developed for short-term prediction situations. To overcome these problems, we propose a novel top-N interactive recommender system based on deep reinforcement learning. In our model, the processes of recommendation are viewed as Markov decision processes (MDP), wherein the interactions between agent (recommender system) and environment (user) are simulated by the recurrent neural network (RNN). In addition, reinforcement learning is employed to optimize the proposed model for the purpose of maximizing long-term recommendation accuracy. Experimental results based on several benchmarks show that our model significantly outperforms previous top-N methods in terms of Hit-Rate and NDCG for the long-term recommendation, and can be applied to both cold-start and warm-start scenarios. •A novel top-N interactive recommender system based on deep reinforcement learning is proposed.•The interactions between recommender system and users are simulated by recurrent neural networks.•The proposed model can deal with both cold-start and warm-start scenarios.•Reinforcement learning and supervised learning are employed to optimize the proposed model for long-term recommendation accuracy.•Experiments and comparisons are conducted to show the merits of the proposed model.
doi_str_mv 10.1016/j.knosys.2020.106706
format article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2502226749</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0950705120308352</els_id><sourcerecordid>2502226749</sourcerecordid><originalsourceid>FETCH-LOGICAL-c334t-38d1b5dddf783efb799a2ff182b244e2517d9d47555a212829d031fb0a78bebe3</originalsourceid><addsrcrecordid>eNp9kE1LwzAYx4MoOKffwEPBc2uSJk1yEcfwDQZe9Bya5sloXZOZdMK-vRn17OmBh_8L_x9CtwRXBJPmfqi-fEjHVFFMT69G4OYMLYgUtBQMq3O0wIrjUmBOLtFVSgPGmFIiF-hxVViAfRGh9y7EDkbwU7GDNvrebwvTJrDFLvhtOUEcs6wLY5ZYiEUunGC8Rheu3SW4-btL9Pn89LF-LTfvL2_r1abs6ppNZS0tMdxa64SswRmhVEudI5IayhhQToRVlgnOeUsJlVRZXBNncCukAQP1Et3NufsYvg-QJj2EQ_S5UlOex9BGMJVVbFZ1MaQUwel97Mc2HjXB-sRKD3pmpU-s9Mwq2x5mG-QFPz1EnboefAe2z4snbUP_f8AvBX10Ow</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2502226749</pqid></control><display><type>article</type><title>A deep reinforcement learning based long-term recommender system</title><source>Library &amp; Information Science Abstracts (LISA)</source><source>Elsevier</source><creator>Huang, Liwei ; Fu, Mingsheng ; Li, Fan ; Qu, Hong ; Liu, Yangjun ; Chen, Wenyu</creator><creatorcontrib>Huang, Liwei ; Fu, Mingsheng ; Li, Fan ; Qu, Hong ; Liu, Yangjun ; Chen, Wenyu</creatorcontrib><description>Recommender systems aim to maximize the overall accuracy for long-term recommendations. However, most of the existing recommendation models adopt a static view, and ignore the fact that recommendation is a dynamic sequential decision-making process. As a result, they fail to adapt to new situations and suffer from the cold-start problem. Although sequential recommendation methods have been gaining attention recently, the objective of long-term recommendation still has not been explicitly addressed since these methods are developed for short-term prediction situations. To overcome these problems, we propose a novel top-N interactive recommender system based on deep reinforcement learning. In our model, the processes of recommendation are viewed as Markov decision processes (MDP), wherein the interactions between agent (recommender system) and environment (user) are simulated by the recurrent neural network (RNN). In addition, reinforcement learning is employed to optimize the proposed model for the purpose of maximizing long-term recommendation accuracy. Experimental results based on several benchmarks show that our model significantly outperforms previous top-N methods in terms of Hit-Rate and NDCG for the long-term recommendation, and can be applied to both cold-start and warm-start scenarios. •A novel top-N interactive recommender system based on deep reinforcement learning is proposed.•The interactions between recommender system and users are simulated by recurrent neural networks.•The proposed model can deal with both cold-start and warm-start scenarios.•Reinforcement learning and supervised learning are employed to optimize the proposed model for long-term recommendation accuracy.•Experiments and comparisons are conducted to show the merits of the proposed model.</description><identifier>ISSN: 0950-7051</identifier><identifier>EISSN: 1872-7409</identifier><identifier>DOI: 10.1016/j.knosys.2020.106706</identifier><language>eng</language><publisher>Amsterdam: Elsevier B.V</publisher><subject>Cold starts ; Cold-start ; Decision making ; Deep learning ; Deep reinforcement learning ; Interactive systems ; Long-term recommendation ; Markov processes ; Optimization ; Recommender system ; Recommender systems ; Recurrent neural networks</subject><ispartof>Knowledge-based systems, 2021-02, Vol.213, p.106706, Article 106706</ispartof><rights>2020 Elsevier B.V.</rights><rights>Copyright Elsevier Science Ltd. Feb 15, 2021</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c334t-38d1b5dddf783efb799a2ff182b244e2517d9d47555a212829d031fb0a78bebe3</citedby><cites>FETCH-LOGICAL-c334t-38d1b5dddf783efb799a2ff182b244e2517d9d47555a212829d031fb0a78bebe3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925,34135</link.rule.ids></links><search><creatorcontrib>Huang, Liwei</creatorcontrib><creatorcontrib>Fu, Mingsheng</creatorcontrib><creatorcontrib>Li, Fan</creatorcontrib><creatorcontrib>Qu, Hong</creatorcontrib><creatorcontrib>Liu, Yangjun</creatorcontrib><creatorcontrib>Chen, Wenyu</creatorcontrib><title>A deep reinforcement learning based long-term recommender system</title><title>Knowledge-based systems</title><description>Recommender systems aim to maximize the overall accuracy for long-term recommendations. However, most of the existing recommendation models adopt a static view, and ignore the fact that recommendation is a dynamic sequential decision-making process. As a result, they fail to adapt to new situations and suffer from the cold-start problem. Although sequential recommendation methods have been gaining attention recently, the objective of long-term recommendation still has not been explicitly addressed since these methods are developed for short-term prediction situations. To overcome these problems, we propose a novel top-N interactive recommender system based on deep reinforcement learning. In our model, the processes of recommendation are viewed as Markov decision processes (MDP), wherein the interactions between agent (recommender system) and environment (user) are simulated by the recurrent neural network (RNN). In addition, reinforcement learning is employed to optimize the proposed model for the purpose of maximizing long-term recommendation accuracy. Experimental results based on several benchmarks show that our model significantly outperforms previous top-N methods in terms of Hit-Rate and NDCG for the long-term recommendation, and can be applied to both cold-start and warm-start scenarios. •A novel top-N interactive recommender system based on deep reinforcement learning is proposed.•The interactions between recommender system and users are simulated by recurrent neural networks.•The proposed model can deal with both cold-start and warm-start scenarios.•Reinforcement learning and supervised learning are employed to optimize the proposed model for long-term recommendation accuracy.•Experiments and comparisons are conducted to show the merits of the proposed model.</description><subject>Cold starts</subject><subject>Cold-start</subject><subject>Decision making</subject><subject>Deep learning</subject><subject>Deep reinforcement learning</subject><subject>Interactive systems</subject><subject>Long-term recommendation</subject><subject>Markov processes</subject><subject>Optimization</subject><subject>Recommender system</subject><subject>Recommender systems</subject><subject>Recurrent neural networks</subject><issn>0950-7051</issn><issn>1872-7409</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>F2A</sourceid><recordid>eNp9kE1LwzAYx4MoOKffwEPBc2uSJk1yEcfwDQZe9Bya5sloXZOZdMK-vRn17OmBh_8L_x9CtwRXBJPmfqi-fEjHVFFMT69G4OYMLYgUtBQMq3O0wIrjUmBOLtFVSgPGmFIiF-hxVViAfRGh9y7EDkbwU7GDNvrebwvTJrDFLvhtOUEcs6wLY5ZYiEUunGC8Rheu3SW4-btL9Pn89LF-LTfvL2_r1abs6ppNZS0tMdxa64SswRmhVEudI5IayhhQToRVlgnOeUsJlVRZXBNncCukAQP1Et3NufsYvg-QJj2EQ_S5UlOex9BGMJVVbFZ1MaQUwel97Mc2HjXB-sRKD3pmpU-s9Mwq2x5mG-QFPz1EnboefAe2z4snbUP_f8AvBX10Ow</recordid><startdate>20210215</startdate><enddate>20210215</enddate><creator>Huang, Liwei</creator><creator>Fu, Mingsheng</creator><creator>Li, Fan</creator><creator>Qu, Hong</creator><creator>Liu, Yangjun</creator><creator>Chen, Wenyu</creator><general>Elsevier B.V</general><general>Elsevier Science Ltd</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>E3H</scope><scope>F2A</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20210215</creationdate><title>A deep reinforcement learning based long-term recommender system</title><author>Huang, Liwei ; Fu, Mingsheng ; Li, Fan ; Qu, Hong ; Liu, Yangjun ; Chen, Wenyu</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c334t-38d1b5dddf783efb799a2ff182b244e2517d9d47555a212829d031fb0a78bebe3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Cold starts</topic><topic>Cold-start</topic><topic>Decision making</topic><topic>Deep learning</topic><topic>Deep reinforcement learning</topic><topic>Interactive systems</topic><topic>Long-term recommendation</topic><topic>Markov processes</topic><topic>Optimization</topic><topic>Recommender system</topic><topic>Recommender systems</topic><topic>Recurrent neural networks</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Huang, Liwei</creatorcontrib><creatorcontrib>Fu, Mingsheng</creatorcontrib><creatorcontrib>Li, Fan</creatorcontrib><creatorcontrib>Qu, Hong</creatorcontrib><creatorcontrib>Liu, Yangjun</creatorcontrib><creatorcontrib>Chen, Wenyu</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>Library &amp; Information Sciences Abstracts (LISA)</collection><collection>Library &amp; Information Science Abstracts (LISA)</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Knowledge-based systems</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Huang, Liwei</au><au>Fu, Mingsheng</au><au>Li, Fan</au><au>Qu, Hong</au><au>Liu, Yangjun</au><au>Chen, Wenyu</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A deep reinforcement learning based long-term recommender system</atitle><jtitle>Knowledge-based systems</jtitle><date>2021-02-15</date><risdate>2021</risdate><volume>213</volume><spage>106706</spage><pages>106706-</pages><artnum>106706</artnum><issn>0950-7051</issn><eissn>1872-7409</eissn><abstract>Recommender systems aim to maximize the overall accuracy for long-term recommendations. However, most of the existing recommendation models adopt a static view, and ignore the fact that recommendation is a dynamic sequential decision-making process. As a result, they fail to adapt to new situations and suffer from the cold-start problem. Although sequential recommendation methods have been gaining attention recently, the objective of long-term recommendation still has not been explicitly addressed since these methods are developed for short-term prediction situations. To overcome these problems, we propose a novel top-N interactive recommender system based on deep reinforcement learning. In our model, the processes of recommendation are viewed as Markov decision processes (MDP), wherein the interactions between agent (recommender system) and environment (user) are simulated by the recurrent neural network (RNN). In addition, reinforcement learning is employed to optimize the proposed model for the purpose of maximizing long-term recommendation accuracy. Experimental results based on several benchmarks show that our model significantly outperforms previous top-N methods in terms of Hit-Rate and NDCG for the long-term recommendation, and can be applied to both cold-start and warm-start scenarios. •A novel top-N interactive recommender system based on deep reinforcement learning is proposed.•The interactions between recommender system and users are simulated by recurrent neural networks.•The proposed model can deal with both cold-start and warm-start scenarios.•Reinforcement learning and supervised learning are employed to optimize the proposed model for long-term recommendation accuracy.•Experiments and comparisons are conducted to show the merits of the proposed model.</abstract><cop>Amsterdam</cop><pub>Elsevier B.V</pub><doi>10.1016/j.knosys.2020.106706</doi></addata></record>
fulltext fulltext
identifier ISSN: 0950-7051
ispartof Knowledge-based systems, 2021-02, Vol.213, p.106706, Article 106706
issn 0950-7051
1872-7409
language eng
recordid cdi_proquest_journals_2502226749
source Library & Information Science Abstracts (LISA); Elsevier
subjects Cold starts
Cold-start
Decision making
Deep learning
Deep reinforcement learning
Interactive systems
Long-term recommendation
Markov processes
Optimization
Recommender system
Recommender systems
Recurrent neural networks
title A deep reinforcement learning based long-term recommender system
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-07T18%3A47%3A41IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20deep%20reinforcement%20learning%20based%20long-term%20recommender%20system&rft.jtitle=Knowledge-based%20systems&rft.au=Huang,%20Liwei&rft.date=2021-02-15&rft.volume=213&rft.spage=106706&rft.pages=106706-&rft.artnum=106706&rft.issn=0950-7051&rft.eissn=1872-7409&rft_id=info:doi/10.1016/j.knosys.2020.106706&rft_dat=%3Cproquest_cross%3E2502226749%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c334t-38d1b5dddf783efb799a2ff182b244e2517d9d47555a212829d031fb0a78bebe3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2502226749&rft_id=info:pmid/&rfr_iscdi=true