Loading…
A deep reinforcement learning based long-term recommender system
Recommender systems aim to maximize the overall accuracy for long-term recommendations. However, most of the existing recommendation models adopt a static view, and ignore the fact that recommendation is a dynamic sequential decision-making process. As a result, they fail to adapt to new situations...
Saved in:
Published in: | Knowledge-based systems 2021-02, Vol.213, p.106706, Article 106706 |
---|---|
Main Authors: | , , , , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | cdi_FETCH-LOGICAL-c334t-38d1b5dddf783efb799a2ff182b244e2517d9d47555a212829d031fb0a78bebe3 |
---|---|
cites | cdi_FETCH-LOGICAL-c334t-38d1b5dddf783efb799a2ff182b244e2517d9d47555a212829d031fb0a78bebe3 |
container_end_page | |
container_issue | |
container_start_page | 106706 |
container_title | Knowledge-based systems |
container_volume | 213 |
creator | Huang, Liwei Fu, Mingsheng Li, Fan Qu, Hong Liu, Yangjun Chen, Wenyu |
description | Recommender systems aim to maximize the overall accuracy for long-term recommendations. However, most of the existing recommendation models adopt a static view, and ignore the fact that recommendation is a dynamic sequential decision-making process. As a result, they fail to adapt to new situations and suffer from the cold-start problem. Although sequential recommendation methods have been gaining attention recently, the objective of long-term recommendation still has not been explicitly addressed since these methods are developed for short-term prediction situations. To overcome these problems, we propose a novel top-N interactive recommender system based on deep reinforcement learning. In our model, the processes of recommendation are viewed as Markov decision processes (MDP), wherein the interactions between agent (recommender system) and environment (user) are simulated by the recurrent neural network (RNN). In addition, reinforcement learning is employed to optimize the proposed model for the purpose of maximizing long-term recommendation accuracy. Experimental results based on several benchmarks show that our model significantly outperforms previous top-N methods in terms of Hit-Rate and NDCG for the long-term recommendation, and can be applied to both cold-start and warm-start scenarios.
•A novel top-N interactive recommender system based on deep reinforcement learning is proposed.•The interactions between recommender system and users are simulated by recurrent neural networks.•The proposed model can deal with both cold-start and warm-start scenarios.•Reinforcement learning and supervised learning are employed to optimize the proposed model for long-term recommendation accuracy.•Experiments and comparisons are conducted to show the merits of the proposed model. |
doi_str_mv | 10.1016/j.knosys.2020.106706 |
format | article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2502226749</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0950705120308352</els_id><sourcerecordid>2502226749</sourcerecordid><originalsourceid>FETCH-LOGICAL-c334t-38d1b5dddf783efb799a2ff182b244e2517d9d47555a212829d031fb0a78bebe3</originalsourceid><addsrcrecordid>eNp9kE1LwzAYx4MoOKffwEPBc2uSJk1yEcfwDQZe9Bya5sloXZOZdMK-vRn17OmBh_8L_x9CtwRXBJPmfqi-fEjHVFFMT69G4OYMLYgUtBQMq3O0wIrjUmBOLtFVSgPGmFIiF-hxVViAfRGh9y7EDkbwU7GDNvrebwvTJrDFLvhtOUEcs6wLY5ZYiEUunGC8Rheu3SW4-btL9Pn89LF-LTfvL2_r1abs6ppNZS0tMdxa64SswRmhVEudI5IayhhQToRVlgnOeUsJlVRZXBNncCukAQP1Et3NufsYvg-QJj2EQ_S5UlOex9BGMJVVbFZ1MaQUwel97Mc2HjXB-sRKD3pmpU-s9Mwq2x5mG-QFPz1EnboefAe2z4snbUP_f8AvBX10Ow</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2502226749</pqid></control><display><type>article</type><title>A deep reinforcement learning based long-term recommender system</title><source>Library & Information Science Abstracts (LISA)</source><source>Elsevier</source><creator>Huang, Liwei ; Fu, Mingsheng ; Li, Fan ; Qu, Hong ; Liu, Yangjun ; Chen, Wenyu</creator><creatorcontrib>Huang, Liwei ; Fu, Mingsheng ; Li, Fan ; Qu, Hong ; Liu, Yangjun ; Chen, Wenyu</creatorcontrib><description>Recommender systems aim to maximize the overall accuracy for long-term recommendations. However, most of the existing recommendation models adopt a static view, and ignore the fact that recommendation is a dynamic sequential decision-making process. As a result, they fail to adapt to new situations and suffer from the cold-start problem. Although sequential recommendation methods have been gaining attention recently, the objective of long-term recommendation still has not been explicitly addressed since these methods are developed for short-term prediction situations. To overcome these problems, we propose a novel top-N interactive recommender system based on deep reinforcement learning. In our model, the processes of recommendation are viewed as Markov decision processes (MDP), wherein the interactions between agent (recommender system) and environment (user) are simulated by the recurrent neural network (RNN). In addition, reinforcement learning is employed to optimize the proposed model for the purpose of maximizing long-term recommendation accuracy. Experimental results based on several benchmarks show that our model significantly outperforms previous top-N methods in terms of Hit-Rate and NDCG for the long-term recommendation, and can be applied to both cold-start and warm-start scenarios.
•A novel top-N interactive recommender system based on deep reinforcement learning is proposed.•The interactions between recommender system and users are simulated by recurrent neural networks.•The proposed model can deal with both cold-start and warm-start scenarios.•Reinforcement learning and supervised learning are employed to optimize the proposed model for long-term recommendation accuracy.•Experiments and comparisons are conducted to show the merits of the proposed model.</description><identifier>ISSN: 0950-7051</identifier><identifier>EISSN: 1872-7409</identifier><identifier>DOI: 10.1016/j.knosys.2020.106706</identifier><language>eng</language><publisher>Amsterdam: Elsevier B.V</publisher><subject>Cold starts ; Cold-start ; Decision making ; Deep learning ; Deep reinforcement learning ; Interactive systems ; Long-term recommendation ; Markov processes ; Optimization ; Recommender system ; Recommender systems ; Recurrent neural networks</subject><ispartof>Knowledge-based systems, 2021-02, Vol.213, p.106706, Article 106706</ispartof><rights>2020 Elsevier B.V.</rights><rights>Copyright Elsevier Science Ltd. Feb 15, 2021</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c334t-38d1b5dddf783efb799a2ff182b244e2517d9d47555a212829d031fb0a78bebe3</citedby><cites>FETCH-LOGICAL-c334t-38d1b5dddf783efb799a2ff182b244e2517d9d47555a212829d031fb0a78bebe3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925,34135</link.rule.ids></links><search><creatorcontrib>Huang, Liwei</creatorcontrib><creatorcontrib>Fu, Mingsheng</creatorcontrib><creatorcontrib>Li, Fan</creatorcontrib><creatorcontrib>Qu, Hong</creatorcontrib><creatorcontrib>Liu, Yangjun</creatorcontrib><creatorcontrib>Chen, Wenyu</creatorcontrib><title>A deep reinforcement learning based long-term recommender system</title><title>Knowledge-based systems</title><description>Recommender systems aim to maximize the overall accuracy for long-term recommendations. However, most of the existing recommendation models adopt a static view, and ignore the fact that recommendation is a dynamic sequential decision-making process. As a result, they fail to adapt to new situations and suffer from the cold-start problem. Although sequential recommendation methods have been gaining attention recently, the objective of long-term recommendation still has not been explicitly addressed since these methods are developed for short-term prediction situations. To overcome these problems, we propose a novel top-N interactive recommender system based on deep reinforcement learning. In our model, the processes of recommendation are viewed as Markov decision processes (MDP), wherein the interactions between agent (recommender system) and environment (user) are simulated by the recurrent neural network (RNN). In addition, reinforcement learning is employed to optimize the proposed model for the purpose of maximizing long-term recommendation accuracy. Experimental results based on several benchmarks show that our model significantly outperforms previous top-N methods in terms of Hit-Rate and NDCG for the long-term recommendation, and can be applied to both cold-start and warm-start scenarios.
•A novel top-N interactive recommender system based on deep reinforcement learning is proposed.•The interactions between recommender system and users are simulated by recurrent neural networks.•The proposed model can deal with both cold-start and warm-start scenarios.•Reinforcement learning and supervised learning are employed to optimize the proposed model for long-term recommendation accuracy.•Experiments and comparisons are conducted to show the merits of the proposed model.</description><subject>Cold starts</subject><subject>Cold-start</subject><subject>Decision making</subject><subject>Deep learning</subject><subject>Deep reinforcement learning</subject><subject>Interactive systems</subject><subject>Long-term recommendation</subject><subject>Markov processes</subject><subject>Optimization</subject><subject>Recommender system</subject><subject>Recommender systems</subject><subject>Recurrent neural networks</subject><issn>0950-7051</issn><issn>1872-7409</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>F2A</sourceid><recordid>eNp9kE1LwzAYx4MoOKffwEPBc2uSJk1yEcfwDQZe9Bya5sloXZOZdMK-vRn17OmBh_8L_x9CtwRXBJPmfqi-fEjHVFFMT69G4OYMLYgUtBQMq3O0wIrjUmBOLtFVSgPGmFIiF-hxVViAfRGh9y7EDkbwU7GDNvrebwvTJrDFLvhtOUEcs6wLY5ZYiEUunGC8Rheu3SW4-btL9Pn89LF-LTfvL2_r1abs6ppNZS0tMdxa64SswRmhVEudI5IayhhQToRVlgnOeUsJlVRZXBNncCukAQP1Et3NufsYvg-QJj2EQ_S5UlOex9BGMJVVbFZ1MaQUwel97Mc2HjXB-sRKD3pmpU-s9Mwq2x5mG-QFPz1EnboefAe2z4snbUP_f8AvBX10Ow</recordid><startdate>20210215</startdate><enddate>20210215</enddate><creator>Huang, Liwei</creator><creator>Fu, Mingsheng</creator><creator>Li, Fan</creator><creator>Qu, Hong</creator><creator>Liu, Yangjun</creator><creator>Chen, Wenyu</creator><general>Elsevier B.V</general><general>Elsevier Science Ltd</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>E3H</scope><scope>F2A</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20210215</creationdate><title>A deep reinforcement learning based long-term recommender system</title><author>Huang, Liwei ; Fu, Mingsheng ; Li, Fan ; Qu, Hong ; Liu, Yangjun ; Chen, Wenyu</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c334t-38d1b5dddf783efb799a2ff182b244e2517d9d47555a212829d031fb0a78bebe3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Cold starts</topic><topic>Cold-start</topic><topic>Decision making</topic><topic>Deep learning</topic><topic>Deep reinforcement learning</topic><topic>Interactive systems</topic><topic>Long-term recommendation</topic><topic>Markov processes</topic><topic>Optimization</topic><topic>Recommender system</topic><topic>Recommender systems</topic><topic>Recurrent neural networks</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Huang, Liwei</creatorcontrib><creatorcontrib>Fu, Mingsheng</creatorcontrib><creatorcontrib>Li, Fan</creatorcontrib><creatorcontrib>Qu, Hong</creatorcontrib><creatorcontrib>Liu, Yangjun</creatorcontrib><creatorcontrib>Chen, Wenyu</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>Library & Information Sciences Abstracts (LISA)</collection><collection>Library & Information Science Abstracts (LISA)</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Knowledge-based systems</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Huang, Liwei</au><au>Fu, Mingsheng</au><au>Li, Fan</au><au>Qu, Hong</au><au>Liu, Yangjun</au><au>Chen, Wenyu</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A deep reinforcement learning based long-term recommender system</atitle><jtitle>Knowledge-based systems</jtitle><date>2021-02-15</date><risdate>2021</risdate><volume>213</volume><spage>106706</spage><pages>106706-</pages><artnum>106706</artnum><issn>0950-7051</issn><eissn>1872-7409</eissn><abstract>Recommender systems aim to maximize the overall accuracy for long-term recommendations. However, most of the existing recommendation models adopt a static view, and ignore the fact that recommendation is a dynamic sequential decision-making process. As a result, they fail to adapt to new situations and suffer from the cold-start problem. Although sequential recommendation methods have been gaining attention recently, the objective of long-term recommendation still has not been explicitly addressed since these methods are developed for short-term prediction situations. To overcome these problems, we propose a novel top-N interactive recommender system based on deep reinforcement learning. In our model, the processes of recommendation are viewed as Markov decision processes (MDP), wherein the interactions between agent (recommender system) and environment (user) are simulated by the recurrent neural network (RNN). In addition, reinforcement learning is employed to optimize the proposed model for the purpose of maximizing long-term recommendation accuracy. Experimental results based on several benchmarks show that our model significantly outperforms previous top-N methods in terms of Hit-Rate and NDCG for the long-term recommendation, and can be applied to both cold-start and warm-start scenarios.
•A novel top-N interactive recommender system based on deep reinforcement learning is proposed.•The interactions between recommender system and users are simulated by recurrent neural networks.•The proposed model can deal with both cold-start and warm-start scenarios.•Reinforcement learning and supervised learning are employed to optimize the proposed model for long-term recommendation accuracy.•Experiments and comparisons are conducted to show the merits of the proposed model.</abstract><cop>Amsterdam</cop><pub>Elsevier B.V</pub><doi>10.1016/j.knosys.2020.106706</doi></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0950-7051 |
ispartof | Knowledge-based systems, 2021-02, Vol.213, p.106706, Article 106706 |
issn | 0950-7051 1872-7409 |
language | eng |
recordid | cdi_proquest_journals_2502226749 |
source | Library & Information Science Abstracts (LISA); Elsevier |
subjects | Cold starts Cold-start Decision making Deep learning Deep reinforcement learning Interactive systems Long-term recommendation Markov processes Optimization Recommender system Recommender systems Recurrent neural networks |
title | A deep reinforcement learning based long-term recommender system |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-07T18%3A47%3A41IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20deep%20reinforcement%20learning%20based%20long-term%20recommender%20system&rft.jtitle=Knowledge-based%20systems&rft.au=Huang,%20Liwei&rft.date=2021-02-15&rft.volume=213&rft.spage=106706&rft.pages=106706-&rft.artnum=106706&rft.issn=0950-7051&rft.eissn=1872-7409&rft_id=info:doi/10.1016/j.knosys.2020.106706&rft_dat=%3Cproquest_cross%3E2502226749%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c334t-38d1b5dddf783efb799a2ff182b244e2517d9d47555a212829d031fb0a78bebe3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2502226749&rft_id=info:pmid/&rfr_iscdi=true |