Loading…

Multimodal Quasi-AutoRegression: forecasting the visual popularity of new fashion products

Estimating the preferences of consumers is of utmost importance for the fashion industry as appropriately leveraging this information can be beneficial in terms of profit. Trend detection in fashion is a challenging task due to the fast pace of change in the fashion industry. Moreover, forecasting t...

Full description

Saved in:

Bibliographic Details
Published in:	International journal of multimedia information retrieval 2022-12, Vol.11 (4), p.717-729
Main Authors:	Papadopoulos, Stefanos-Iordanis, Koutlis, Christos, Papadopoulos, Symeon, Kompatsiaris, Ioannis
Format:	Article
Language:	English
Subjects:	Ablation Adequacy Autoregressive models Comparative studies Computer Science Computer vision Creative process Data Mining and Knowledge Discovery Database Management Datasets Deep learning Design Fashion designers Fashion goods Fashion models Forecasting Image classification Image Processing and Computer Vision Information Storage and Retrieval Information Systems Applications (incl.Internet) Machine learning Multilayer perceptrons Multimedia Information Systems Neural networks Product design Product development Regression analysis Regular Paper Sales forecasting Time series Trends
Citations:	Items that this one cites Items that cite this one
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

cited_by	cdi_FETCH-LOGICAL-c363t-3978c0c858c71a780a30907a458bb14739836f5eb220706cf9e8a2b44e35e4503
cites	cdi_FETCH-LOGICAL-c363t-3978c0c858c71a780a30907a458bb14739836f5eb220706cf9e8a2b44e35e4503
container_end_page	729
container_issue	4
container_start_page	717
container_title	International journal of multimedia information retrieval
container_volume	11
creator	Papadopoulos, Stefanos-Iordanis Koutlis, Christos Papadopoulos, Symeon Kompatsiaris, Ioannis
description	Estimating the preferences of consumers is of utmost importance for the fashion industry as appropriately leveraging this information can be beneficial in terms of profit. Trend detection in fashion is a challenging task due to the fast pace of change in the fashion industry. Moreover, forecasting the visual popularity of new garment designs is even more demanding due to lack of historical data. To this end, we propose MuQAR, a Multimodal Quasi-AutoRegressive deep learning architecture that combines two modules: (1) a multimodal multilayer perceptron processing categorical, visual and textual features of the product and (2) a Quasi-AutoRegressive neural network modelling the “target” time series of the product’s attributes along with the “exogenous” time series of all other attributes. We utilize computer vision, image classification and image captioning, for automatically extracting visual features and textual descriptions from the images of new products. Product design in fashion is initially expressed visually and these features represent the products’ unique characteristics without interfering with the creative process of its designers by requiring additional inputs (e.g. manually written texts). We employ the product’s target attributes time series as a proxy of temporal popularity patterns, mitigating the lack of historical data, while exogenous time series help capture trends among interrelated attributes. We perform an extensive ablation analysis on two large-scale image fashion datasets, Mallzee-P and SHIFT15m to assess the adequacy of MuQAR and also use the Amazon Reviews: Home and Kitchen dataset to assess generalization to other domains. A comparative study on the VISUELLE dataset shows that MuQAR is capable of competing and surpassing the domain’s current state of the art by 4.65% and 4.8% in terms of WAPE and MAE, respectively.
doi_str_mv	10.1007/s13735-022-00262-5
format	article
fullrecord	<record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2920220435</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2920220435</sourcerecordid><originalsourceid>FETCH-LOGICAL-c363t-3978c0c858c71a780a30907a458bb14739836f5eb220706cf9e8a2b44e35e4503</originalsourceid><addsrcrecordid>eNp9kE1LAzEQhoMoWGr_gKcFz9FJsrvJeivFL6iIoiBeQjbNtlu2m5oPpf_e1BW9OZeZw_O8Ay9CpwTOCQC_8IRxVmCgFAPQkuLiAI0oqSguS_p6-HsTcowm3q8hjaAlAT5Cb_exC-3GLlSXPUblWzyNwT6ZpTPet7a_zBrrjFY-tP0yCyuTfbQ-Jnhrt7FTrg27zDZZbz6zRvlVMrKts4uogz9BR43qvJn87DF6ub56nt3i-cPN3Ww6x5qVLGBWcaFBi0JoThQXoBhUwFVeiLomOWeVYGVTmJpS4FDqpjJC0TrPDStMXgAbo7MhNz1-j8YHubbR9emlpBVNrUDOikTRgdLOeu9MI7eu3Si3kwTkvkY51CiTIL9rlHuJDZJPcL807i_6H-sLvI11Eg</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2920220435</pqid></control><display><type>article</type><title>Multimodal Quasi-AutoRegression: forecasting the visual popularity of new fashion products</title><source>Springer Link</source><creator>Papadopoulos, Stefanos-Iordanis ; Koutlis, Christos ; Papadopoulos, Symeon ; Kompatsiaris, Ioannis</creator><creatorcontrib>Papadopoulos, Stefanos-Iordanis ; Koutlis, Christos ; Papadopoulos, Symeon ; Kompatsiaris, Ioannis</creatorcontrib><description>Estimating the preferences of consumers is of utmost importance for the fashion industry as appropriately leveraging this information can be beneficial in terms of profit. Trend detection in fashion is a challenging task due to the fast pace of change in the fashion industry. Moreover, forecasting the visual popularity of new garment designs is even more demanding due to lack of historical data. To this end, we propose MuQAR, a Multimodal Quasi-AutoRegressive deep learning architecture that combines two modules: (1) a multimodal multilayer perceptron processing categorical, visual and textual features of the product and (2) a Quasi-AutoRegressive neural network modelling the “target” time series of the product’s attributes along with the “exogenous” time series of all other attributes. We utilize computer vision, image classification and image captioning, for automatically extracting visual features and textual descriptions from the images of new products. Product design in fashion is initially expressed visually and these features represent the products’ unique characteristics without interfering with the creative process of its designers by requiring additional inputs (e.g. manually written texts). We employ the product’s target attributes time series as a proxy of temporal popularity patterns, mitigating the lack of historical data, while exogenous time series help capture trends among interrelated attributes. We perform an extensive ablation analysis on two large-scale image fashion datasets, Mallzee-P and SHIFT15m to assess the adequacy of MuQAR and also use the Amazon Reviews: Home and Kitchen dataset to assess generalization to other domains. A comparative study on the VISUELLE dataset shows that MuQAR is capable of competing and surpassing the domain’s current state of the art by 4.65% and 4.8% in terms of WAPE and MAE, respectively.</description><identifier>ISSN: 2192-6611</identifier><identifier>EISSN: 2192-662X</identifier><identifier>DOI: 10.1007/s13735-022-00262-5</identifier><language>eng</language><publisher>London: Springer London</publisher><subject>Ablation ; Adequacy ; Autoregressive models ; Comparative studies ; Computer Science ; Computer vision ; Creative process ; Data Mining and Knowledge Discovery ; Database Management ; Datasets ; Deep learning ; Design ; Fashion designers ; Fashion goods ; Fashion models ; Forecasting ; Image classification ; Image Processing and Computer Vision ; Information Storage and Retrieval ; Information Systems Applications (incl.Internet) ; Machine learning ; Multilayer perceptrons ; Multimedia Information Systems ; Neural networks ; Product design ; Product development ; Regression analysis ; Regular Paper ; Sales forecasting ; Time series ; Trends</subject><ispartof>International journal of multimedia information retrieval, 2022-12, Vol.11 (4), p.717-729</ispartof><rights>The Author(s) 2022</rights><rights>The Author(s) 2022. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c363t-3978c0c858c71a780a30907a458bb14739836f5eb220706cf9e8a2b44e35e4503</citedby><cites>FETCH-LOGICAL-c363t-3978c0c858c71a780a30907a458bb14739836f5eb220706cf9e8a2b44e35e4503</cites><orcidid>0000-0002-1424-2647</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids></links><search><creatorcontrib>Papadopoulos, Stefanos-Iordanis</creatorcontrib><creatorcontrib>Koutlis, Christos</creatorcontrib><creatorcontrib>Papadopoulos, Symeon</creatorcontrib><creatorcontrib>Kompatsiaris, Ioannis</creatorcontrib><title>Multimodal Quasi-AutoRegression: forecasting the visual popularity of new fashion products</title><title>International journal of multimedia information retrieval</title><addtitle>Int J Multimed Info Retr</addtitle><description>Estimating the preferences of consumers is of utmost importance for the fashion industry as appropriately leveraging this information can be beneficial in terms of profit. Trend detection in fashion is a challenging task due to the fast pace of change in the fashion industry. Moreover, forecasting the visual popularity of new garment designs is even more demanding due to lack of historical data. To this end, we propose MuQAR, a Multimodal Quasi-AutoRegressive deep learning architecture that combines two modules: (1) a multimodal multilayer perceptron processing categorical, visual and textual features of the product and (2) a Quasi-AutoRegressive neural network modelling the “target” time series of the product’s attributes along with the “exogenous” time series of all other attributes. We utilize computer vision, image classification and image captioning, for automatically extracting visual features and textual descriptions from the images of new products. Product design in fashion is initially expressed visually and these features represent the products’ unique characteristics without interfering with the creative process of its designers by requiring additional inputs (e.g. manually written texts). We employ the product’s target attributes time series as a proxy of temporal popularity patterns, mitigating the lack of historical data, while exogenous time series help capture trends among interrelated attributes. We perform an extensive ablation analysis on two large-scale image fashion datasets, Mallzee-P and SHIFT15m to assess the adequacy of MuQAR and also use the Amazon Reviews: Home and Kitchen dataset to assess generalization to other domains. A comparative study on the VISUELLE dataset shows that MuQAR is capable of competing and surpassing the domain’s current state of the art by 4.65% and 4.8% in terms of WAPE and MAE, respectively.</description><subject>Ablation</subject><subject>Adequacy</subject><subject>Autoregressive models</subject><subject>Comparative studies</subject><subject>Computer Science</subject><subject>Computer vision</subject><subject>Creative process</subject><subject>Data Mining and Knowledge Discovery</subject><subject>Database Management</subject><subject>Datasets</subject><subject>Deep learning</subject><subject>Design</subject><subject>Fashion designers</subject><subject>Fashion goods</subject><subject>Fashion models</subject><subject>Forecasting</subject><subject>Image classification</subject><subject>Image Processing and Computer Vision</subject><subject>Information Storage and Retrieval</subject><subject>Information Systems Applications (incl.Internet)</subject><subject>Machine learning</subject><subject>Multilayer perceptrons</subject><subject>Multimedia Information Systems</subject><subject>Neural networks</subject><subject>Product design</subject><subject>Product development</subject><subject>Regression analysis</subject><subject>Regular Paper</subject><subject>Sales forecasting</subject><subject>Time series</subject><subject>Trends</subject><issn>2192-6611</issn><issn>2192-662X</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><recordid>eNp9kE1LAzEQhoMoWGr_gKcFz9FJsrvJeivFL6iIoiBeQjbNtlu2m5oPpf_e1BW9OZeZw_O8Ay9CpwTOCQC_8IRxVmCgFAPQkuLiAI0oqSguS_p6-HsTcowm3q8hjaAlAT5Cb_exC-3GLlSXPUblWzyNwT6ZpTPet7a_zBrrjFY-tP0yCyuTfbQ-Jnhrt7FTrg27zDZZbz6zRvlVMrKts4uogz9BR43qvJn87DF6ub56nt3i-cPN3Ww6x5qVLGBWcaFBi0JoThQXoBhUwFVeiLomOWeVYGVTmJpS4FDqpjJC0TrPDStMXgAbo7MhNz1-j8YHubbR9emlpBVNrUDOikTRgdLOeu9MI7eu3Si3kwTkvkY51CiTIL9rlHuJDZJPcL807i_6H-sLvI11Eg</recordid><startdate>20221201</startdate><enddate>20221201</enddate><creator>Papadopoulos, Stefanos-Iordanis</creator><creator>Koutlis, Christos</creator><creator>Papadopoulos, Symeon</creator><creator>Kompatsiaris, Ioannis</creator><general>Springer London</general><general>Springer Nature B.V</general><scope>C6C</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>L6V</scope><scope>M7S</scope><scope>P5Z</scope><scope>P62</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PTHSS</scope><orcidid>https://orcid.org/0000-0002-1424-2647</orcidid></search><sort><creationdate>20221201</creationdate><title>Multimodal Quasi-AutoRegression: forecasting the visual popularity of new fashion products</title><author>Papadopoulos, Stefanos-Iordanis ; Koutlis, Christos ; Papadopoulos, Symeon ; Kompatsiaris, Ioannis</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c363t-3978c0c858c71a780a30907a458bb14739836f5eb220706cf9e8a2b44e35e4503</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Ablation</topic><topic>Adequacy</topic><topic>Autoregressive models</topic><topic>Comparative studies</topic><topic>Computer Science</topic><topic>Computer vision</topic><topic>Creative process</topic><topic>Data Mining and Knowledge Discovery</topic><topic>Database Management</topic><topic>Datasets</topic><topic>Deep learning</topic><topic>Design</topic><topic>Fashion designers</topic><topic>Fashion goods</topic><topic>Fashion models</topic><topic>Forecasting</topic><topic>Image classification</topic><topic>Image Processing and Computer Vision</topic><topic>Information Storage and Retrieval</topic><topic>Information Systems Applications (incl.Internet)</topic><topic>Machine learning</topic><topic>Multilayer perceptrons</topic><topic>Multimedia Information Systems</topic><topic>Neural networks</topic><topic>Product design</topic><topic>Product development</topic><topic>Regression analysis</topic><topic>Regular Paper</topic><topic>Sales forecasting</topic><topic>Time series</topic><topic>Trends</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Papadopoulos, Stefanos-Iordanis</creatorcontrib><creatorcontrib>Koutlis, Christos</creatorcontrib><creatorcontrib>Papadopoulos, Symeon</creatorcontrib><creatorcontrib>Kompatsiaris, Ioannis</creatorcontrib><collection>Springer_OA刊</collection><collection>CrossRef</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest Central</collection><collection>Advanced Technologies & Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer Science Database</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Advanced Technologies & Aerospace Database</collection><collection>ProQuest Advanced Technologies & Aerospace Collection</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>Engineering Collection</collection><jtitle>International journal of multimedia information retrieval</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Papadopoulos, Stefanos-Iordanis</au><au>Koutlis, Christos</au><au>Papadopoulos, Symeon</au><au>Kompatsiaris, Ioannis</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Multimodal Quasi-AutoRegression: forecasting the visual popularity of new fashion products</atitle><jtitle>International journal of multimedia information retrieval</jtitle><stitle>Int J Multimed Info Retr</stitle><date>2022-12-01</date><risdate>2022</risdate><volume>11</volume><issue>4</issue><spage>717</spage><epage>729</epage><pages>717-729</pages><issn>2192-6611</issn><eissn>2192-662X</eissn><abstract>Estimating the preferences of consumers is of utmost importance for the fashion industry as appropriately leveraging this information can be beneficial in terms of profit. Trend detection in fashion is a challenging task due to the fast pace of change in the fashion industry. Moreover, forecasting the visual popularity of new garment designs is even more demanding due to lack of historical data. To this end, we propose MuQAR, a Multimodal Quasi-AutoRegressive deep learning architecture that combines two modules: (1) a multimodal multilayer perceptron processing categorical, visual and textual features of the product and (2) a Quasi-AutoRegressive neural network modelling the “target” time series of the product’s attributes along with the “exogenous” time series of all other attributes. We utilize computer vision, image classification and image captioning, for automatically extracting visual features and textual descriptions from the images of new products. Product design in fashion is initially expressed visually and these features represent the products’ unique characteristics without interfering with the creative process of its designers by requiring additional inputs (e.g. manually written texts). We employ the product’s target attributes time series as a proxy of temporal popularity patterns, mitigating the lack of historical data, while exogenous time series help capture trends among interrelated attributes. We perform an extensive ablation analysis on two large-scale image fashion datasets, Mallzee-P and SHIFT15m to assess the adequacy of MuQAR and also use the Amazon Reviews: Home and Kitchen dataset to assess generalization to other domains. A comparative study on the VISUELLE dataset shows that MuQAR is capable of competing and surpassing the domain’s current state of the art by 4.65% and 4.8% in terms of WAPE and MAE, respectively.</abstract><cop>London</cop><pub>Springer London</pub><doi>10.1007/s13735-022-00262-5</doi><tpages>13</tpages><orcidid>https://orcid.org/0000-0002-1424-2647</orcidid><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	ISSN: 2192-6611
ispartof	International journal of multimedia information retrieval, 2022-12, Vol.11 (4), p.717-729
issn	2192-6611 2192-662X
language	eng
recordid	cdi_proquest_journals_2920220435
source	Springer Link
subjects	Ablation Adequacy Autoregressive models Comparative studies Computer Science Computer vision Creative process Data Mining and Knowledge Discovery Database Management Datasets Deep learning Design Fashion designers Fashion goods Fashion models Forecasting Image classification Image Processing and Computer Vision Information Storage and Retrieval Information Systems Applications (incl.Internet) Machine learning Multilayer perceptrons Multimedia Information Systems Neural networks Product design Product development Regression analysis Regular Paper Sales forecasting Time series Trends
title	Multimodal Quasi-AutoRegression: forecasting the visual popularity of new fashion products
url	http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-08T03%3A23%3A00IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Multimodal%20Quasi-AutoRegression:%20forecasting%20the%20visual%20popularity%20of%20new%20fashion%20products&rft.jtitle=International%20journal%20of%20multimedia%20information%20retrieval&rft.au=Papadopoulos,%20Stefanos-Iordanis&rft.date=2022-12-01&rft.volume=11&rft.issue=4&rft.spage=717&rft.epage=729&rft.pages=717-729&rft.issn=2192-6611&rft.eissn=2192-662X&rft_id=info:doi/10.1007/s13735-022-00262-5&rft_dat=%3Cproquest_cross%3E2920220435%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c363t-3978c0c858c71a780a30907a458bb14739836f5eb220706cf9e8a2b44e35e4503%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2920220435&rft_id=info:pmid/&rfr_iscdi=true