Loading…
Multimodal Quasi-AutoRegression: forecasting the visual popularity of new fashion products
Estimating the preferences of consumers is of utmost importance for the fashion industry as appropriately leveraging this information can be beneficial in terms of profit. Trend detection in fashion is a challenging task due to the fast pace of change in the fashion industry. Moreover, forecasting t...
Saved in:
Published in: | International journal of multimedia information retrieval 2022-12, Vol.11 (4), p.717-729 |
---|---|
Main Authors: | , , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | cdi_FETCH-LOGICAL-c363t-3978c0c858c71a780a30907a458bb14739836f5eb220706cf9e8a2b44e35e4503 |
---|---|
cites | cdi_FETCH-LOGICAL-c363t-3978c0c858c71a780a30907a458bb14739836f5eb220706cf9e8a2b44e35e4503 |
container_end_page | 729 |
container_issue | 4 |
container_start_page | 717 |
container_title | International journal of multimedia information retrieval |
container_volume | 11 |
creator | Papadopoulos, Stefanos-Iordanis Koutlis, Christos Papadopoulos, Symeon Kompatsiaris, Ioannis |
description | Estimating the preferences of consumers is of utmost importance for the fashion industry as appropriately leveraging this information can be beneficial in terms of profit. Trend detection in fashion is a challenging task due to the fast pace of change in the fashion industry. Moreover, forecasting the visual popularity of new garment designs is even more demanding due to lack of historical data. To this end, we propose MuQAR, a Multimodal Quasi-AutoRegressive deep learning architecture that combines two modules: (1) a multimodal multilayer perceptron processing categorical, visual and textual features of the product and (2) a Quasi-AutoRegressive neural network modelling the “target” time series of the product’s attributes along with the “exogenous” time series of all other attributes. We utilize computer vision, image classification and image captioning, for automatically extracting visual features and textual descriptions from the images of new products. Product design in fashion is initially expressed visually and these features represent the products’ unique characteristics without interfering with the creative process of its designers by requiring additional inputs (e.g. manually written texts). We employ the product’s target attributes time series as a proxy of temporal popularity patterns, mitigating the lack of historical data, while exogenous time series help capture trends among interrelated attributes. We perform an extensive ablation analysis on two large-scale image fashion datasets, Mallzee-P and SHIFT15m to assess the adequacy of MuQAR and also use the Amazon Reviews: Home and Kitchen dataset to assess generalization to other domains. A comparative study on the VISUELLE dataset shows that MuQAR is capable of competing and surpassing the domain’s current state of the art by 4.65% and 4.8% in terms of WAPE and MAE, respectively. |
doi_str_mv | 10.1007/s13735-022-00262-5 |
format | article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2920220435</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2920220435</sourcerecordid><originalsourceid>FETCH-LOGICAL-c363t-3978c0c858c71a780a30907a458bb14739836f5eb220706cf9e8a2b44e35e4503</originalsourceid><addsrcrecordid>eNp9kE1LAzEQhoMoWGr_gKcFz9FJsrvJeivFL6iIoiBeQjbNtlu2m5oPpf_e1BW9OZeZw_O8Ay9CpwTOCQC_8IRxVmCgFAPQkuLiAI0oqSguS_p6-HsTcowm3q8hjaAlAT5Cb_exC-3GLlSXPUblWzyNwT6ZpTPet7a_zBrrjFY-tP0yCyuTfbQ-Jnhrt7FTrg27zDZZbz6zRvlVMrKts4uogz9BR43qvJn87DF6ub56nt3i-cPN3Ww6x5qVLGBWcaFBi0JoThQXoBhUwFVeiLomOWeVYGVTmJpS4FDqpjJC0TrPDStMXgAbo7MhNz1-j8YHubbR9emlpBVNrUDOikTRgdLOeu9MI7eu3Si3kwTkvkY51CiTIL9rlHuJDZJPcL807i_6H-sLvI11Eg</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2920220435</pqid></control><display><type>article</type><title>Multimodal Quasi-AutoRegression: forecasting the visual popularity of new fashion products</title><source>Springer Link</source><creator>Papadopoulos, Stefanos-Iordanis ; Koutlis, Christos ; Papadopoulos, Symeon ; Kompatsiaris, Ioannis</creator><creatorcontrib>Papadopoulos, Stefanos-Iordanis ; Koutlis, Christos ; Papadopoulos, Symeon ; Kompatsiaris, Ioannis</creatorcontrib><description>Estimating the preferences of consumers is of utmost importance for the fashion industry as appropriately leveraging this information can be beneficial in terms of profit. Trend detection in fashion is a challenging task due to the fast pace of change in the fashion industry. Moreover, forecasting the visual popularity of new garment designs is even more demanding due to lack of historical data. To this end, we propose MuQAR, a Multimodal Quasi-AutoRegressive deep learning architecture that combines two modules: (1) a multimodal multilayer perceptron processing categorical, visual and textual features of the product and (2) a Quasi-AutoRegressive neural network modelling the “target” time series of the product’s attributes along with the “exogenous” time series of all other attributes. We utilize computer vision, image classification and image captioning, for automatically extracting visual features and textual descriptions from the images of new products. Product design in fashion is initially expressed visually and these features represent the products’ unique characteristics without interfering with the creative process of its designers by requiring additional inputs (e.g. manually written texts). We employ the product’s target attributes time series as a proxy of temporal popularity patterns, mitigating the lack of historical data, while exogenous time series help capture trends among interrelated attributes. We perform an extensive ablation analysis on two large-scale image fashion datasets, Mallzee-P and SHIFT15m to assess the adequacy of MuQAR and also use the Amazon Reviews: Home and Kitchen dataset to assess generalization to other domains. A comparative study on the VISUELLE dataset shows that MuQAR is capable of competing and surpassing the domain’s current state of the art by 4.65% and 4.8% in terms of WAPE and MAE, respectively.</description><identifier>ISSN: 2192-6611</identifier><identifier>EISSN: 2192-662X</identifier><identifier>DOI: 10.1007/s13735-022-00262-5</identifier><language>eng</language><publisher>London: Springer London</publisher><subject>Ablation ; Adequacy ; Autoregressive models ; Comparative studies ; Computer Science ; Computer vision ; Creative process ; Data Mining and Knowledge Discovery ; Database Management ; Datasets ; Deep learning ; Design ; Fashion designers ; Fashion goods ; Fashion models ; Forecasting ; Image classification ; Image Processing and Computer Vision ; Information Storage and Retrieval ; Information Systems Applications (incl.Internet) ; Machine learning ; Multilayer perceptrons ; Multimedia Information Systems ; Neural networks ; Product design ; Product development ; Regression analysis ; Regular Paper ; Sales forecasting ; Time series ; Trends</subject><ispartof>International journal of multimedia information retrieval, 2022-12, Vol.11 (4), p.717-729</ispartof><rights>The Author(s) 2022</rights><rights>The Author(s) 2022. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c363t-3978c0c858c71a780a30907a458bb14739836f5eb220706cf9e8a2b44e35e4503</citedby><cites>FETCH-LOGICAL-c363t-3978c0c858c71a780a30907a458bb14739836f5eb220706cf9e8a2b44e35e4503</cites><orcidid>0000-0002-1424-2647</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids></links><search><creatorcontrib>Papadopoulos, Stefanos-Iordanis</creatorcontrib><creatorcontrib>Koutlis, Christos</creatorcontrib><creatorcontrib>Papadopoulos, Symeon</creatorcontrib><creatorcontrib>Kompatsiaris, Ioannis</creatorcontrib><title>Multimodal Quasi-AutoRegression: forecasting the visual popularity of new fashion products</title><title>International journal of multimedia information retrieval</title><addtitle>Int J Multimed Info Retr</addtitle><description>Estimating the preferences of consumers is of utmost importance for the fashion industry as appropriately leveraging this information can be beneficial in terms of profit. Trend detection in fashion is a challenging task due to the fast pace of change in the fashion industry. Moreover, forecasting the visual popularity of new garment designs is even more demanding due to lack of historical data. To this end, we propose MuQAR, a Multimodal Quasi-AutoRegressive deep learning architecture that combines two modules: (1) a multimodal multilayer perceptron processing categorical, visual and textual features of the product and (2) a Quasi-AutoRegressive neural network modelling the “target” time series of the product’s attributes along with the “exogenous” time series of all other attributes. We utilize computer vision, image classification and image captioning, for automatically extracting visual features and textual descriptions from the images of new products. Product design in fashion is initially expressed visually and these features represent the products’ unique characteristics without interfering with the creative process of its designers by requiring additional inputs (e.g. manually written texts). We employ the product’s target attributes time series as a proxy of temporal popularity patterns, mitigating the lack of historical data, while exogenous time series help capture trends among interrelated attributes. We perform an extensive ablation analysis on two large-scale image fashion datasets, Mallzee-P and SHIFT15m to assess the adequacy of MuQAR and also use the Amazon Reviews: Home and Kitchen dataset to assess generalization to other domains. A comparative study on the VISUELLE dataset shows that MuQAR is capable of competing and surpassing the domain’s current state of the art by 4.65% and 4.8% in terms of WAPE and MAE, respectively.</description><subject>Ablation</subject><subject>Adequacy</subject><subject>Autoregressive models</subject><subject>Comparative studies</subject><subject>Computer Science</subject><subject>Computer vision</subject><subject>Creative process</subject><subject>Data Mining and Knowledge Discovery</subject><subject>Database Management</subject><subject>Datasets</subject><subject>Deep learning</subject><subject>Design</subject><subject>Fashion designers</subject><subject>Fashion goods</subject><subject>Fashion models</subject><subject>Forecasting</subject><subject>Image classification</subject><subject>Image Processing and Computer Vision</subject><subject>Information Storage and Retrieval</subject><subject>Information Systems Applications (incl.Internet)</subject><subject>Machine learning</subject><subject>Multilayer perceptrons</subject><subject>Multimedia Information Systems</subject><subject>Neural networks</subject><subject>Product design</subject><subject>Product development</subject><subject>Regression analysis</subject><subject>Regular Paper</subject><subject>Sales forecasting</subject><subject>Time series</subject><subject>Trends</subject><issn>2192-6611</issn><issn>2192-662X</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><recordid>eNp9kE1LAzEQhoMoWGr_gKcFz9FJsrvJeivFL6iIoiBeQjbNtlu2m5oPpf_e1BW9OZeZw_O8Ay9CpwTOCQC_8IRxVmCgFAPQkuLiAI0oqSguS_p6-HsTcowm3q8hjaAlAT5Cb_exC-3GLlSXPUblWzyNwT6ZpTPet7a_zBrrjFY-tP0yCyuTfbQ-Jnhrt7FTrg27zDZZbz6zRvlVMrKts4uogz9BR43qvJn87DF6ub56nt3i-cPN3Ww6x5qVLGBWcaFBi0JoThQXoBhUwFVeiLomOWeVYGVTmJpS4FDqpjJC0TrPDStMXgAbo7MhNz1-j8YHubbR9emlpBVNrUDOikTRgdLOeu9MI7eu3Si3kwTkvkY51CiTIL9rlHuJDZJPcL807i_6H-sLvI11Eg</recordid><startdate>20221201</startdate><enddate>20221201</enddate><creator>Papadopoulos, Stefanos-Iordanis</creator><creator>Koutlis, Christos</creator><creator>Papadopoulos, Symeon</creator><creator>Kompatsiaris, Ioannis</creator><general>Springer London</general><general>Springer Nature B.V</general><scope>C6C</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>L6V</scope><scope>M7S</scope><scope>P5Z</scope><scope>P62</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PTHSS</scope><orcidid>https://orcid.org/0000-0002-1424-2647</orcidid></search><sort><creationdate>20221201</creationdate><title>Multimodal Quasi-AutoRegression: forecasting the visual popularity of new fashion products</title><author>Papadopoulos, Stefanos-Iordanis ; Koutlis, Christos ; Papadopoulos, Symeon ; Kompatsiaris, Ioannis</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c363t-3978c0c858c71a780a30907a458bb14739836f5eb220706cf9e8a2b44e35e4503</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Ablation</topic><topic>Adequacy</topic><topic>Autoregressive models</topic><topic>Comparative studies</topic><topic>Computer Science</topic><topic>Computer vision</topic><topic>Creative process</topic><topic>Data Mining and Knowledge Discovery</topic><topic>Database Management</topic><topic>Datasets</topic><topic>Deep learning</topic><topic>Design</topic><topic>Fashion designers</topic><topic>Fashion goods</topic><topic>Fashion models</topic><topic>Forecasting</topic><topic>Image classification</topic><topic>Image Processing and Computer Vision</topic><topic>Information Storage and Retrieval</topic><topic>Information Systems Applications (incl.Internet)</topic><topic>Machine learning</topic><topic>Multilayer perceptrons</topic><topic>Multimedia Information Systems</topic><topic>Neural networks</topic><topic>Product design</topic><topic>Product development</topic><topic>Regression analysis</topic><topic>Regular Paper</topic><topic>Sales forecasting</topic><topic>Time series</topic><topic>Trends</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Papadopoulos, Stefanos-Iordanis</creatorcontrib><creatorcontrib>Koutlis, Christos</creatorcontrib><creatorcontrib>Papadopoulos, Symeon</creatorcontrib><creatorcontrib>Kompatsiaris, Ioannis</creatorcontrib><collection>Springer_OA刊</collection><collection>CrossRef</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest Central</collection><collection>Advanced Technologies & Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer Science Database</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Advanced Technologies & Aerospace Database</collection><collection>ProQuest Advanced Technologies & Aerospace Collection</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>Engineering Collection</collection><jtitle>International journal of multimedia information retrieval</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Papadopoulos, Stefanos-Iordanis</au><au>Koutlis, Christos</au><au>Papadopoulos, Symeon</au><au>Kompatsiaris, Ioannis</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Multimodal Quasi-AutoRegression: forecasting the visual popularity of new fashion products</atitle><jtitle>International journal of multimedia information retrieval</jtitle><stitle>Int J Multimed Info Retr</stitle><date>2022-12-01</date><risdate>2022</risdate><volume>11</volume><issue>4</issue><spage>717</spage><epage>729</epage><pages>717-729</pages><issn>2192-6611</issn><eissn>2192-662X</eissn><abstract>Estimating the preferences of consumers is of utmost importance for the fashion industry as appropriately leveraging this information can be beneficial in terms of profit. Trend detection in fashion is a challenging task due to the fast pace of change in the fashion industry. Moreover, forecasting the visual popularity of new garment designs is even more demanding due to lack of historical data. To this end, we propose MuQAR, a Multimodal Quasi-AutoRegressive deep learning architecture that combines two modules: (1) a multimodal multilayer perceptron processing categorical, visual and textual features of the product and (2) a Quasi-AutoRegressive neural network modelling the “target” time series of the product’s attributes along with the “exogenous” time series of all other attributes. We utilize computer vision, image classification and image captioning, for automatically extracting visual features and textual descriptions from the images of new products. Product design in fashion is initially expressed visually and these features represent the products’ unique characteristics without interfering with the creative process of its designers by requiring additional inputs (e.g. manually written texts). We employ the product’s target attributes time series as a proxy of temporal popularity patterns, mitigating the lack of historical data, while exogenous time series help capture trends among interrelated attributes. We perform an extensive ablation analysis on two large-scale image fashion datasets, Mallzee-P and SHIFT15m to assess the adequacy of MuQAR and also use the Amazon Reviews: Home and Kitchen dataset to assess generalization to other domains. A comparative study on the VISUELLE dataset shows that MuQAR is capable of competing and surpassing the domain’s current state of the art by 4.65% and 4.8% in terms of WAPE and MAE, respectively.</abstract><cop>London</cop><pub>Springer London</pub><doi>10.1007/s13735-022-00262-5</doi><tpages>13</tpages><orcidid>https://orcid.org/0000-0002-1424-2647</orcidid><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 2192-6611 |
ispartof | International journal of multimedia information retrieval, 2022-12, Vol.11 (4), p.717-729 |
issn | 2192-6611 2192-662X |
language | eng |
recordid | cdi_proquest_journals_2920220435 |
source | Springer Link |
subjects | Ablation Adequacy Autoregressive models Comparative studies Computer Science Computer vision Creative process Data Mining and Knowledge Discovery Database Management Datasets Deep learning Design Fashion designers Fashion goods Fashion models Forecasting Image classification Image Processing and Computer Vision Information Storage and Retrieval Information Systems Applications (incl.Internet) Machine learning Multilayer perceptrons Multimedia Information Systems Neural networks Product design Product development Regression analysis Regular Paper Sales forecasting Time series Trends |
title | Multimodal Quasi-AutoRegression: forecasting the visual popularity of new fashion products |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-08T03%3A23%3A00IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Multimodal%20Quasi-AutoRegression:%20forecasting%20the%20visual%20popularity%20of%20new%20fashion%20products&rft.jtitle=International%20journal%20of%20multimedia%20information%20retrieval&rft.au=Papadopoulos,%20Stefanos-Iordanis&rft.date=2022-12-01&rft.volume=11&rft.issue=4&rft.spage=717&rft.epage=729&rft.pages=717-729&rft.issn=2192-6611&rft.eissn=2192-662X&rft_id=info:doi/10.1007/s13735-022-00262-5&rft_dat=%3Cproquest_cross%3E2920220435%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c363t-3978c0c858c71a780a30907a458bb14739836f5eb220706cf9e8a2b44e35e4503%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2920220435&rft_id=info:pmid/&rfr_iscdi=true |