Loading…

Multimodal Quasi-AutoRegression: forecasting the visual popularity of new fashion products

Estimating the preferences of consumers is of utmost importance for the fashion industry as appropriately leveraging this information can be beneficial in terms of profit. Trend detection in fashion is a challenging task due to the fast pace of change in the fashion industry. Moreover, forecasting t...

Full description

Saved in:
Bibliographic Details
Published in:International journal of multimedia information retrieval 2022-12, Vol.11 (4), p.717-729
Main Authors: Papadopoulos, Stefanos-Iordanis, Koutlis, Christos, Papadopoulos, Symeon, Kompatsiaris, Ioannis
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c363t-3978c0c858c71a780a30907a458bb14739836f5eb220706cf9e8a2b44e35e4503
cites cdi_FETCH-LOGICAL-c363t-3978c0c858c71a780a30907a458bb14739836f5eb220706cf9e8a2b44e35e4503
container_end_page 729
container_issue 4
container_start_page 717
container_title International journal of multimedia information retrieval
container_volume 11
creator Papadopoulos, Stefanos-Iordanis
Koutlis, Christos
Papadopoulos, Symeon
Kompatsiaris, Ioannis
description Estimating the preferences of consumers is of utmost importance for the fashion industry as appropriately leveraging this information can be beneficial in terms of profit. Trend detection in fashion is a challenging task due to the fast pace of change in the fashion industry. Moreover, forecasting the visual popularity of new garment designs is even more demanding due to lack of historical data. To this end, we propose MuQAR, a Multimodal Quasi-AutoRegressive deep learning architecture that combines two modules: (1) a multimodal multilayer perceptron processing categorical, visual and textual features of the product and (2) a Quasi-AutoRegressive neural network modelling the “target” time series of the product’s attributes along with the “exogenous” time series of all other attributes. We utilize computer vision, image classification and image captioning, for automatically extracting visual features and textual descriptions from the images of new products. Product design in fashion is initially expressed visually and these features represent the products’ unique characteristics without interfering with the creative process of its designers by requiring additional inputs (e.g. manually written texts). We employ the product’s target attributes time series as a proxy of temporal popularity patterns, mitigating the lack of historical data, while exogenous time series help capture trends among interrelated attributes. We perform an extensive ablation analysis on two large-scale image fashion datasets, Mallzee-P and SHIFT15m to assess the adequacy of MuQAR and also use the Amazon Reviews: Home and Kitchen dataset to assess generalization to other domains. A comparative study on the VISUELLE dataset shows that MuQAR is capable of competing and surpassing the domain’s current state of the art by 4.65% and 4.8% in terms of WAPE and MAE, respectively.
doi_str_mv 10.1007/s13735-022-00262-5
format article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2920220435</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2920220435</sourcerecordid><originalsourceid>FETCH-LOGICAL-c363t-3978c0c858c71a780a30907a458bb14739836f5eb220706cf9e8a2b44e35e4503</originalsourceid><addsrcrecordid>eNp9kE1LAzEQhoMoWGr_gKcFz9FJsrvJeivFL6iIoiBeQjbNtlu2m5oPpf_e1BW9OZeZw_O8Ay9CpwTOCQC_8IRxVmCgFAPQkuLiAI0oqSguS_p6-HsTcowm3q8hjaAlAT5Cb_exC-3GLlSXPUblWzyNwT6ZpTPet7a_zBrrjFY-tP0yCyuTfbQ-Jnhrt7FTrg27zDZZbz6zRvlVMrKts4uogz9BR43qvJn87DF6ub56nt3i-cPN3Ww6x5qVLGBWcaFBi0JoThQXoBhUwFVeiLomOWeVYGVTmJpS4FDqpjJC0TrPDStMXgAbo7MhNz1-j8YHubbR9emlpBVNrUDOikTRgdLOeu9MI7eu3Si3kwTkvkY51CiTIL9rlHuJDZJPcL807i_6H-sLvI11Eg</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2920220435</pqid></control><display><type>article</type><title>Multimodal Quasi-AutoRegression: forecasting the visual popularity of new fashion products</title><source>Springer Link</source><creator>Papadopoulos, Stefanos-Iordanis ; Koutlis, Christos ; Papadopoulos, Symeon ; Kompatsiaris, Ioannis</creator><creatorcontrib>Papadopoulos, Stefanos-Iordanis ; Koutlis, Christos ; Papadopoulos, Symeon ; Kompatsiaris, Ioannis</creatorcontrib><description>Estimating the preferences of consumers is of utmost importance for the fashion industry as appropriately leveraging this information can be beneficial in terms of profit. Trend detection in fashion is a challenging task due to the fast pace of change in the fashion industry. Moreover, forecasting the visual popularity of new garment designs is even more demanding due to lack of historical data. To this end, we propose MuQAR, a Multimodal Quasi-AutoRegressive deep learning architecture that combines two modules: (1) a multimodal multilayer perceptron processing categorical, visual and textual features of the product and (2) a Quasi-AutoRegressive neural network modelling the “target” time series of the product’s attributes along with the “exogenous” time series of all other attributes. We utilize computer vision, image classification and image captioning, for automatically extracting visual features and textual descriptions from the images of new products. Product design in fashion is initially expressed visually and these features represent the products’ unique characteristics without interfering with the creative process of its designers by requiring additional inputs (e.g. manually written texts). We employ the product’s target attributes time series as a proxy of temporal popularity patterns, mitigating the lack of historical data, while exogenous time series help capture trends among interrelated attributes. We perform an extensive ablation analysis on two large-scale image fashion datasets, Mallzee-P and SHIFT15m to assess the adequacy of MuQAR and also use the Amazon Reviews: Home and Kitchen dataset to assess generalization to other domains. A comparative study on the VISUELLE dataset shows that MuQAR is capable of competing and surpassing the domain’s current state of the art by 4.65% and 4.8% in terms of WAPE and MAE, respectively.</description><identifier>ISSN: 2192-6611</identifier><identifier>EISSN: 2192-662X</identifier><identifier>DOI: 10.1007/s13735-022-00262-5</identifier><language>eng</language><publisher>London: Springer London</publisher><subject>Ablation ; Adequacy ; Autoregressive models ; Comparative studies ; Computer Science ; Computer vision ; Creative process ; Data Mining and Knowledge Discovery ; Database Management ; Datasets ; Deep learning ; Design ; Fashion designers ; Fashion goods ; Fashion models ; Forecasting ; Image classification ; Image Processing and Computer Vision ; Information Storage and Retrieval ; Information Systems Applications (incl.Internet) ; Machine learning ; Multilayer perceptrons ; Multimedia Information Systems ; Neural networks ; Product design ; Product development ; Regression analysis ; Regular Paper ; Sales forecasting ; Time series ; Trends</subject><ispartof>International journal of multimedia information retrieval, 2022-12, Vol.11 (4), p.717-729</ispartof><rights>The Author(s) 2022</rights><rights>The Author(s) 2022. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c363t-3978c0c858c71a780a30907a458bb14739836f5eb220706cf9e8a2b44e35e4503</citedby><cites>FETCH-LOGICAL-c363t-3978c0c858c71a780a30907a458bb14739836f5eb220706cf9e8a2b44e35e4503</cites><orcidid>0000-0002-1424-2647</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids></links><search><creatorcontrib>Papadopoulos, Stefanos-Iordanis</creatorcontrib><creatorcontrib>Koutlis, Christos</creatorcontrib><creatorcontrib>Papadopoulos, Symeon</creatorcontrib><creatorcontrib>Kompatsiaris, Ioannis</creatorcontrib><title>Multimodal Quasi-AutoRegression: forecasting the visual popularity of new fashion products</title><title>International journal of multimedia information retrieval</title><addtitle>Int J Multimed Info Retr</addtitle><description>Estimating the preferences of consumers is of utmost importance for the fashion industry as appropriately leveraging this information can be beneficial in terms of profit. Trend detection in fashion is a challenging task due to the fast pace of change in the fashion industry. Moreover, forecasting the visual popularity of new garment designs is even more demanding due to lack of historical data. To this end, we propose MuQAR, a Multimodal Quasi-AutoRegressive deep learning architecture that combines two modules: (1) a multimodal multilayer perceptron processing categorical, visual and textual features of the product and (2) a Quasi-AutoRegressive neural network modelling the “target” time series of the product’s attributes along with the “exogenous” time series of all other attributes. We utilize computer vision, image classification and image captioning, for automatically extracting visual features and textual descriptions from the images of new products. Product design in fashion is initially expressed visually and these features represent the products’ unique characteristics without interfering with the creative process of its designers by requiring additional inputs (e.g. manually written texts). We employ the product’s target attributes time series as a proxy of temporal popularity patterns, mitigating the lack of historical data, while exogenous time series help capture trends among interrelated attributes. We perform an extensive ablation analysis on two large-scale image fashion datasets, Mallzee-P and SHIFT15m to assess the adequacy of MuQAR and also use the Amazon Reviews: Home and Kitchen dataset to assess generalization to other domains. A comparative study on the VISUELLE dataset shows that MuQAR is capable of competing and surpassing the domain’s current state of the art by 4.65% and 4.8% in terms of WAPE and MAE, respectively.</description><subject>Ablation</subject><subject>Adequacy</subject><subject>Autoregressive models</subject><subject>Comparative studies</subject><subject>Computer Science</subject><subject>Computer vision</subject><subject>Creative process</subject><subject>Data Mining and Knowledge Discovery</subject><subject>Database Management</subject><subject>Datasets</subject><subject>Deep learning</subject><subject>Design</subject><subject>Fashion designers</subject><subject>Fashion goods</subject><subject>Fashion models</subject><subject>Forecasting</subject><subject>Image classification</subject><subject>Image Processing and Computer Vision</subject><subject>Information Storage and Retrieval</subject><subject>Information Systems Applications (incl.Internet)</subject><subject>Machine learning</subject><subject>Multilayer perceptrons</subject><subject>Multimedia Information Systems</subject><subject>Neural networks</subject><subject>Product design</subject><subject>Product development</subject><subject>Regression analysis</subject><subject>Regular Paper</subject><subject>Sales forecasting</subject><subject>Time series</subject><subject>Trends</subject><issn>2192-6611</issn><issn>2192-662X</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><recordid>eNp9kE1LAzEQhoMoWGr_gKcFz9FJsrvJeivFL6iIoiBeQjbNtlu2m5oPpf_e1BW9OZeZw_O8Ay9CpwTOCQC_8IRxVmCgFAPQkuLiAI0oqSguS_p6-HsTcowm3q8hjaAlAT5Cb_exC-3GLlSXPUblWzyNwT6ZpTPet7a_zBrrjFY-tP0yCyuTfbQ-Jnhrt7FTrg27zDZZbz6zRvlVMrKts4uogz9BR43qvJn87DF6ub56nt3i-cPN3Ww6x5qVLGBWcaFBi0JoThQXoBhUwFVeiLomOWeVYGVTmJpS4FDqpjJC0TrPDStMXgAbo7MhNz1-j8YHubbR9emlpBVNrUDOikTRgdLOeu9MI7eu3Si3kwTkvkY51CiTIL9rlHuJDZJPcL807i_6H-sLvI11Eg</recordid><startdate>20221201</startdate><enddate>20221201</enddate><creator>Papadopoulos, Stefanos-Iordanis</creator><creator>Koutlis, Christos</creator><creator>Papadopoulos, Symeon</creator><creator>Kompatsiaris, Ioannis</creator><general>Springer London</general><general>Springer Nature B.V</general><scope>C6C</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>L6V</scope><scope>M7S</scope><scope>P5Z</scope><scope>P62</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PTHSS</scope><orcidid>https://orcid.org/0000-0002-1424-2647</orcidid></search><sort><creationdate>20221201</creationdate><title>Multimodal Quasi-AutoRegression: forecasting the visual popularity of new fashion products</title><author>Papadopoulos, Stefanos-Iordanis ; Koutlis, Christos ; Papadopoulos, Symeon ; Kompatsiaris, Ioannis</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c363t-3978c0c858c71a780a30907a458bb14739836f5eb220706cf9e8a2b44e35e4503</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Ablation</topic><topic>Adequacy</topic><topic>Autoregressive models</topic><topic>Comparative studies</topic><topic>Computer Science</topic><topic>Computer vision</topic><topic>Creative process</topic><topic>Data Mining and Knowledge Discovery</topic><topic>Database Management</topic><topic>Datasets</topic><topic>Deep learning</topic><topic>Design</topic><topic>Fashion designers</topic><topic>Fashion goods</topic><topic>Fashion models</topic><topic>Forecasting</topic><topic>Image classification</topic><topic>Image Processing and Computer Vision</topic><topic>Information Storage and Retrieval</topic><topic>Information Systems Applications (incl.Internet)</topic><topic>Machine learning</topic><topic>Multilayer perceptrons</topic><topic>Multimedia Information Systems</topic><topic>Neural networks</topic><topic>Product design</topic><topic>Product development</topic><topic>Regression analysis</topic><topic>Regular Paper</topic><topic>Sales forecasting</topic><topic>Time series</topic><topic>Trends</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Papadopoulos, Stefanos-Iordanis</creatorcontrib><creatorcontrib>Koutlis, Christos</creatorcontrib><creatorcontrib>Papadopoulos, Symeon</creatorcontrib><creatorcontrib>Kompatsiaris, Ioannis</creatorcontrib><collection>Springer_OA刊</collection><collection>CrossRef</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central</collection><collection>Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer Science Database</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Advanced Technologies &amp; Aerospace Database</collection><collection>ProQuest Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>Engineering Collection</collection><jtitle>International journal of multimedia information retrieval</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Papadopoulos, Stefanos-Iordanis</au><au>Koutlis, Christos</au><au>Papadopoulos, Symeon</au><au>Kompatsiaris, Ioannis</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Multimodal Quasi-AutoRegression: forecasting the visual popularity of new fashion products</atitle><jtitle>International journal of multimedia information retrieval</jtitle><stitle>Int J Multimed Info Retr</stitle><date>2022-12-01</date><risdate>2022</risdate><volume>11</volume><issue>4</issue><spage>717</spage><epage>729</epage><pages>717-729</pages><issn>2192-6611</issn><eissn>2192-662X</eissn><abstract>Estimating the preferences of consumers is of utmost importance for the fashion industry as appropriately leveraging this information can be beneficial in terms of profit. Trend detection in fashion is a challenging task due to the fast pace of change in the fashion industry. Moreover, forecasting the visual popularity of new garment designs is even more demanding due to lack of historical data. To this end, we propose MuQAR, a Multimodal Quasi-AutoRegressive deep learning architecture that combines two modules: (1) a multimodal multilayer perceptron processing categorical, visual and textual features of the product and (2) a Quasi-AutoRegressive neural network modelling the “target” time series of the product’s attributes along with the “exogenous” time series of all other attributes. We utilize computer vision, image classification and image captioning, for automatically extracting visual features and textual descriptions from the images of new products. Product design in fashion is initially expressed visually and these features represent the products’ unique characteristics without interfering with the creative process of its designers by requiring additional inputs (e.g. manually written texts). We employ the product’s target attributes time series as a proxy of temporal popularity patterns, mitigating the lack of historical data, while exogenous time series help capture trends among interrelated attributes. We perform an extensive ablation analysis on two large-scale image fashion datasets, Mallzee-P and SHIFT15m to assess the adequacy of MuQAR and also use the Amazon Reviews: Home and Kitchen dataset to assess generalization to other domains. A comparative study on the VISUELLE dataset shows that MuQAR is capable of competing and surpassing the domain’s current state of the art by 4.65% and 4.8% in terms of WAPE and MAE, respectively.</abstract><cop>London</cop><pub>Springer London</pub><doi>10.1007/s13735-022-00262-5</doi><tpages>13</tpages><orcidid>https://orcid.org/0000-0002-1424-2647</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 2192-6611
ispartof International journal of multimedia information retrieval, 2022-12, Vol.11 (4), p.717-729
issn 2192-6611
2192-662X
language eng
recordid cdi_proquest_journals_2920220435
source Springer Link
subjects Ablation
Adequacy
Autoregressive models
Comparative studies
Computer Science
Computer vision
Creative process
Data Mining and Knowledge Discovery
Database Management
Datasets
Deep learning
Design
Fashion designers
Fashion goods
Fashion models
Forecasting
Image classification
Image Processing and Computer Vision
Information Storage and Retrieval
Information Systems Applications (incl.Internet)
Machine learning
Multilayer perceptrons
Multimedia Information Systems
Neural networks
Product design
Product development
Regression analysis
Regular Paper
Sales forecasting
Time series
Trends
title Multimodal Quasi-AutoRegression: forecasting the visual popularity of new fashion products
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-08T03%3A23%3A00IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Multimodal%20Quasi-AutoRegression:%20forecasting%20the%20visual%20popularity%20of%20new%20fashion%20products&rft.jtitle=International%20journal%20of%20multimedia%20information%20retrieval&rft.au=Papadopoulos,%20Stefanos-Iordanis&rft.date=2022-12-01&rft.volume=11&rft.issue=4&rft.spage=717&rft.epage=729&rft.pages=717-729&rft.issn=2192-6611&rft.eissn=2192-662X&rft_id=info:doi/10.1007/s13735-022-00262-5&rft_dat=%3Cproquest_cross%3E2920220435%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c363t-3978c0c858c71a780a30907a458bb14739836f5eb220706cf9e8a2b44e35e4503%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2920220435&rft_id=info:pmid/&rfr_iscdi=true