
Frequency learning for image classification

Machine learning applied to computer vision and signal processing is achieving results comparable to the human brain on specific tasks due to the great improvements brought by the deep neural networks (DNN). The majority of state-of-the-art architectures nowadays are DNN related, but only a few expl...

Bibliographic Details
Published in:	arXiv.org 2020-06
Main Authors:	Stuchi, José Augusto ; Levy Boccato ; Attux, Romis
Format:	Article
Language:	English
Subjects:	Artificial neural networks ; Computer vision ; Fourier transforms ; Frequency domain analysis ; Frequency filters ; Image classification ; Image processing ; Machine learning ; Signal processing ; Slicing

cited_by
cites
container_end_page
container_issue
container_start_page
container_title	arXiv.org
container_volume
creator	Stuchi, José Augusto ; Levy Boccato ; Attux, Romis
description	Machine learning applied to computer vision and signal processing is achieving results comparable to the human brain on specific tasks, thanks to the improvements brought by deep neural networks (DNNs). Most state-of-the-art architectures today are DNN-based, but only a few explore the frequency domain to extract useful information and improve results, as is done in the image processing field. In this context, this paper presents a new approach for exploring the Fourier transform of the input images, composed of trainable frequency filters that boost discriminative components in the spectrum. Additionally, we propose a slicing procedure that allows the network to learn both global and local features from the frequency-domain representations of the image blocks. The proposed method proved competitive with well-known DNN architectures in the selected experiments, with the advantage of being a simpler, lightweight model. This work also raises the discussion of how state-of-the-art DNN architectures can exploit not only spatial features but also frequency information to improve their performance when solving real-world problems.
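The two ideas named in the description, slicing an image into blocks and weighting each block's Fourier spectrum with a filter, can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names `slice_blocks` and `frequency_features`, the use of non-overlapping blocks, the use of the magnitude spectrum, and the single shared filter are all assumptions made for illustration. In a trainable model the `filters` array would be a learned parameter rather than a fixed input.

```python
import numpy as np

def slice_blocks(image, block):
    """Split a 2-D image into non-overlapping block x block tiles.

    Assumption: ragged borders are cropped; the record does not say
    how the paper handles image sizes that are not multiples of `block`.
    """
    h, w = image.shape
    h, w = h - h % block, w - w % block
    tiles = image[:h, :w].reshape(h // block, block, w // block, block)
    # Reorder axes to (tile_row, tile_col, y, x), then flatten the tile grid.
    return tiles.transpose(0, 2, 1, 3).reshape(-1, block, block)

def frequency_features(image, filters, block):
    """Weight the magnitude spectrum of each tile by `filters`.

    `filters` has shape (block, block) and is applied element-wise to
    every tile (hypothetical choice: one filter shared by all tiles).
    Returns an array of shape (num_tiles, block * block).
    """
    tiles = slice_blocks(image, block)
    spectrum = np.abs(np.fft.fft2(tiles))  # 2-D FFT over the last two axes
    return (spectrum * filters).reshape(len(tiles), -1)

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    image = rng.random((32, 32))
    filters = np.ones((8, 8))  # identity weighting; a model would learn these
    features = frequency_features(image, filters, block=8)
    print(features.shape)
```

Setting `block` equal to the image size yields a single global spectrum, while smaller blocks yield local ones, which is one way to read the description's "global and local features".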
format	article
fullrecord	<record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_2418898607</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2418898607</sourcerecordid><originalsourceid>FETCH-proquest_journals_24188986073</originalsourceid><addsrcrecordid>eNpjYuA0MjY21LUwMTLiYOAtLs4yMDAwMjM3MjU15mTQditKLSxNzUuuVMhJTSzKy8xLV0jLL1LIzE1MT1VIzkksLs5My0xOLMnMz-NhYE1LzClO5YXS3AzKbq4hzh66BUX5QDOKS-Kz8kuL8oBS8UYmhhYWlhZmBubGxKkCAHe5MbI</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2418898607</pqid></control><display><type>article</type><title>Frequency learning for image classification</title><source>Publicly Available Content Database</source><creator>Stuchi, José Augusto ; Levy Boccato ; Attux, Romis</creator><creatorcontrib>Stuchi, José Augusto ; Levy Boccato ; Attux, Romis</creatorcontrib><description>Machine learning applied to computer vision and signal processing is achieving results comparable to the human brain on specific tasks due to the great improvements brought by the deep neural networks (DNN). The majority of state-of-the-art architectures nowadays are DNN related, but only a few explore the frequency domain to extract useful information and improve the results, like in the image processing field. In this context, this paper presents a new approach for exploring the Fourier transform of the input images, which is composed of trainable frequency filters that boost discriminative components in the spectrum. Additionally, we propose a slicing procedure to allow the network to learn both global and local features from the frequency-domain representations of the image blocks. The proposed method proved to be competitive with respect to well-known DNN architectures in the selected experiments, with the advantage of being a simpler and lightweight model. 
This work also raises the discussion on how the state-of-the-art DNNs architectures can exploit not only spatial features, but also the frequency, in order to improve its performance when solving real world problems.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Artificial neural networks ; Computer vision ; Fourier transforms ; Frequency domain analysis ; Frequency filters ; Image classification ; Image processing ; Machine learning ; Signal processing ; Slicing</subject><ispartof>arXiv.org, 2020-06</ispartof><rights>2020. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.proquest.com/docview/2418898607?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>780,784,25752,37011,44589</link.rule.ids></links><search><creatorcontrib>Stuchi, José Augusto</creatorcontrib><creatorcontrib>Levy Boccato</creatorcontrib><creatorcontrib>Attux, Romis</creatorcontrib><title>Frequency learning for image classification</title><title>arXiv.org</title><description>Machine learning applied to computer vision and signal processing is achieving results comparable to the human brain on specific tasks due to the great improvements brought by the deep neural networks (DNN). The majority of state-of-the-art architectures nowadays are DNN related, but only a few explore the frequency domain to extract useful information and improve the results, like in the image processing field. 
In this context, this paper presents a new approach for exploring the Fourier transform of the input images, which is composed of trainable frequency filters that boost discriminative components in the spectrum. Additionally, we propose a slicing procedure to allow the network to learn both global and local features from the frequency-domain representations of the image blocks. The proposed method proved to be competitive with respect to well-known DNN architectures in the selected experiments, with the advantage of being a simpler and lightweight model. This work also raises the discussion on how the state-of-the-art DNNs architectures can exploit not only spatial features, but also the frequency, in order to improve its performance when solving real world problems.</description><subject>Artificial neural networks</subject><subject>Computer vision</subject><subject>Fourier transforms</subject><subject>Frequency domain analysis</subject><subject>Frequency filters</subject><subject>Image classification</subject><subject>Image processing</subject><subject>Machine learning</subject><subject>Signal processing</subject><subject>Slicing</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid>PIMPY</sourceid><recordid>eNpjYuA0MjY21LUwMTLiYOAtLs4yMDAwMjM3MjU15mTQditKLSxNzUuuVMhJTSzKy8xLV0jLL1LIzE1MT1VIzkksLs5My0xOLMnMz-NhYE1LzClO5YXS3AzKbq4hzh66BUX5QDOKS-Kz8kuL8oBS8UYmhhYWlhZmBubGxKkCAHe5MbI</recordid><startdate>20200628</startdate><enddate>20200628</enddate><creator>Stuchi, José Augusto</creator><creator>Levy Boccato</creator><creator>Attux, Romis</creator><general>Cornell University Library, 
arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20200628</creationdate><title>Frequency learning for image classification</title><author>Stuchi, José Augusto ; Levy Boccato ; Attux, Romis</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_24188986073</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Artificial neural networks</topic><topic>Computer vision</topic><topic>Fourier transforms</topic><topic>Frequency domain analysis</topic><topic>Frequency filters</topic><topic>Image classification</topic><topic>Image processing</topic><topic>Machine learning</topic><topic>Signal processing</topic><topic>Slicing</topic><toplevel>online_resources</toplevel><creatorcontrib>Stuchi, José Augusto</creatorcontrib><creatorcontrib>Levy Boccato</creatorcontrib><creatorcontrib>Attux, Romis</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content 
Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Stuchi, José Augusto</au><au>Levy Boccato</au><au>Attux, Romis</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>Frequency learning for image classification</atitle><jtitle>arXiv.org</jtitle><date>2020-06-28</date><risdate>2020</risdate><eissn>2331-8422</eissn><abstract>Machine learning applied to computer vision and signal processing is achieving results comparable to the human brain on specific tasks due to the great improvements brought by the deep neural networks (DNN). The majority of state-of-the-art architectures nowadays are DNN related, but only a few explore the frequency domain to extract useful information and improve the results, like in the image processing field. In this context, this paper presents a new approach for exploring the Fourier transform of the input images, which is composed of trainable frequency filters that boost discriminative components in the spectrum. Additionally, we propose a slicing procedure to allow the network to learn both global and local features from the frequency-domain representations of the image blocks. The proposed method proved to be competitive with respect to well-known DNN architectures in the selected experiments, with the advantage of being a simpler and lightweight model. 
This work also raises the discussion on how the state-of-the-art DNNs architectures can exploit not only spatial features, but also the frequency, in order to improve its performance when solving real world problems.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	EISSN: 2331-8422
ispartof	arXiv.org, 2020-06
issn	2331-8422
language	eng
recordid	cdi_proquest_journals_2418898607
source	Publicly Available Content Database
subjects	Artificial neural networks ; Computer vision ; Fourier transforms ; Frequency domain analysis ; Frequency filters ; Image classification ; Image processing ; Machine learning ; Signal processing ; Slicing
title	Frequency learning for image classification
url	http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-10T08%3A50%3A22IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=Frequency%20learning%20for%20image%20classification&rft.jtitle=arXiv.org&rft.au=Stuchi,%20Jos%C3%A9%20Augusto&rft.date=2020-06-28&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E2418898607%3C/proquest%3E%3Cgrp_id%3Ecdi_FETCH-proquest_journals_24188986073%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2418898607&rft_id=info:pmid/&rfr_iscdi=true