WalkIm: Compact image-based encoding for high-performance classification of biological sequences using simple tuning-free CNNs

The classification of biological sequences is an open issue for a variety of data sets, such as viral and metagenomics sequences. Therefore, many studies utilize neural network tools, as the well-known methods in this field, and focus on designing customized network structures. However, a few works...

Bibliographic Details
Published in: PloS one 2022-04, Vol.17 (4), p.e0267106-e0267106
Main Authors: Akbari Rokn Abadi, Saeedeh, Mohammadi, Amirhossein, Koohi, Somayyeh
Format: Article
Language: English
cited_by cdi_FETCH-LOGICAL-c622t-f8a5d1d6db128775a76b43c12c8a3c612f6810eb48a5c2a3829e7839c85b2e7c3
cites cdi_FETCH-LOGICAL-c622t-f8a5d1d6db128775a76b43c12c8a3c612f6810eb48a5c2a3829e7839c85b2e7c3
container_end_page e0267106
container_issue 4
container_start_page e0267106
container_title PloS one
container_volume 17
creator Akbari Rokn Abadi, Saeedeh
Mohammadi, Amirhossein
Koohi, Somayyeh
description The classification of biological sequences is an open issue for a variety of data sets, such as viral and metagenomics sequences. Therefore, many studies utilize neural network tools, as the well-known methods in this field, and focus on designing customized network structures. However, few works focus on more effective factors, such as the input encoding method or implementation technology, to address accuracy and efficiency issues in this area. Therefore, in this work, we propose an image-based encoding method, called WalkIm, whose adoption, even in a simple neural network, provides competitive accuracy and superior efficiency compared to the existing classification methods (e.g. VGDC, CASTOR, and DLM-CNN) for a variety of biological sequences. When WalkIm is used for classifying various data sets (i.e. virus whole-genome data, metagenomics read data, and metabarcoding data), it achieves the same performance as the existing methods, with no enforcement of parameter initialization or network architecture adjustment for each data set. It is worth noting that even in the case of classifying high-mutant data sets, such as Coronaviruses, it achieves almost 100% accuracy for classifying their various types. In addition, WalkIm achieves high-speed convergence during network training, as well as a reduction of network complexity. Therefore, the WalkIm method enables us to execute the classifying neural networks on a normal desktop system in a short time interval. Moreover, we addressed the compatibility of the WalkIm encoding method with free-space optical processing technology. Taking advantage of an optical implementation of convolutional layers, we illustrated that the training time can be reduced by up to 500 times. In addition to all the aforementioned advantages, this encoding method preserves the structure of generated images in various modes of sequence transformation, such as reverse complement, complement, and reverse modes.
doi_str_mv 10.1371/journal.pone.0267106
format article
fullrecord <record><control><sourceid>gale_plos_</sourceid><recordid>TN_cdi_plos_journals_2650397324</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><galeid>A700580255</galeid><doaj_id>oai_doaj_org_article_872178e8ac6b47569a5f3f579cf92ddb</doaj_id><sourcerecordid>A700580255</sourcerecordid><originalsourceid>FETCH-LOGICAL-c622t-f8a5d1d6db128775a76b43c12c8a3c612f6810eb48a5c2a3829e7839c85b2e7c3</originalsourceid><addsrcrecordid>eNqNk01v1DAQhiMEoqXwDxBEQkJwyBLbie1wQKpWfKxUtRKfR8txxlkXJw52guDCb8fbTasN6gH5EGf8vK_t8UySPEb5ChGGXl26yffSrgbXwyrHlKGc3kmOUUVwRnFO7h7Mj5IHIVzmeUk4pfeTI1IWmEWT4-TPN2m_b7rX6dp1g1RjajrZQlbLAE0KvXKN6dtUO59uTbvNBvBx3sleQaqsDMFoo-RoXJ86ndbGWdfGgE0D_JiiHEI6hZ1DMN1gIR2nPv5l2gOk6_Pz8DC5p6UN8Gj-niRf3r39vP6QnV2836xPzzJFMR4zzWXZoIY2NcKcsVIyWhdEIay4JIoirClHOdRF5BSWhOMKGCeV4mWNgSlykjzd-w7WBTGnLghMy5xUjOAiEps90Th5KQYf8-B_CyeNuAo43wrpR6MsCM4wYhy4VPEUrKSVLDXRJauUrnDT1NHrzbzbVHfQKOhHL-3CdLnSm61o3U9R5QiTgkeDF7OBdzGPYRSdCQqslT246erciManJDiiz_5Bb7_dTLUyXsD02sV91c5UnLJYFzzHZRmp1S1UHA10RsU60ybGF4KXC0FkRvg1tnIKQWw-ffx_9uLrkn1-wG5B2nEbnJ12hRaWYLEHlXcheNA3SUa52LXJdTbErk3E3CZR9uTwgW5E131B_gJBGw3X</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2650397324</pqid></control><display><type>article</type><title>WalkIm: Compact image-based encoding for high-performance classification of biological sequences using simple tuning-free CNNs</title><source>Publicly Available Content (ProQuest)</source><source>PubMed Central</source><source>Coronavirus Research Database</source><creator>Akbari Rokn Abadi, Saeedeh ; Mohammadi, Amirhossein ; Koohi, Somayyeh</creator><contributor>Le, Nguyen Quoc Khanh</contributor><creatorcontrib>Akbari Rokn Abadi, Saeedeh ; Mohammadi, Amirhossein ; Koohi, Somayyeh ; Le, Nguyen Quoc Khanh</creatorcontrib><description>The classification of biological sequences is an open issue for a variety of data sets, such as viral and metagenomics sequences. 
Therefore, many studies utilize neural network tools, as the well-known methods in this field, and focus on designing customized network structures. However, few works focus on more effective factors, such as the input encoding method or implementation technology, to address accuracy and efficiency issues in this area. Therefore, in this work, we propose an image-based encoding method, called WalkIm, whose adoption, even in a simple neural network, provides competitive accuracy and superior efficiency compared to the existing classification methods (e.g. VGDC, CASTOR, and DLM-CNN) for a variety of biological sequences. When WalkIm is used for classifying various data sets (i.e. virus whole-genome data, metagenomics read data, and metabarcoding data), it achieves the same performance as the existing methods, with no enforcement of parameter initialization or network architecture adjustment for each data set. It is worth noting that even in the case of classifying high-mutant data sets, such as Coronaviruses, it achieves almost 100% accuracy for classifying their various types. In addition, WalkIm achieves high-speed convergence during network training, as well as a reduction of network complexity. Therefore, the WalkIm method enables us to execute the classifying neural networks on a normal desktop system in a short time interval. Moreover, we addressed the compatibility of the WalkIm encoding method with free-space optical processing technology. Taking advantage of an optical implementation of convolutional layers, we illustrated that the training time can be reduced by up to 500 times. 
In addition to all aforementioned advantages, this encoding method preserves the structure of generated images in various modes of sequence transformation, such as reverse complement, complement, and reverse modes.</description><identifier>ISSN: 1932-6203</identifier><identifier>EISSN: 1932-6203</identifier><identifier>DOI: 10.1371/journal.pone.0267106</identifier><identifier>PMID: 35427371</identifier><language>eng</language><publisher>United States: Public Library of Science</publisher><subject>Binding sites ; Biology ; Biology and Life Sciences ; Classification ; Computer and Information Sciences ; Computer architecture ; Computer engineering ; Coronaviruses ; Data Collection ; Datasets ; Disease ; Energy consumption ; Genetic aspects ; Genomes ; Genomics ; Identification and classification ; Image classification ; Medicine and health sciences ; Metagenomics ; Methods ; Mutation ; Neural networks ; Neural Networks, Computer ; Optical data processing ; Proteins ; Research and Analysis Methods ; Research Design ; Technology ; Training ; Viruses</subject><ispartof>PloS one, 2022-04, Vol.17 (4), p.e0267106-e0267106</ispartof><rights>COPYRIGHT 2022 Public Library of Science</rights><rights>2022 Akbari Rokn Abadi et al. This is an open access article distributed under the terms of the Creative Commons Attribution License: http://creativecommons.org/licenses/by/4.0/ (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. 
Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><rights>2022 Akbari Rokn Abadi et al 2022 Akbari Rokn Abadi et al</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c622t-f8a5d1d6db128775a76b43c12c8a3c612f6810eb48a5c2a3829e7839c85b2e7c3</citedby><cites>FETCH-LOGICAL-c622t-f8a5d1d6db128775a76b43c12c8a3c612f6810eb48a5c2a3829e7839c85b2e7c3</cites><orcidid>0000-0002-3105-2511 ; 0000-0002-5040-1940</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.proquest.com/docview/2650397324/fulltextPDF?pq-origsite=primo$$EPDF$$P50$$Gproquest$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.proquest.com/docview/2650397324?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>230,314,727,780,784,885,25753,27924,27925,37012,37013,38516,43895,44590,53791,53793,74412,75126</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/35427371$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><contributor>Le, Nguyen Quoc Khanh</contributor><creatorcontrib>Akbari Rokn Abadi, Saeedeh</creatorcontrib><creatorcontrib>Mohammadi, Amirhossein</creatorcontrib><creatorcontrib>Koohi, Somayyeh</creatorcontrib><title>WalkIm: Compact image-based encoding for high-performance classification of biological sequences using simple tuning-free CNNs</title><title>PloS one</title><addtitle>PLoS One</addtitle><description>The classification of biological sequences is an open issue for a variety of data sets, such as viral and metagenomics sequences. Therefore, many studies utilize neural network tools, as the well-known methods in this field, and focus on designing customized network structures. 
However, few works focus on more effective factors, such as the input encoding method or implementation technology, to address accuracy and efficiency issues in this area. Therefore, in this work, we propose an image-based encoding method, called WalkIm, whose adoption, even in a simple neural network, provides competitive accuracy and superior efficiency compared to the existing classification methods (e.g. VGDC, CASTOR, and DLM-CNN) for a variety of biological sequences. When WalkIm is used for classifying various data sets (i.e. virus whole-genome data, metagenomics read data, and metabarcoding data), it achieves the same performance as the existing methods, with no enforcement of parameter initialization or network architecture adjustment for each data set. It is worth noting that even in the case of classifying high-mutant data sets, such as Coronaviruses, it achieves almost 100% accuracy for classifying their various types. In addition, WalkIm achieves high-speed convergence during network training, as well as a reduction of network complexity. Therefore, the WalkIm method enables us to execute the classifying neural networks on a normal desktop system in a short time interval. Moreover, we addressed the compatibility of the WalkIm encoding method with free-space optical processing technology. Taking advantage of an optical implementation of convolutional layers, we illustrated that the training time can be reduced by up to 500 times. 
In addition to all aforementioned advantages, this encoding method preserves the structure of generated images in various modes of sequence transformation, such as reverse complement, complement, and reverse modes.</description><subject>Binding sites</subject><subject>Biology</subject><subject>Biology and Life Sciences</subject><subject>Classification</subject><subject>Computer and Information Sciences</subject><subject>Computer architecture</subject><subject>Computer engineering</subject><subject>Coronaviruses</subject><subject>Data Collection</subject><subject>Datasets</subject><subject>Disease</subject><subject>Energy consumption</subject><subject>Genetic aspects</subject><subject>Genomes</subject><subject>Genomics</subject><subject>Identification and classification</subject><subject>Image classification</subject><subject>Medicine and health sciences</subject><subject>Metagenomics</subject><subject>Methods</subject><subject>Mutation</subject><subject>Neural networks</subject><subject>Neural Networks, Computer</subject><subject>Optical data processing</subject><subject>Proteins</subject><subject>Research and Analysis Methods</subject><subject>Research 
Design</subject><subject>Technology</subject><subject>Training</subject><subject>Viruses</subject><issn>1932-6203</issn><issn>1932-6203</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>COVID</sourceid><sourceid>PIMPY</sourceid><sourceid>DOA</sourceid><recordid>eNqNk01v1DAQhiMEoqXwDxBEQkJwyBLbie1wQKpWfKxUtRKfR8txxlkXJw52guDCb8fbTasN6gH5EGf8vK_t8UySPEb5ChGGXl26yffSrgbXwyrHlKGc3kmOUUVwRnFO7h7Mj5IHIVzmeUk4pfeTI1IWmEWT4-TPN2m_b7rX6dp1g1RjajrZQlbLAE0KvXKN6dtUO59uTbvNBvBx3sleQaqsDMFoo-RoXJ86ndbGWdfGgE0D_JiiHEI6hZ1DMN1gIR2nPv5l2gOk6_Pz8DC5p6UN8Gj-niRf3r39vP6QnV2836xPzzJFMR4zzWXZoIY2NcKcsVIyWhdEIay4JIoirClHOdRF5BSWhOMKGCeV4mWNgSlykjzd-w7WBTGnLghMy5xUjOAiEps90Th5KQYf8-B_CyeNuAo43wrpR6MsCM4wYhy4VPEUrKSVLDXRJauUrnDT1NHrzbzbVHfQKOhHL-3CdLnSm61o3U9R5QiTgkeDF7OBdzGPYRSdCQqslT246erciManJDiiz_5Bb7_dTLUyXsD02sV91c5UnLJYFzzHZRmp1S1UHA10RsU60ybGF4KXC0FkRvg1tnIKQWw-ffx_9uLrkn1-wG5B2nEbnJ12hRaWYLEHlXcheNA3SUa52LXJdTbErk3E3CZR9uTwgW5E131B_gJBGw3X</recordid><startdate>20220415</startdate><enddate>20220415</enddate><creator>Akbari Rokn Abadi, Saeedeh</creator><creator>Mohammadi, Amirhossein</creator><creator>Koohi, Somayyeh</creator><general>Public Library of Science</general><general>Public Library of Science 
(PLoS)</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>IOV</scope><scope>ISR</scope><scope>3V.</scope><scope>7QG</scope><scope>7QL</scope><scope>7QO</scope><scope>7RV</scope><scope>7SN</scope><scope>7SS</scope><scope>7T5</scope><scope>7TG</scope><scope>7TM</scope><scope>7U9</scope><scope>7X2</scope><scope>7X7</scope><scope>7XB</scope><scope>88E</scope><scope>8AO</scope><scope>8C1</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FH</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>ATCPS</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>BHPHI</scope><scope>C1K</scope><scope>CCPQU</scope><scope>COVID</scope><scope>D1I</scope><scope>DWQXO</scope><scope>FR3</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>H94</scope><scope>HCIFZ</scope><scope>K9.</scope><scope>KB.</scope><scope>KB0</scope><scope>KL.</scope><scope>L6V</scope><scope>LK8</scope><scope>M0K</scope><scope>M0S</scope><scope>M1P</scope><scope>M7N</scope><scope>M7P</scope><scope>M7S</scope><scope>NAPCQ</scope><scope>P5Z</scope><scope>P62</scope><scope>P64</scope><scope>PATMY</scope><scope>PDBOC</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope><scope>PYCSY</scope><scope>RC3</scope><scope>7X8</scope><scope>5PM</scope><scope>DOA</scope><orcidid>https://orcid.org/0000-0002-3105-2511</orcidid><orcidid>https://orcid.org/0000-0002-5040-1940</orcidid></search><sort><creationdate>20220415</creationdate><title>WalkIm: Compact image-based encoding for high-performance classification of biological sequences using simple tuning-free CNNs</title><author>Akbari Rokn Abadi, Saeedeh ; Mohammadi, Amirhossein ; Koohi, 
Somayyeh</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c622t-f8a5d1d6db128775a76b43c12c8a3c612f6810eb48a5c2a3829e7839c85b2e7c3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Binding sites</topic><topic>Biology</topic><topic>Biology and Life Sciences</topic><topic>Classification</topic><topic>Computer and Information Sciences</topic><topic>Computer architecture</topic><topic>Computer engineering</topic><topic>Coronaviruses</topic><topic>Data Collection</topic><topic>Datasets</topic><topic>Disease</topic><topic>Energy consumption</topic><topic>Genetic aspects</topic><topic>Genomes</topic><topic>Genomics</topic><topic>Identification and classification</topic><topic>Image classification</topic><topic>Medicine and health sciences</topic><topic>Metagenomics</topic><topic>Methods</topic><topic>Mutation</topic><topic>Neural networks</topic><topic>Neural Networks, Computer</topic><topic>Optical data processing</topic><topic>Proteins</topic><topic>Research and Analysis Methods</topic><topic>Research Design</topic><topic>Technology</topic><topic>Training</topic><topic>Viruses</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Akbari Rokn Abadi, Saeedeh</creatorcontrib><creatorcontrib>Mohammadi, Amirhossein</creatorcontrib><creatorcontrib>Koohi, Somayyeh</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Opposing Viewpoints In Context</collection><collection>Gale In Context: Science</collection><collection>ProQuest Central (Corporate)</collection><collection>Animal Behavior Abstracts</collection><collection>Bacteriology Abstracts (Microbiology B)</collection><collection>Biotechnology Research 
Abstracts</collection><collection>Nursing &amp; Allied Health Database</collection><collection>Ecology Abstracts</collection><collection>Entomology Abstracts (Full archive)</collection><collection>Immunology Abstracts</collection><collection>Meteorological &amp; Geoastrophysical Abstracts</collection><collection>Nucleic Acids Abstracts</collection><collection>Virology and AIDS Abstracts</collection><collection>Agricultural Science Collection</collection><collection>PHMC-Proquest健康医学期刊库</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Medical Database (Alumni Edition)</collection><collection>ProQuest Pharma Collection</collection><collection>Public Health Database</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central</collection><collection>Advanced Technologies &amp; Aerospace Database‎ (1962 - current)</collection><collection>Agricultural &amp; Environmental Science Collection</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science Collection</collection><collection>AUTh Library subscriptions: ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Environmental Sciences and Pollution Management</collection><collection>ProQuest One Community College</collection><collection>Coronavirus Research Database</collection><collection>ProQuest Materials Science 
Collection</collection><collection>ProQuest Central</collection><collection>Engineering Research Database</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>AIDS and Cancer Research Abstracts</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Health &amp; Medical Complete (Alumni)</collection><collection>Materials Science Database</collection><collection>Nursing &amp; Allied Health Database (Alumni Edition)</collection><collection>Meteorological &amp; Geoastrophysical Abstracts - Academic</collection><collection>ProQuest Engineering Collection</collection><collection>ProQuest Biological Science Collection</collection><collection>Agriculture Science Database</collection><collection>Health &amp; Medical Collection (Alumni Edition)</collection><collection>Medical Database</collection><collection>Algology Mycology and Protozoology Abstracts (Microbiology C)</collection><collection>ProQuest Biological Science Journals</collection><collection>Engineering Database</collection><collection>Nursing &amp; Allied Health Premium</collection><collection>ProQuest advanced technologies &amp; aerospace journals</collection><collection>ProQuest Advanced Technologies &amp; Aerospace Collection</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Environmental Science Database</collection><collection>Materials science collection</collection><collection>Publicly Available Content (ProQuest)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering collection</collection><collection>Environmental Science Collection</collection><collection>Genetics 
Abstracts</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>PloS one</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Akbari Rokn Abadi, Saeedeh</au><au>Mohammadi, Amirhossein</au><au>Koohi, Somayyeh</au><au>Le, Nguyen Quoc Khanh</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>WalkIm: Compact image-based encoding for high-performance classification of biological sequences using simple tuning-free CNNs</atitle><jtitle>PloS one</jtitle><addtitle>PLoS One</addtitle><date>2022-04-15</date><risdate>2022</risdate><volume>17</volume><issue>4</issue><spage>e0267106</spage><epage>e0267106</epage><pages>e0267106-e0267106</pages><issn>1932-6203</issn><eissn>1932-6203</eissn><abstract>The classification of biological sequences is an open issue for a variety of data sets, such as viral and metagenomics sequences. Therefore, many studies utilize neural network tools, as the well-known methods in this field, and focus on designing customized network structures. However, a few works focus on more effective factors, such as input encoding method or implementation technology, to address accuracy and efficiency issues in this area. Therefore, in this work, we propose an image-based encoding method, called as WalkIm, whose adoption, even in a simple neural network, provides competitive accuracy and superior efficiency, compared to the existing classification methods (e.g. VGDC, CASTOR, and DLM-CNN) for a variety of biological sequences. Using WalkIm for classifying various data sets (i.e. viruses whole-genome data, metagenomics read data, and metabarcoding data), it achieves the same performance as the existing methods, with no enforcement of parameter initialization or network architecture adjustment for each data set. 
It is worth noting that even in the case of classifying high-mutant data sets, such as Coronaviruses, it achieves almost 100% accuracy for classifying their various types. In addition, WalkIm achieves high-speed convergence during network training, as well as a reduction of network complexity. Therefore, the WalkIm method enables us to execute the classifying neural networks on a normal desktop system in a short time interval. Moreover, we addressed the compatibility of the WalkIm encoding method with free-space optical processing technology. Taking advantage of an optical implementation of convolutional layers, we illustrated that the training time can be reduced by up to 500 times. In addition to all the aforementioned advantages, this encoding method preserves the structure of generated images in various modes of sequence transformation, such as reverse complement, complement, and reverse modes.</abstract><cop>United States</cop><pub>Public Library of Science</pub><pmid>35427371</pmid><doi>10.1371/journal.pone.0267106</doi><tpages>e0267106</tpages><orcidid>https://orcid.org/0000-0002-3105-2511</orcidid><orcidid>https://orcid.org/0000-0002-5040-1940</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1932-6203
ispartof PloS one, 2022-04, Vol.17 (4), p.e0267106-e0267106
issn 1932-6203
1932-6203
language eng
recordid cdi_plos_journals_2650397324
source Publicly Available Content (ProQuest); PubMed Central; Coronavirus Research Database
subjects Binding sites
Biology
Biology and Life Sciences
Classification
Computer and Information Sciences
Computer architecture
Computer engineering
Coronaviruses
Data Collection
Datasets
Disease
Energy consumption
Genetic aspects
Genomes
Genomics
Identification and classification
Image classification
Medicine and health sciences
Metagenomics
Methods
Mutation
Neural networks
Neural Networks, Computer
Optical data processing
Proteins
Research and Analysis Methods
Research Design
Technology
Training
Viruses
title WalkIm: Compact image-based encoding for high-performance classification of biological sequences using simple tuning-free CNNs
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-02T15%3A45%3A23IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-gale_plos_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=WalkIm:%20Compact%20image-based%20encoding%20for%20high-performance%20classification%20of%20biological%20sequences%20using%20simple%20tuning-free%20CNNs&rft.jtitle=PloS%20one&rft.au=Akbari%20Rokn%20Abadi,%20Saeedeh&rft.date=2022-04-15&rft.volume=17&rft.issue=4&rft.spage=e0267106&rft.epage=e0267106&rft.pages=e0267106-e0267106&rft.issn=1932-6203&rft.eissn=1932-6203&rft_id=info:doi/10.1371/journal.pone.0267106&rft_dat=%3Cgale_plos_%3EA700580255%3C/gale_plos_%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c622t-f8a5d1d6db128775a76b43c12c8a3c612f6810eb48a5c2a3829e7839c85b2e7c3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2650397324&rft_id=info:pmid/35427371&rft_galeid=A700580255&rfr_iscdi=true
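note The description above states that the encoding preserves the structure of the generated images under reverse, complement, and reverse-complement transformations. This record does not give the WalkIm algorithm itself, so the following is only a minimal Python sketch of a generic two-dimensional "DNA walk", not the authors' method. The step table `STEPS` and the helper names `walk` and `to_image` are assumptions made for this illustration. Under the assumed step table, where complementary bases take opposite steps, the reverse complement yields the same image, while the reverse and the complement each yield the image rotated by 180 degrees.

```python
# Illustrative sketch only: a generic 2-D DNA walk, NOT the published WalkIm encoding.
# STEPS is an assumed mapping in which complementary bases move in opposite directions.
STEPS = {"A": (0, 1), "T": (0, -1), "C": (-1, 0), "G": (1, 0)}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def complement(seq):
    """Swap each base for its Watson-Crick partner (A<->T, C<->G)."""
    return seq.translate(COMPLEMENT)


def reverse_complement(seq):
    """Complement the sequence, then read it backwards."""
    return complement(seq)[::-1]


def walk(seq):
    """Return the list of lattice points visited, starting at the origin."""
    x, y = 0, 0
    points = [(x, y)]
    for base in seq:
        dx, dy = STEPS.get(base, (0, 0))  # unknown symbols (e.g. N) do not move
        x, y = x + dx, y + dy
        points.append((x, y))
    return points


def to_image(points):
    """Rasterize the walk into its bounding box; each cell counts visits."""
    xs = [p[0] for p in points]
    ys = [p[1] for p in points]
    min_x, min_y = min(xs), min(ys)
    width = max(xs) - min_x + 1
    height = max(ys) - min_y + 1
    image = [[0] * width for _ in range(height)]
    for x, y in points:
        image[y - min_y][x - min_x] += 1
    return image


if __name__ == "__main__":
    seq = "AACGTGC"  # hypothetical toy sequence
    original = to_image(walk(seq))
    # Reverse complement only translates the walk, so the cropped image is unchanged.
    print(to_image(walk(reverse_complement(seq))) == original)
    # Complement and reverse each rotate the cropped image by 180 degrees.
    rotated = [row[::-1] for row in original[::-1]]
    print(to_image(walk(complement(seq))) == rotated)
    print(to_image(walk(seq[::-1])) == rotated)
```

Cropping the image to the bounding box of the walk is what makes the comparison independent of where the transformed walk starts; a fixed-size canvas would need an explicit alignment step instead.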