Loading…
WalkIm: Compact image-based encoding for high-performance classification of biological sequences using simple tuning-free CNNs
The classification of biological sequences is an open issue for a variety of data sets, such as viral and metagenomics sequences. Therefore, many studies utilize neural network tools, as the well-known methods in this field, and focus on designing customized network structures. However, a few works...
Saved in:
Published in: | PloS one 2022-04, Vol.17 (4), p.e0267106-e0267106 |
---|---|
Main Authors: | , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | cdi_FETCH-LOGICAL-c622t-f8a5d1d6db128775a76b43c12c8a3c612f6810eb48a5c2a3829e7839c85b2e7c3 |
---|---|
cites | cdi_FETCH-LOGICAL-c622t-f8a5d1d6db128775a76b43c12c8a3c612f6810eb48a5c2a3829e7839c85b2e7c3 |
container_end_page | e0267106 |
container_issue | 4 |
container_start_page | e0267106 |
container_title | PloS one |
container_volume | 17 |
creator | Akbari Rokn Abadi, Saeedeh Mohammadi, Amirhossein Koohi, Somayyeh |
description | The classification of biological sequences is an open issue for a variety of data sets, such as viral and metagenomics sequences. Therefore, many studies utilize neural network tools, as the well-known methods in this field, and focus on designing customized network structures. However, a few works focus on more effective factors, such as input encoding method or implementation technology, to address accuracy and efficiency issues in this area. Therefore, in this work, we propose an image-based encoding method, called as WalkIm, whose adoption, even in a simple neural network, provides competitive accuracy and superior efficiency, compared to the existing classification methods (e.g. VGDC, CASTOR, and DLM-CNN) for a variety of biological sequences. Using WalkIm for classifying various data sets (i.e. viruses whole-genome data, metagenomics read data, and metabarcoding data), it achieves the same performance as the existing methods, with no enforcement of parameter initialization or network architecture adjustment for each data set. It is worth noting that even in the case of classifying high-mutant data sets, such as Coronaviruses, it achieves almost 100% accuracy for classifying its various types. In addition, WalkIm achieves high-speed convergence during network training, as well as reduction of network complexity. Therefore WalkIm method enables us to execute the classifying neural networks on a normal desktop system in a short time interval. Moreover, we addressed the compatibility of WalkIm encoding method with free-space optical processing technology. Taking advantages of optical implementation of convolutional layers, we illustrated that the training time can be reduced by up to 500 time. In addition to all aforementioned advantages, this encoding method preserves the structure of generated images in various modes of sequence transformation, such as reverse complement, complement, and reverse modes. |
doi_str_mv | 10.1371/journal.pone.0267106 |
format | article |
fullrecord | <record><control><sourceid>gale_plos_</sourceid><recordid>TN_cdi_plos_journals_2650397324</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><galeid>A700580255</galeid><doaj_id>oai_doaj_org_article_872178e8ac6b47569a5f3f579cf92ddb</doaj_id><sourcerecordid>A700580255</sourcerecordid><originalsourceid>FETCH-LOGICAL-c622t-f8a5d1d6db128775a76b43c12c8a3c612f6810eb48a5c2a3829e7839c85b2e7c3</originalsourceid><addsrcrecordid>eNqNk01v1DAQhiMEoqXwDxBEQkJwyBLbie1wQKpWfKxUtRKfR8txxlkXJw52guDCb8fbTasN6gH5EGf8vK_t8UySPEb5ChGGXl26yffSrgbXwyrHlKGc3kmOUUVwRnFO7h7Mj5IHIVzmeUk4pfeTI1IWmEWT4-TPN2m_b7rX6dp1g1RjajrZQlbLAE0KvXKN6dtUO59uTbvNBvBx3sleQaqsDMFoo-RoXJ86ndbGWdfGgE0D_JiiHEI6hZ1DMN1gIR2nPv5l2gOk6_Pz8DC5p6UN8Gj-niRf3r39vP6QnV2836xPzzJFMR4zzWXZoIY2NcKcsVIyWhdEIay4JIoirClHOdRF5BSWhOMKGCeV4mWNgSlykjzd-w7WBTGnLghMy5xUjOAiEps90Th5KQYf8-B_CyeNuAo43wrpR6MsCM4wYhy4VPEUrKSVLDXRJauUrnDT1NHrzbzbVHfQKOhHL-3CdLnSm61o3U9R5QiTgkeDF7OBdzGPYRSdCQqslT246erciManJDiiz_5Bb7_dTLUyXsD02sV91c5UnLJYFzzHZRmp1S1UHA10RsU60ybGF4KXC0FkRvg1tnIKQWw-ffx_9uLrkn1-wG5B2nEbnJ12hRaWYLEHlXcheNA3SUa52LXJdTbErk3E3CZR9uTwgW5E131B_gJBGw3X</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2650397324</pqid></control><display><type>article</type><title>WalkIm: Compact image-based encoding for high-performance classification of biological sequences using simple tuning-free CNNs</title><source>Publicly Available Content (ProQuest)</source><source>PubMed Central</source><source>Coronavirus Research Database</source><creator>Akbari Rokn Abadi, Saeedeh ; Mohammadi, Amirhossein ; Koohi, Somayyeh</creator><contributor>Le, Nguyen Quoc Khanh</contributor><creatorcontrib>Akbari Rokn Abadi, Saeedeh ; Mohammadi, Amirhossein ; Koohi, Somayyeh ; Le, Nguyen Quoc Khanh</creatorcontrib><description>The classification of biological sequences is an open issue for a variety of data sets, such as viral and metagenomics sequences. Therefore, many studies utilize neural network tools, as the well-known methods in this field, and focus on designing customized network structures. However, a few works focus on more effective factors, such as input encoding method or implementation technology, to address accuracy and efficiency issues in this area. Therefore, in this work, we propose an image-based encoding method, called as WalkIm, whose adoption, even in a simple neural network, provides competitive accuracy and superior efficiency, compared to the existing classification methods (e.g. VGDC, CASTOR, and DLM-CNN) for a variety of biological sequences. Using WalkIm for classifying various data sets (i.e. viruses whole-genome data, metagenomics read data, and metabarcoding data), it achieves the same performance as the existing methods, with no enforcement of parameter initialization or network architecture adjustment for each data set. It is worth noting that even in the case of classifying high-mutant data sets, such as Coronaviruses, it achieves almost 100% accuracy for classifying its various types. In addition, WalkIm achieves high-speed convergence during network training, as well as reduction of network complexity. Therefore WalkIm method enables us to execute the classifying neural networks on a normal desktop system in a short time interval. Moreover, we addressed the compatibility of WalkIm encoding method with free-space optical processing technology. Taking advantages of optical implementation of convolutional layers, we illustrated that the training time can be reduced by up to 500 time. In addition to all aforementioned advantages, this encoding method preserves the structure of generated images in various modes of sequence transformation, such as reverse complement, complement, and reverse modes.</description><identifier>ISSN: 1932-6203</identifier><identifier>EISSN: 1932-6203</identifier><identifier>DOI: 10.1371/journal.pone.0267106</identifier><identifier>PMID: 35427371</identifier><language>eng</language><publisher>United States: Public Library of Science</publisher><subject>Binding sites ; Biology ; Biology and Life Sciences ; Classification ; Computer and Information Sciences ; Computer architecture ; Computer engineering ; Coronaviruses ; Data Collection ; Datasets ; Disease ; Energy consumption ; Genetic aspects ; Genomes ; Genomics ; Identification and classification ; Image classification ; Medicine and health sciences ; Metagenomics ; Methods ; Mutation ; Neural networks ; Neural Networks, Computer ; Optical data processing ; Proteins ; Research and Analysis Methods ; Research Design ; Technology ; Training ; Viruses</subject><ispartof>PloS one, 2022-04, Vol.17 (4), p.e0267106-e0267106</ispartof><rights>COPYRIGHT 2022 Public Library of Science</rights><rights>2022 Akbari Rokn Abadi et al. This is an open access article distributed under the terms of the Creative Commons Attribution License: http://creativecommons.org/licenses/by/4.0/ (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><rights>2022 Akbari Rokn Abadi et al 2022 Akbari Rokn Abadi et al</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c622t-f8a5d1d6db128775a76b43c12c8a3c612f6810eb48a5c2a3829e7839c85b2e7c3</citedby><cites>FETCH-LOGICAL-c622t-f8a5d1d6db128775a76b43c12c8a3c612f6810eb48a5c2a3829e7839c85b2e7c3</cites><orcidid>0000-0002-3105-2511 ; 0000-0002-5040-1940</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.proquest.com/docview/2650397324/fulltextPDF?pq-origsite=primo$$EPDF$$P50$$Gproquest$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.proquest.com/docview/2650397324?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>230,314,727,780,784,885,25753,27924,27925,37012,37013,38516,43895,44590,53791,53793,74412,75126</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/35427371$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><contributor>Le, Nguyen Quoc Khanh</contributor><creatorcontrib>Akbari Rokn Abadi, Saeedeh</creatorcontrib><creatorcontrib>Mohammadi, Amirhossein</creatorcontrib><creatorcontrib>Koohi, Somayyeh</creatorcontrib><title>WalkIm: Compact image-based encoding for high-performance classification of biological sequences using simple tuning-free CNNs</title><title>PloS one</title><addtitle>PLoS One</addtitle><description>The classification of biological sequences is an open issue for a variety of data sets, such as viral and metagenomics sequences. Therefore, many studies utilize neural network tools, as the well-known methods in this field, and focus on designing customized network structures. However, a few works focus on more effective factors, such as input encoding method or implementation technology, to address accuracy and efficiency issues in this area. Therefore, in this work, we propose an image-based encoding method, called as WalkIm, whose adoption, even in a simple neural network, provides competitive accuracy and superior efficiency, compared to the existing classification methods (e.g. VGDC, CASTOR, and DLM-CNN) for a variety of biological sequences. Using WalkIm for classifying various data sets (i.e. viruses whole-genome data, metagenomics read data, and metabarcoding data), it achieves the same performance as the existing methods, with no enforcement of parameter initialization or network architecture adjustment for each data set. It is worth noting that even in the case of classifying high-mutant data sets, such as Coronaviruses, it achieves almost 100% accuracy for classifying its various types. In addition, WalkIm achieves high-speed convergence during network training, as well as reduction of network complexity. Therefore WalkIm method enables us to execute the classifying neural networks on a normal desktop system in a short time interval. Moreover, we addressed the compatibility of WalkIm encoding method with free-space optical processing technology. Taking advantages of optical implementation of convolutional layers, we illustrated that the training time can be reduced by up to 500 time. In addition to all aforementioned advantages, this encoding method preserves the structure of generated images in various modes of sequence transformation, such as reverse complement, complement, and reverse modes.</description><subject>Binding sites</subject><subject>Biology</subject><subject>Biology and Life Sciences</subject><subject>Classification</subject><subject>Computer and Information Sciences</subject><subject>Computer architecture</subject><subject>Computer engineering</subject><subject>Coronaviruses</subject><subject>Data Collection</subject><subject>Datasets</subject><subject>Disease</subject><subject>Energy consumption</subject><subject>Genetic aspects</subject><subject>Genomes</subject><subject>Genomics</subject><subject>Identification and classification</subject><subject>Image classification</subject><subject>Medicine and health sciences</subject><subject>Metagenomics</subject><subject>Methods</subject><subject>Mutation</subject><subject>Neural networks</subject><subject>Neural Networks, Computer</subject><subject>Optical data processing</subject><subject>Proteins</subject><subject>Research and Analysis Methods</subject><subject>Research Design</subject><subject>Technology</subject><subject>Training</subject><subject>Viruses</subject><issn>1932-6203</issn><issn>1932-6203</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>COVID</sourceid><sourceid>PIMPY</sourceid><sourceid>DOA</sourceid><recordid>eNqNk01v1DAQhiMEoqXwDxBEQkJwyBLbie1wQKpWfKxUtRKfR8txxlkXJw52guDCb8fbTasN6gH5EGf8vK_t8UySPEb5ChGGXl26yffSrgbXwyrHlKGc3kmOUUVwRnFO7h7Mj5IHIVzmeUk4pfeTI1IWmEWT4-TPN2m_b7rX6dp1g1RjajrZQlbLAE0KvXKN6dtUO59uTbvNBvBx3sleQaqsDMFoo-RoXJ86ndbGWdfGgE0D_JiiHEI6hZ1DMN1gIR2nPv5l2gOk6_Pz8DC5p6UN8Gj-niRf3r39vP6QnV2836xPzzJFMR4zzWXZoIY2NcKcsVIyWhdEIay4JIoirClHOdRF5BSWhOMKGCeV4mWNgSlykjzd-w7WBTGnLghMy5xUjOAiEps90Th5KQYf8-B_CyeNuAo43wrpR6MsCM4wYhy4VPEUrKSVLDXRJauUrnDT1NHrzbzbVHfQKOhHL-3CdLnSm61o3U9R5QiTgkeDF7OBdzGPYRSdCQqslT246erciManJDiiz_5Bb7_dTLUyXsD02sV91c5UnLJYFzzHZRmp1S1UHA10RsU60ybGF4KXC0FkRvg1tnIKQWw-ffx_9uLrkn1-wG5B2nEbnJ12hRaWYLEHlXcheNA3SUa52LXJdTbErk3E3CZR9uTwgW5E131B_gJBGw3X</recordid><startdate>20220415</startdate><enddate>20220415</enddate><creator>Akbari Rokn Abadi, Saeedeh</creator><creator>Mohammadi, Amirhossein</creator><creator>Koohi, Somayyeh</creator><general>Public Library of Science</general><general>Public Library of Science (PLoS)</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>IOV</scope><scope>ISR</scope><scope>3V.</scope><scope>7QG</scope><scope>7QL</scope><scope>7QO</scope><scope>7RV</scope><scope>7SN</scope><scope>7SS</scope><scope>7T5</scope><scope>7TG</scope><scope>7TM</scope><scope>7U9</scope><scope>7X2</scope><scope>7X7</scope><scope>7XB</scope><scope>88E</scope><scope>8AO</scope><scope>8C1</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FH</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>ATCPS</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>BHPHI</scope><scope>C1K</scope><scope>CCPQU</scope><scope>COVID</scope><scope>D1I</scope><scope>DWQXO</scope><scope>FR3</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>H94</scope><scope>HCIFZ</scope><scope>K9.</scope><scope>KB.</scope><scope>KB0</scope><scope>KL.</scope><scope>L6V</scope><scope>LK8</scope><scope>M0K</scope><scope>M0S</scope><scope>M1P</scope><scope>M7N</scope><scope>M7P</scope><scope>M7S</scope><scope>NAPCQ</scope><scope>P5Z</scope><scope>P62</scope><scope>P64</scope><scope>PATMY</scope><scope>PDBOC</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope><scope>PYCSY</scope><scope>RC3</scope><scope>7X8</scope><scope>5PM</scope><scope>DOA</scope><orcidid>https://orcid.org/0000-0002-3105-2511</orcidid><orcidid>https://orcid.org/0000-0002-5040-1940</orcidid></search><sort><creationdate>20220415</creationdate><title>WalkIm: Compact image-based encoding for high-performance classification of biological sequences using simple tuning-free CNNs</title><author>Akbari Rokn Abadi, Saeedeh ; Mohammadi, Amirhossein ; Koohi, Somayyeh</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c622t-f8a5d1d6db128775a76b43c12c8a3c612f6810eb48a5c2a3829e7839c85b2e7c3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Binding sites</topic><topic>Biology</topic><topic>Biology and Life Sciences</topic><topic>Classification</topic><topic>Computer and Information Sciences</topic><topic>Computer architecture</topic><topic>Computer engineering</topic><topic>Coronaviruses</topic><topic>Data Collection</topic><topic>Datasets</topic><topic>Disease</topic><topic>Energy consumption</topic><topic>Genetic aspects</topic><topic>Genomes</topic><topic>Genomics</topic><topic>Identification and classification</topic><topic>Image classification</topic><topic>Medicine and health sciences</topic><topic>Metagenomics</topic><topic>Methods</topic><topic>Mutation</topic><topic>Neural networks</topic><topic>Neural Networks, Computer</topic><topic>Optical data processing</topic><topic>Proteins</topic><topic>Research and Analysis Methods</topic><topic>Research Design</topic><topic>Technology</topic><topic>Training</topic><topic>Viruses</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Akbari Rokn Abadi, Saeedeh</creatorcontrib><creatorcontrib>Mohammadi, Amirhossein</creatorcontrib><creatorcontrib>Koohi, Somayyeh</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Opposing Viewpoints In Context</collection><collection>Gale In Context: Science</collection><collection>ProQuest Central (Corporate)</collection><collection>Animal Behavior Abstracts</collection><collection>Bacteriology Abstracts (Microbiology B)</collection><collection>Biotechnology Research Abstracts</collection><collection>Nursing & Allied Health Database</collection><collection>Ecology Abstracts</collection><collection>Entomology Abstracts (Full archive)</collection><collection>Immunology Abstracts</collection><collection>Meteorological & Geoastrophysical Abstracts</collection><collection>Nucleic Acids Abstracts</collection><collection>Virology and AIDS Abstracts</collection><collection>Agricultural Science Collection</collection><collection>PHMC-Proquest健康医学期刊库</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Medical Database (Alumni Edition)</collection><collection>ProQuest Pharma Collection</collection><collection>Public Health Database</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central</collection><collection>Advanced Technologies & Aerospace Database (1962 - current)</collection><collection>Agricultural & Environmental Science Collection</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science Collection</collection><collection>AUTh Library subscriptions: ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Environmental Sciences and Pollution Management</collection><collection>ProQuest One Community College</collection><collection>Coronavirus Research Database</collection><collection>ProQuest Materials Science Collection</collection><collection>ProQuest Central</collection><collection>Engineering Research Database</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>AIDS and Cancer Research Abstracts</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Health & Medical Complete (Alumni)</collection><collection>Materials Science Database</collection><collection>Nursing & Allied Health Database (Alumni Edition)</collection><collection>Meteorological & Geoastrophysical Abstracts - Academic</collection><collection>ProQuest Engineering Collection</collection><collection>ProQuest Biological Science Collection</collection><collection>Agriculture Science Database</collection><collection>Health & Medical Collection (Alumni Edition)</collection><collection>Medical Database</collection><collection>Algology Mycology and Protozoology Abstracts (Microbiology C)</collection><collection>ProQuest Biological Science Journals</collection><collection>Engineering Database</collection><collection>Nursing & Allied Health Premium</collection><collection>ProQuest advanced technologies & aerospace journals</collection><collection>ProQuest Advanced Technologies & Aerospace Collection</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Environmental Science Database</collection><collection>Materials science collection</collection><collection>Publicly Available Content (ProQuest)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering collection</collection><collection>Environmental Science Collection</collection><collection>Genetics Abstracts</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>PloS one</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Akbari Rokn Abadi, Saeedeh</au><au>Mohammadi, Amirhossein</au><au>Koohi, Somayyeh</au><au>Le, Nguyen Quoc Khanh</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>WalkIm: Compact image-based encoding for high-performance classification of biological sequences using simple tuning-free CNNs</atitle><jtitle>PloS one</jtitle><addtitle>PLoS One</addtitle><date>2022-04-15</date><risdate>2022</risdate><volume>17</volume><issue>4</issue><spage>e0267106</spage><epage>e0267106</epage><pages>e0267106-e0267106</pages><issn>1932-6203</issn><eissn>1932-6203</eissn><abstract>The classification of biological sequences is an open issue for a variety of data sets, such as viral and metagenomics sequences. Therefore, many studies utilize neural network tools, as the well-known methods in this field, and focus on designing customized network structures. However, a few works focus on more effective factors, such as input encoding method or implementation technology, to address accuracy and efficiency issues in this area. Therefore, in this work, we propose an image-based encoding method, called as WalkIm, whose adoption, even in a simple neural network, provides competitive accuracy and superior efficiency, compared to the existing classification methods (e.g. VGDC, CASTOR, and DLM-CNN) for a variety of biological sequences. Using WalkIm for classifying various data sets (i.e. viruses whole-genome data, metagenomics read data, and metabarcoding data), it achieves the same performance as the existing methods, with no enforcement of parameter initialization or network architecture adjustment for each data set. It is worth noting that even in the case of classifying high-mutant data sets, such as Coronaviruses, it achieves almost 100% accuracy for classifying its various types. In addition, WalkIm achieves high-speed convergence during network training, as well as reduction of network complexity. Therefore WalkIm method enables us to execute the classifying neural networks on a normal desktop system in a short time interval. Moreover, we addressed the compatibility of WalkIm encoding method with free-space optical processing technology. Taking advantages of optical implementation of convolutional layers, we illustrated that the training time can be reduced by up to 500 time. In addition to all aforementioned advantages, this encoding method preserves the structure of generated images in various modes of sequence transformation, such as reverse complement, complement, and reverse modes.</abstract><cop>United States</cop><pub>Public Library of Science</pub><pmid>35427371</pmid><doi>10.1371/journal.pone.0267106</doi><tpages>e0267106</tpages><orcidid>https://orcid.org/0000-0002-3105-2511</orcidid><orcidid>https://orcid.org/0000-0002-5040-1940</orcidid><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1932-6203 |
ispartof | PloS one, 2022-04, Vol.17 (4), p.e0267106-e0267106 |
issn | 1932-6203 1932-6203 |
language | eng |
recordid | cdi_plos_journals_2650397324 |
source | Publicly Available Content (ProQuest); PubMed Central; Coronavirus Research Database |
subjects | Binding sites Biology Biology and Life Sciences Classification Computer and Information Sciences Computer architecture Computer engineering Coronaviruses Data Collection Datasets Disease Energy consumption Genetic aspects Genomes Genomics Identification and classification Image classification Medicine and health sciences Metagenomics Methods Mutation Neural networks Neural Networks, Computer Optical data processing Proteins Research and Analysis Methods Research Design Technology Training Viruses |
title | WalkIm: Compact image-based encoding for high-performance classification of biological sequences using simple tuning-free CNNs |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-02T15%3A45%3A23IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-gale_plos_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=WalkIm:%20Compact%20image-based%20encoding%20for%20high-performance%20classification%20of%20biological%20sequences%20using%20simple%20tuning-free%20CNNs&rft.jtitle=PloS%20one&rft.au=Akbari%20Rokn%20Abadi,%20Saeedeh&rft.date=2022-04-15&rft.volume=17&rft.issue=4&rft.spage=e0267106&rft.epage=e0267106&rft.pages=e0267106-e0267106&rft.issn=1932-6203&rft.eissn=1932-6203&rft_id=info:doi/10.1371/journal.pone.0267106&rft_dat=%3Cgale_plos_%3EA700580255%3C/gale_plos_%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c622t-f8a5d1d6db128775a76b43c12c8a3c612f6810eb48a5c2a3829e7839c85b2e7c3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2650397324&rft_id=info:pmid/35427371&rft_galeid=A700580255&rfr_iscdi=true |