Loading…

The art of time-bending: Data augmentation and early prediction for efficient traffic classification

The accurate identification of internet traffic is crucial for network management. However, the use of encryption techniques and constant changes in network protocols make it difficult to extract useful features for traffic classification. Additionally, there may be limited data availability and a l...

Full description

Saved in:
Bibliographic Details
Published in:Expert systems with applications 2024-10, Vol.252, p.124166, Article 124166
Main Authors: Hajaj, Chen, Aharon, Porat, Dubin, Ran, Dvir, Amit
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites cdi_FETCH-LOGICAL-c251t-9e368600d5b965d42d5e2a6f8da2958b8d820fdaa696367c65f74738011900a73
container_end_page
container_issue
container_start_page 124166
container_title Expert systems with applications
container_volume 252
creator Hajaj, Chen
Aharon, Porat
Dubin, Ran
Dvir, Amit
description The accurate identification of internet traffic is crucial for network management. However, the use of encryption techniques and constant changes in network protocols make it difficult to extract useful features for traffic classification. Additionally, there may be limited data availability and a lack of diversity within the dataset, which poses further challenges. To address these issues, our research proposes a novel solution that uses an innovative data augmentation technique. This approach leverages the capabilities of LSTM networks to create synthetic data points that closely resemble real traffic data. By doing so, we can significantly enrich the dataset used for training and improve classification efficiency. We conducted thorough experiments to validate our approach and found that combining LSTM-generated data with actual traffic data leads to notable improvements in classification efficiency. We demonstrated the effectiveness of our methodology using academic and commercial datasets. Our classifier, trained on the generated data, showed a performance boost of 6%. Moreover, when classifying with only half of the time, thus utilizing half of the signal, our approach achieved a notable 4% improvement compared to the original classifier. The inclusion of augmented samples within the training set led to a noticeable improvement in both accuracy and F1-score. These findings compellingly demonstrate our data augmentation strategy’s practical utility and efficiency in earlier prediction with improved performance for encrypted traffic classification systems. •Enhanced Traffic Analysis via LSTM Augmentation.•Efficiency Boost: Earlier Traffic Classification.•Improved Classifier Performance with Generated Data.•Diverse Data Synthesis for Improved Classification.
doi_str_mv 10.1016/j.eswa.2024.124166
format article
fullrecord <record><control><sourceid>elsevier_cross</sourceid><recordid>TN_cdi_crossref_primary_10_1016_j_eswa_2024_124166</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0957417424010327</els_id><sourcerecordid>S0957417424010327</sourcerecordid><originalsourceid>FETCH-LOGICAL-c251t-9e368600d5b965d42d5e2a6f8da2958b8d820fdaa696367c65f74738011900a73</originalsourceid><addsrcrecordid>eNp9kM9OwzAMh3MAiTF4AU55gQ4nbZMWcUHjrzSJyzhHXuKMTFs7JQG0t6fdOHPyT5Y_y_4YuxEwEyDU7WZG6QdnEmQ1E7ISSp2xCbS1Liqhqwt2mdIGQGgAPWFu-UkcY-a95znsqFhR50K3vuOPmJHj13pHXcYc-o5j5zhh3B74PpIL9tj0feTkfbBhmOM54pi53WJKYUhH8oqde9wmuv6rU_bx_LScvxaL95e3-cOisLIWuWipVI0CcPWqVbWrpKtJovKNQ9nWzapxjQTvEFWrSqWtqr2udNmAEC0A6nLK5GmvjX1KkbzZx7DDeDACzOjGbMzoxoxuzMnNAN2fIBou-w4UTRp_scOHkWw2rg__4b-7Km_0</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>The art of time-bending: Data augmentation and early prediction for efficient traffic classification</title><source>ScienceDirect Freedom Collection</source><creator>Hajaj, Chen ; Aharon, Porat ; Dubin, Ran ; Dvir, Amit</creator><creatorcontrib>Hajaj, Chen ; Aharon, Porat ; Dubin, Ran ; Dvir, Amit</creatorcontrib><description>The accurate identification of internet traffic is crucial for network management. However, the use of encryption techniques and constant changes in network protocols make it difficult to extract useful features for traffic classification. Additionally, there may be limited data availability and a lack of diversity within the dataset, which poses further challenges. To address these issues, our research proposes a novel solution that uses an innovative data augmentation technique. This approach leverages the capabilities of LSTM networks to create synthetic data points that closely resemble real traffic data. By doing so, we can significantly enrich the dataset used for training and improve classification efficiency. We conducted thorough experiments to validate our approach and found that combining LSTM-generated data with actual traffic data leads to notable improvements in classification efficiency. We demonstrated the effectiveness of our methodology using academic and commercial datasets. Our classifier, trained on the generated data, showed a performance boost of 6%. Moreover, when classifying with only half of the time, thus utilizing half of the signal, our approach achieved a notable 4% improvement compared to the original classifier. The inclusion of augmented samples within the training set led to a noticeable improvement in both accuracy and F1-score. These findings compellingly demonstrate our data augmentation strategy’s practical utility and efficiency in earlier prediction with improved performance for encrypted traffic classification systems. •Enhanced Traffic Analysis via LSTM Augmentation.•Efficiency Boost: Earlier Traffic Classification.•Improved Classifier Performance with Generated Data.•Diverse Data Synthesis for Improved Classification.</description><identifier>ISSN: 0957-4174</identifier><identifier>DOI: 10.1016/j.eswa.2024.124166</identifier><language>eng</language><publisher>Elsevier Ltd</publisher><subject>Data augmentation ; Internet traffic classification ; Long Short-Term Memory (LSTM) networks</subject><ispartof>Expert systems with applications, 2024-10, Vol.252, p.124166, Article 124166</ispartof><rights>2024 Elsevier Ltd</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c251t-9e368600d5b965d42d5e2a6f8da2958b8d820fdaa696367c65f74738011900a73</cites><orcidid>0000-0001-9940-5654 ; 0009-0006-1493-7322 ; 0000-0002-2055-2211 ; 0000-0002-3670-0784</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,27901,27902</link.rule.ids></links><search><creatorcontrib>Hajaj, Chen</creatorcontrib><creatorcontrib>Aharon, Porat</creatorcontrib><creatorcontrib>Dubin, Ran</creatorcontrib><creatorcontrib>Dvir, Amit</creatorcontrib><title>The art of time-bending: Data augmentation and early prediction for efficient traffic classification</title><title>Expert systems with applications</title><description>The accurate identification of internet traffic is crucial for network management. However, the use of encryption techniques and constant changes in network protocols make it difficult to extract useful features for traffic classification. Additionally, there may be limited data availability and a lack of diversity within the dataset, which poses further challenges. To address these issues, our research proposes a novel solution that uses an innovative data augmentation technique. This approach leverages the capabilities of LSTM networks to create synthetic data points that closely resemble real traffic data. By doing so, we can significantly enrich the dataset used for training and improve classification efficiency. We conducted thorough experiments to validate our approach and found that combining LSTM-generated data with actual traffic data leads to notable improvements in classification efficiency. We demonstrated the effectiveness of our methodology using academic and commercial datasets. Our classifier, trained on the generated data, showed a performance boost of 6%. Moreover, when classifying with only half of the time, thus utilizing half of the signal, our approach achieved a notable 4% improvement compared to the original classifier. The inclusion of augmented samples within the training set led to a noticeable improvement in both accuracy and F1-score. These findings compellingly demonstrate our data augmentation strategy’s practical utility and efficiency in earlier prediction with improved performance for encrypted traffic classification systems. •Enhanced Traffic Analysis via LSTM Augmentation.•Efficiency Boost: Earlier Traffic Classification.•Improved Classifier Performance with Generated Data.•Diverse Data Synthesis for Improved Classification.</description><subject>Data augmentation</subject><subject>Internet traffic classification</subject><subject>Long Short-Term Memory (LSTM) networks</subject><issn>0957-4174</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNp9kM9OwzAMh3MAiTF4AU55gQ4nbZMWcUHjrzSJyzhHXuKMTFs7JQG0t6fdOHPyT5Y_y_4YuxEwEyDU7WZG6QdnEmQ1E7ISSp2xCbS1Liqhqwt2mdIGQGgAPWFu-UkcY-a95znsqFhR50K3vuOPmJHj13pHXcYc-o5j5zhh3B74PpIL9tj0feTkfbBhmOM54pi53WJKYUhH8oqde9wmuv6rU_bx_LScvxaL95e3-cOisLIWuWipVI0CcPWqVbWrpKtJovKNQ9nWzapxjQTvEFWrSqWtqr2udNmAEC0A6nLK5GmvjX1KkbzZx7DDeDACzOjGbMzoxoxuzMnNAN2fIBou-w4UTRp_scOHkWw2rg__4b-7Km_0</recordid><startdate>20241015</startdate><enddate>20241015</enddate><creator>Hajaj, Chen</creator><creator>Aharon, Porat</creator><creator>Dubin, Ran</creator><creator>Dvir, Amit</creator><general>Elsevier Ltd</general><scope>AAYXX</scope><scope>CITATION</scope><orcidid>https://orcid.org/0000-0001-9940-5654</orcidid><orcidid>https://orcid.org/0009-0006-1493-7322</orcidid><orcidid>https://orcid.org/0000-0002-2055-2211</orcidid><orcidid>https://orcid.org/0000-0002-3670-0784</orcidid></search><sort><creationdate>20241015</creationdate><title>The art of time-bending: Data augmentation and early prediction for efficient traffic classification</title><author>Hajaj, Chen ; Aharon, Porat ; Dubin, Ran ; Dvir, Amit</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c251t-9e368600d5b965d42d5e2a6f8da2958b8d820fdaa696367c65f74738011900a73</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Data augmentation</topic><topic>Internet traffic classification</topic><topic>Long Short-Term Memory (LSTM) networks</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Hajaj, Chen</creatorcontrib><creatorcontrib>Aharon, Porat</creatorcontrib><creatorcontrib>Dubin, Ran</creatorcontrib><creatorcontrib>Dvir, Amit</creatorcontrib><collection>CrossRef</collection><jtitle>Expert systems with applications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Hajaj, Chen</au><au>Aharon, Porat</au><au>Dubin, Ran</au><au>Dvir, Amit</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>The art of time-bending: Data augmentation and early prediction for efficient traffic classification</atitle><jtitle>Expert systems with applications</jtitle><date>2024-10-15</date><risdate>2024</risdate><volume>252</volume><spage>124166</spage><pages>124166-</pages><artnum>124166</artnum><issn>0957-4174</issn><abstract>The accurate identification of internet traffic is crucial for network management. However, the use of encryption techniques and constant changes in network protocols make it difficult to extract useful features for traffic classification. Additionally, there may be limited data availability and a lack of diversity within the dataset, which poses further challenges. To address these issues, our research proposes a novel solution that uses an innovative data augmentation technique. This approach leverages the capabilities of LSTM networks to create synthetic data points that closely resemble real traffic data. By doing so, we can significantly enrich the dataset used for training and improve classification efficiency. We conducted thorough experiments to validate our approach and found that combining LSTM-generated data with actual traffic data leads to notable improvements in classification efficiency. We demonstrated the effectiveness of our methodology using academic and commercial datasets. Our classifier, trained on the generated data, showed a performance boost of 6%. Moreover, when classifying with only half of the time, thus utilizing half of the signal, our approach achieved a notable 4% improvement compared to the original classifier. The inclusion of augmented samples within the training set led to a noticeable improvement in both accuracy and F1-score. These findings compellingly demonstrate our data augmentation strategy’s practical utility and efficiency in earlier prediction with improved performance for encrypted traffic classification systems. •Enhanced Traffic Analysis via LSTM Augmentation.•Efficiency Boost: Earlier Traffic Classification.•Improved Classifier Performance with Generated Data.•Diverse Data Synthesis for Improved Classification.</abstract><pub>Elsevier Ltd</pub><doi>10.1016/j.eswa.2024.124166</doi><orcidid>https://orcid.org/0000-0001-9940-5654</orcidid><orcidid>https://orcid.org/0009-0006-1493-7322</orcidid><orcidid>https://orcid.org/0000-0002-2055-2211</orcidid><orcidid>https://orcid.org/0000-0002-3670-0784</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 0957-4174
ispartof Expert systems with applications, 2024-10, Vol.252, p.124166, Article 124166
issn 0957-4174
language eng
recordid cdi_crossref_primary_10_1016_j_eswa_2024_124166
source ScienceDirect Freedom Collection
subjects Data augmentation
Internet traffic classification
Long Short-Term Memory (LSTM) networks
title The art of time-bending: Data augmentation and early prediction for efficient traffic classification
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-14T18%3A41%3A19IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-elsevier_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=The%20art%20of%20time-bending:%20Data%20augmentation%20and%20early%20prediction%20for%20efficient%20traffic%20classification&rft.jtitle=Expert%20systems%20with%20applications&rft.au=Hajaj,%20Chen&rft.date=2024-10-15&rft.volume=252&rft.spage=124166&rft.pages=124166-&rft.artnum=124166&rft.issn=0957-4174&rft_id=info:doi/10.1016/j.eswa.2024.124166&rft_dat=%3Celsevier_cross%3ES0957417424010327%3C/elsevier_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c251t-9e368600d5b965d42d5e2a6f8da2958b8d820fdaa696367c65f74738011900a73%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true