Loading…
The art of time-bending: Data augmentation and early prediction for efficient traffic classification
The accurate identification of internet traffic is crucial for network management. However, the use of encryption techniques and constant changes in network protocols make it difficult to extract useful features for traffic classification. Additionally, there may be limited data availability and a l...
Saved in:
Published in: | Expert systems with applications 2024-10, Vol.252, p.124166, Article 124166 |
---|---|
Main Authors: | , , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | |
---|---|
cites | cdi_FETCH-LOGICAL-c251t-9e368600d5b965d42d5e2a6f8da2958b8d820fdaa696367c65f74738011900a73 |
container_end_page | |
container_issue | |
container_start_page | 124166 |
container_title | Expert systems with applications |
container_volume | 252 |
creator | Hajaj, Chen Aharon, Porat Dubin, Ran Dvir, Amit |
description | The accurate identification of internet traffic is crucial for network management. However, the use of encryption techniques and constant changes in network protocols make it difficult to extract useful features for traffic classification. Additionally, there may be limited data availability and a lack of diversity within the dataset, which poses further challenges. To address these issues, our research proposes a novel solution that uses an innovative data augmentation technique. This approach leverages the capabilities of LSTM networks to create synthetic data points that closely resemble real traffic data. By doing so, we can significantly enrich the dataset used for training and improve classification efficiency. We conducted thorough experiments to validate our approach and found that combining LSTM-generated data with actual traffic data leads to notable improvements in classification efficiency. We demonstrated the effectiveness of our methodology using academic and commercial datasets. Our classifier, trained on the generated data, showed a performance boost of 6%. Moreover, when classifying with only half of the time, thus utilizing half of the signal, our approach achieved a notable 4% improvement compared to the original classifier. The inclusion of augmented samples within the training set led to a noticeable improvement in both accuracy and F1-score. These findings compellingly demonstrate our data augmentation strategy’s practical utility and efficiency in earlier prediction with improved performance for encrypted traffic classification systems.
•Enhanced Traffic Analysis via LSTM Augmentation.•Efficiency Boost: Earlier Traffic Classification.•Improved Classifier Performance with Generated Data.•Diverse Data Synthesis for Improved Classification. |
doi_str_mv | 10.1016/j.eswa.2024.124166 |
format | article |
fullrecord | <record><control><sourceid>elsevier_cross</sourceid><recordid>TN_cdi_crossref_primary_10_1016_j_eswa_2024_124166</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0957417424010327</els_id><sourcerecordid>S0957417424010327</sourcerecordid><originalsourceid>FETCH-LOGICAL-c251t-9e368600d5b965d42d5e2a6f8da2958b8d820fdaa696367c65f74738011900a73</originalsourceid><addsrcrecordid>eNp9kM9OwzAMh3MAiTF4AU55gQ4nbZMWcUHjrzSJyzhHXuKMTFs7JQG0t6fdOHPyT5Y_y_4YuxEwEyDU7WZG6QdnEmQ1E7ISSp2xCbS1Liqhqwt2mdIGQGgAPWFu-UkcY-a95znsqFhR50K3vuOPmJHj13pHXcYc-o5j5zhh3B74PpIL9tj0feTkfbBhmOM54pi53WJKYUhH8oqde9wmuv6rU_bx_LScvxaL95e3-cOisLIWuWipVI0CcPWqVbWrpKtJovKNQ9nWzapxjQTvEFWrSqWtqr2udNmAEC0A6nLK5GmvjX1KkbzZx7DDeDACzOjGbMzoxoxuzMnNAN2fIBou-w4UTRp_scOHkWw2rg__4b-7Km_0</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>The art of time-bending: Data augmentation and early prediction for efficient traffic classification</title><source>ScienceDirect Freedom Collection</source><creator>Hajaj, Chen ; Aharon, Porat ; Dubin, Ran ; Dvir, Amit</creator><creatorcontrib>Hajaj, Chen ; Aharon, Porat ; Dubin, Ran ; Dvir, Amit</creatorcontrib><description>The accurate identification of internet traffic is crucial for network management. However, the use of encryption techniques and constant changes in network protocols make it difficult to extract useful features for traffic classification. Additionally, there may be limited data availability and a lack of diversity within the dataset, which poses further challenges. To address these issues, our research proposes a novel solution that uses an innovative data augmentation technique. This approach leverages the capabilities of LSTM networks to create synthetic data points that closely resemble real traffic data. By doing so, we can significantly enrich the dataset used for training and improve classification efficiency. We conducted thorough experiments to validate our approach and found that combining LSTM-generated data with actual traffic data leads to notable improvements in classification efficiency. We demonstrated the effectiveness of our methodology using academic and commercial datasets. Our classifier, trained on the generated data, showed a performance boost of 6%. Moreover, when classifying with only half of the time, thus utilizing half of the signal, our approach achieved a notable 4% improvement compared to the original classifier. The inclusion of augmented samples within the training set led to a noticeable improvement in both accuracy and F1-score. These findings compellingly demonstrate our data augmentation strategy’s practical utility and efficiency in earlier prediction with improved performance for encrypted traffic classification systems.
•Enhanced Traffic Analysis via LSTM Augmentation.•Efficiency Boost: Earlier Traffic Classification.•Improved Classifier Performance with Generated Data.•Diverse Data Synthesis for Improved Classification.</description><identifier>ISSN: 0957-4174</identifier><identifier>DOI: 10.1016/j.eswa.2024.124166</identifier><language>eng</language><publisher>Elsevier Ltd</publisher><subject>Data augmentation ; Internet traffic classification ; Long Short-Term Memory (LSTM) networks</subject><ispartof>Expert systems with applications, 2024-10, Vol.252, p.124166, Article 124166</ispartof><rights>2024 Elsevier Ltd</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c251t-9e368600d5b965d42d5e2a6f8da2958b8d820fdaa696367c65f74738011900a73</cites><orcidid>0000-0001-9940-5654 ; 0009-0006-1493-7322 ; 0000-0002-2055-2211 ; 0000-0002-3670-0784</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,27901,27902</link.rule.ids></links><search><creatorcontrib>Hajaj, Chen</creatorcontrib><creatorcontrib>Aharon, Porat</creatorcontrib><creatorcontrib>Dubin, Ran</creatorcontrib><creatorcontrib>Dvir, Amit</creatorcontrib><title>The art of time-bending: Data augmentation and early prediction for efficient traffic classification</title><title>Expert systems with applications</title><description>The accurate identification of internet traffic is crucial for network management. However, the use of encryption techniques and constant changes in network protocols make it difficult to extract useful features for traffic classification. Additionally, there may be limited data availability and a lack of diversity within the dataset, which poses further challenges. To address these issues, our research proposes a novel solution that uses an innovative data augmentation technique. This approach leverages the capabilities of LSTM networks to create synthetic data points that closely resemble real traffic data. By doing so, we can significantly enrich the dataset used for training and improve classification efficiency. We conducted thorough experiments to validate our approach and found that combining LSTM-generated data with actual traffic data leads to notable improvements in classification efficiency. We demonstrated the effectiveness of our methodology using academic and commercial datasets. Our classifier, trained on the generated data, showed a performance boost of 6%. Moreover, when classifying with only half of the time, thus utilizing half of the signal, our approach achieved a notable 4% improvement compared to the original classifier. The inclusion of augmented samples within the training set led to a noticeable improvement in both accuracy and F1-score. These findings compellingly demonstrate our data augmentation strategy’s practical utility and efficiency in earlier prediction with improved performance for encrypted traffic classification systems.
•Enhanced Traffic Analysis via LSTM Augmentation.•Efficiency Boost: Earlier Traffic Classification.•Improved Classifier Performance with Generated Data.•Diverse Data Synthesis for Improved Classification.</description><subject>Data augmentation</subject><subject>Internet traffic classification</subject><subject>Long Short-Term Memory (LSTM) networks</subject><issn>0957-4174</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNp9kM9OwzAMh3MAiTF4AU55gQ4nbZMWcUHjrzSJyzhHXuKMTFs7JQG0t6fdOHPyT5Y_y_4YuxEwEyDU7WZG6QdnEmQ1E7ISSp2xCbS1Liqhqwt2mdIGQGgAPWFu-UkcY-a95znsqFhR50K3vuOPmJHj13pHXcYc-o5j5zhh3B74PpIL9tj0feTkfbBhmOM54pi53WJKYUhH8oqde9wmuv6rU_bx_LScvxaL95e3-cOisLIWuWipVI0CcPWqVbWrpKtJovKNQ9nWzapxjQTvEFWrSqWtqr2udNmAEC0A6nLK5GmvjX1KkbzZx7DDeDACzOjGbMzoxoxuzMnNAN2fIBou-w4UTRp_scOHkWw2rg__4b-7Km_0</recordid><startdate>20241015</startdate><enddate>20241015</enddate><creator>Hajaj, Chen</creator><creator>Aharon, Porat</creator><creator>Dubin, Ran</creator><creator>Dvir, Amit</creator><general>Elsevier Ltd</general><scope>AAYXX</scope><scope>CITATION</scope><orcidid>https://orcid.org/0000-0001-9940-5654</orcidid><orcidid>https://orcid.org/0009-0006-1493-7322</orcidid><orcidid>https://orcid.org/0000-0002-2055-2211</orcidid><orcidid>https://orcid.org/0000-0002-3670-0784</orcidid></search><sort><creationdate>20241015</creationdate><title>The art of time-bending: Data augmentation and early prediction for efficient traffic classification</title><author>Hajaj, Chen ; Aharon, Porat ; Dubin, Ran ; Dvir, Amit</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c251t-9e368600d5b965d42d5e2a6f8da2958b8d820fdaa696367c65f74738011900a73</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Data augmentation</topic><topic>Internet traffic classification</topic><topic>Long Short-Term Memory (LSTM) networks</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Hajaj, Chen</creatorcontrib><creatorcontrib>Aharon, Porat</creatorcontrib><creatorcontrib>Dubin, Ran</creatorcontrib><creatorcontrib>Dvir, Amit</creatorcontrib><collection>CrossRef</collection><jtitle>Expert systems with applications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Hajaj, Chen</au><au>Aharon, Porat</au><au>Dubin, Ran</au><au>Dvir, Amit</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>The art of time-bending: Data augmentation and early prediction for efficient traffic classification</atitle><jtitle>Expert systems with applications</jtitle><date>2024-10-15</date><risdate>2024</risdate><volume>252</volume><spage>124166</spage><pages>124166-</pages><artnum>124166</artnum><issn>0957-4174</issn><abstract>The accurate identification of internet traffic is crucial for network management. However, the use of encryption techniques and constant changes in network protocols make it difficult to extract useful features for traffic classification. Additionally, there may be limited data availability and a lack of diversity within the dataset, which poses further challenges. To address these issues, our research proposes a novel solution that uses an innovative data augmentation technique. This approach leverages the capabilities of LSTM networks to create synthetic data points that closely resemble real traffic data. By doing so, we can significantly enrich the dataset used for training and improve classification efficiency. We conducted thorough experiments to validate our approach and found that combining LSTM-generated data with actual traffic data leads to notable improvements in classification efficiency. We demonstrated the effectiveness of our methodology using academic and commercial datasets. Our classifier, trained on the generated data, showed a performance boost of 6%. Moreover, when classifying with only half of the time, thus utilizing half of the signal, our approach achieved a notable 4% improvement compared to the original classifier. The inclusion of augmented samples within the training set led to a noticeable improvement in both accuracy and F1-score. These findings compellingly demonstrate our data augmentation strategy’s practical utility and efficiency in earlier prediction with improved performance for encrypted traffic classification systems.
•Enhanced Traffic Analysis via LSTM Augmentation.•Efficiency Boost: Earlier Traffic Classification.•Improved Classifier Performance with Generated Data.•Diverse Data Synthesis for Improved Classification.</abstract><pub>Elsevier Ltd</pub><doi>10.1016/j.eswa.2024.124166</doi><orcidid>https://orcid.org/0000-0001-9940-5654</orcidid><orcidid>https://orcid.org/0009-0006-1493-7322</orcidid><orcidid>https://orcid.org/0000-0002-2055-2211</orcidid><orcidid>https://orcid.org/0000-0002-3670-0784</orcidid></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0957-4174 |
ispartof | Expert systems with applications, 2024-10, Vol.252, p.124166, Article 124166 |
issn | 0957-4174 |
language | eng |
recordid | cdi_crossref_primary_10_1016_j_eswa_2024_124166 |
source | ScienceDirect Freedom Collection |
subjects | Data augmentation Internet traffic classification Long Short-Term Memory (LSTM) networks |
title | The art of time-bending: Data augmentation and early prediction for efficient traffic classification |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-14T18%3A41%3A19IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-elsevier_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=The%20art%20of%20time-bending:%20Data%20augmentation%20and%20early%20prediction%20for%20efficient%20traffic%20classification&rft.jtitle=Expert%20systems%20with%20applications&rft.au=Hajaj,%20Chen&rft.date=2024-10-15&rft.volume=252&rft.spage=124166&rft.pages=124166-&rft.artnum=124166&rft.issn=0957-4174&rft_id=info:doi/10.1016/j.eswa.2024.124166&rft_dat=%3Celsevier_cross%3ES0957417424010327%3C/elsevier_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c251t-9e368600d5b965d42d5e2a6f8da2958b8d820fdaa696367c65f74738011900a73%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |