NTL Detection in Electric Distribution Systems Using the Maximal Overlap Discrete Wavelet-Packet Transform and Random Undersampling Boosting

The illegal use of electricity, defective meters, and a malfunctioning infrastructure are major causes of Non-technical losses (NTLs) in electric distribution systems. Although the use of supervised machine learning techniques to detect NTLs has been widely studied, further research is needed in ord...

Bibliographic Details
Published in: IEEE transactions on power systems, 2018-11, Vol. 33 (6), p. 7171-7180
Main Authors: Avila, Nelson Fabian; Figueroa, Gerardo; Chu, Chia-Chi
Format: Article
Language: English
cited_by cdi_FETCH-LOGICAL-c344t-4b7da9b02ac230fc77d7328a5cb43d53296d92b304e26542de91313c5f8117163
cites cdi_FETCH-LOGICAL-c344t-4b7da9b02ac230fc77d7328a5cb43d53296d92b304e26542de91313c5f8117163
container_end_page 7180
container_issue 6
container_start_page 7171
container_title IEEE transactions on power systems
container_volume 33
creator Avila, Nelson Fabian
Figueroa, Gerardo
Chu, Chia-Chi
description The illegal use of electricity, defective meters, and a malfunctioning infrastructure are major causes of Non-technical losses (NTLs) in electric distribution systems. Although the use of supervised machine learning techniques to detect NTLs has been widely studied, further research is needed in order to address some significant challenges. (i) Given that non-fraudulent consumers remarkably outnumber fraudulent ones, the imbalanced nature of the dataset can have a major negative impact on the performance of supervised machine learning methods. (ii) Given the large number of dimensions present in the time series data used for training and testing classifiers, advanced signal processing techniques are required in order to extract the most relevant information. (iii) The effectiveness of classifiers must be evaluated using meaningful performance measures for imbalanced data. This paper proposes a framework that addresses the three previous challenges. The core of the proposed framework is the application of the maximal overlap discrete wavelet-packet transform (MODWPT) for feature extraction from time series data and the random undersampling boosting (RUSBoost) algorithm for NTL detection. Moreover, our framework is evaluated using an extensive list of performance metrics. Experiments show that the MODWPT combined with the RUSBoost algorithm can significantly improve the quality of NTL predictions.
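Challenges (i) and (iii) in the description above can be illustrated with a minimal sketch. It is not the authors' implementation: the function names `random_undersample` and `imbalance_metrics`, the 95/5 class split, and the choice of metrics (accuracy, precision, recall, F1, Matthews correlation coefficient) are assumptions made for illustration, and only the random undersampling step is shown, not the boosting rounds that RUSBoost wraps around it.

```python
import math
import random


def random_undersample(labels, seed=0):
    """Return sorted indices of a balanced subset: every minority-class
    sample plus an equally sized random draw from the majority class."""
    rng = random.Random(seed)
    pos = [i for i, y in enumerate(labels) if y == 1]
    neg = [i for i, y in enumerate(labels) if y == 0]
    minority, majority = (pos, neg) if len(pos) <= len(neg) else (neg, pos)
    kept = rng.sample(majority, len(minority))
    return sorted(minority + kept)


def imbalance_metrics(y_true, y_pred):
    """Compute accuracy alongside measures that stay informative when
    one class dominates (label 1 = fraudulent, an assumed convention)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    accuracy = (tp + tn) / len(pairs) if pairs else 0.0
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1, "mcc": mcc}


# Hypothetical data: 95 non-fraudulent (0) and 5 fraudulent (1) consumers.
labels = [0] * 95 + [1] * 5
# A trivial classifier that never flags fraud looks accurate but is useless.
always_negative = [0] * len(labels)
scores = imbalance_metrics(labels, always_negative)
balanced_idx = random_undersample(labels)
```

In this toy setting the always-negative classifier reaches high accuracy while its recall and MCC are zero, which is why accuracy alone is a poor measure for imbalanced data; the undersampled index set contains the same number of samples from each class.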
doi_str_mv 10.1109/TPWRS.2018.2853162
format article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_crossref_primary_10_1109_TPWRS_2018_2853162</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>8404135</ieee_id><sourcerecordid>2121980728</sourcerecordid><originalsourceid>FETCH-LOGICAL-c344t-4b7da9b02ac230fc77d7328a5cb43d53296d92b304e26542de91313c5f8117163</originalsourceid><addsrcrecordid>eNo9UMlOwzAQtRBIlMIPwMUS5xQvceIcoZRFKrTqoh4jx5lASpZiuxX9Bz4adxGXeSO9ZTQPoWtKepSS5G42XkymPUao7DEpOI3YCepQIWRAojg5RR0ipQhkIsg5urB2SQiJPNFBv--zIX4EB9qVbYPLBg8qv5tS48fSeszWe2K6tQ5qi-e2bD6w-wT8pn7KWlV4tAFTqdVOro0Pwgu1gQpcMFb6CxyeGdXYojU1Vk2OJ360NZ43ORir6lW1i3toW-v8conOClVZuDpiF82fBrP-SzAcPb_274eB5mHogjCLc5VkhCnNOCl0HOcxZ1IJnYU8F5wlUZ6wjJMQWCRClkNCOeVaFJLSmEa8i24PuSvTfq_BunTZrk3jT6aMMppIEjPpVeyg0qa11kCRroz_2GxTStJd6-m-9XTXenps3ZtuDqYSAP4NMiQh5YL_ARV6f90</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2121980728</pqid></control><display><type>article</type><title>NTL Detection in Electric Distribution Systems Using the Maximal Overlap Discrete Wavelet-Packet Transform and Random Undersampling Boosting</title><source>IEEE Electronic Library (IEL) Journals</source><creator>Avila, Nelson Fabian ; Figueroa, Gerardo ; Chu, Chia-Chi</creator><creatorcontrib>Avila, Nelson Fabian ; Figueroa, Gerardo ; Chu, Chia-Chi</creatorcontrib><description>The illegal use of electricity, defective meters, and a malfunctioning infrastructure are major causes of Non-technical losses (NTLs) in electric distribution systems. Although the use of supervised machine learning techniques to detect NTLs has been widely studied, further research is needed in order to address some significant challenges. (i) Given that non-fraudulent consumers remarkably outnumber fraudulent ones, the imbalanced nature of the dataset can have a major negative impact on the performance of supervised machine learning methods. 
(ii) Given the large number of dimensions present in the time series data used for training and testing classifiers, advanced signal processing techniques are required in order to extract the most relevant information. (iii) The effectiveness of classifiers must be evaluated using meaningful performance measures for imbalanced data. This paper proposes a framework that addresses the three previous challenges. The core of the proposed framework is the application of the maximal overlap discrete wavelet-packet transform (MODWPT) for feature extraction from time series data and the random undersampling boosting (RUSBoost) algorithm for NTL detection. Moreover, our framework is evaluated using an extensive list of performance metrics. Experiments show that the MODWPT combined with the RUSBoost algorithm can significantly improve the quality of NTL predictions.</description><identifier>ISSN: 0885-8950</identifier><identifier>EISSN: 1558-0679</identifier><identifier>DOI: 10.1109/TPWRS.2018.2853162</identifier><identifier>CODEN: ITPSEG</identifier><language>eng</language><publisher>New York: IEEE</publisher><subject>Algorithms ; Artificial intelligence ; boosting methods ; classification algorithms ; Classifiers ; Data mining ; Discrete Wavelet Transform ; Feature extraction ; Machine learning ; Machine learning algorithms ; maximal overlap discrete wavelet packet transform ; Measurement ; Measuring instruments ; Non-technical losses ; outlier detection ; Performance measurement ; Signal classification ; Signal processing ; Time series ; Wavelet packets</subject><ispartof>IEEE transactions on power systems, 2018-11, Vol.33 (6), p.7171-7180</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. 
(IEEE) 2018</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c344t-4b7da9b02ac230fc77d7328a5cb43d53296d92b304e26542de91313c5f8117163</citedby><cites>FETCH-LOGICAL-c344t-4b7da9b02ac230fc77d7328a5cb43d53296d92b304e26542de91313c5f8117163</cites><orcidid>0000-0002-8341-2507 ; 0000-0001-6403-6078</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/8404135$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,780,784,27924,27925,54796</link.rule.ids></links><search><creatorcontrib>Avila, Nelson Fabian</creatorcontrib><creatorcontrib>Figueroa, Gerardo</creatorcontrib><creatorcontrib>Chu, Chia-Chi</creatorcontrib><title>NTL Detection in Electric Distribution Systems Using the Maximal Overlap Discrete Wavelet-Packet Transform and Random Undersampling Boosting</title><title>IEEE transactions on power systems</title><addtitle>TPWRS</addtitle><description>The illegal use of electricity, defective meters, and a malfunctioning infrastructure are major causes of Non-technical losses (NTLs) in electric distribution systems. Although the use of supervised machine learning techniques to detect NTLs has been widely studied, further research is needed in order to address some significant challenges. (i) Given that non-fraudulent consumers remarkably outnumber fraudulent ones, the imbalanced nature of the dataset can have a major negative impact on the performance of supervised machine learning methods. (ii) Given the large number of dimensions present in the time series data used for training and testing classifiers, advanced signal processing techniques are required in order to extract the most relevant information. (iii) The effectiveness of classifiers must be evaluated using meaningful performance measures for imbalanced data. 
This paper proposes a framework that addresses the three previous challenges. The core of the proposed framework is the application of the maximal overlap discrete wavelet-packet transform (MODWPT) for feature extraction from time series data and the random undersampling boosting (RUSBoost) algorithm for NTL detection. Moreover, our framework is evaluated using an extensive list of performance metrics. Experiments show that the MODWPT combined with the RUSBoost algorithm can significantly improve the quality of NTL predictions.</description><subject>Algorithms</subject><subject>Artificial intelligence</subject><subject>boosting methods</subject><subject>classification algorithms</subject><subject>Classifiers</subject><subject>Data mining</subject><subject>Discrete Wavelet Transform</subject><subject>Feature extraction</subject><subject>Machine learning</subject><subject>Machine learning algorithms</subject><subject>maximal overlap discrete wavelet packet transform</subject><subject>Measurement</subject><subject>Measuring instruments</subject><subject>Non-technical losses</subject><subject>outlier detection</subject><subject>Performance measurement</subject><subject>Signal classification</subject><subject>Signal processing</subject><subject>Time series</subject><subject>Wavelet 
packets</subject><issn>0885-8950</issn><issn>1558-0679</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2018</creationdate><recordtype>article</recordtype><recordid>eNo9UMlOwzAQtRBIlMIPwMUS5xQvceIcoZRFKrTqoh4jx5lASpZiuxX9Bz4adxGXeSO9ZTQPoWtKepSS5G42XkymPUao7DEpOI3YCepQIWRAojg5RR0ipQhkIsg5urB2SQiJPNFBv--zIX4EB9qVbYPLBg8qv5tS48fSeszWe2K6tQ5qi-e2bD6w-wT8pn7KWlV4tAFTqdVOro0Pwgu1gQpcMFb6CxyeGdXYojU1Vk2OJ360NZ43ORir6lW1i3toW-v8conOClVZuDpiF82fBrP-SzAcPb_274eB5mHogjCLc5VkhCnNOCl0HOcxZ1IJnYU8F5wlUZ6wjJMQWCRClkNCOeVaFJLSmEa8i24PuSvTfq_BunTZrk3jT6aMMppIEjPpVeyg0qa11kCRroz_2GxTStJd6-m-9XTXenps3ZtuDqYSAP4NMiQh5YL_ARV6f90</recordid><startdate>201811</startdate><enddate>201811</enddate><creator>Avila, Nelson Fabian</creator><creator>Figueroa, Gerardo</creator><creator>Chu, Chia-Chi</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SP</scope><scope>7TB</scope><scope>8FD</scope><scope>FR3</scope><scope>KR7</scope><scope>L7M</scope><orcidid>https://orcid.org/0000-0002-8341-2507</orcidid><orcidid>https://orcid.org/0000-0001-6403-6078</orcidid></search><sort><creationdate>201811</creationdate><title>NTL Detection in Electric Distribution Systems Using the Maximal Overlap Discrete Wavelet-Packet Transform and Random Undersampling Boosting</title><author>Avila, Nelson Fabian ; Figueroa, Gerardo ; Chu, Chia-Chi</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c344t-4b7da9b02ac230fc77d7328a5cb43d53296d92b304e26542de91313c5f8117163</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2018</creationdate><topic>Algorithms</topic><topic>Artificial intelligence</topic><topic>boosting methods</topic><topic>classification algorithms</topic><topic>Classifiers</topic><topic>Data mining</topic><topic>Discrete Wavelet 
Transform</topic><topic>Feature extraction</topic><topic>Machine learning</topic><topic>Machine learning algorithms</topic><topic>maximal overlap discrete wavelet packet transform</topic><topic>Measurement</topic><topic>Measuring instruments</topic><topic>Non-technical losses</topic><topic>outlier detection</topic><topic>Performance measurement</topic><topic>Signal classification</topic><topic>Signal processing</topic><topic>Time series</topic><topic>Wavelet packets</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Avila, Nelson Fabian</creatorcontrib><creatorcontrib>Figueroa, Gerardo</creatorcontrib><creatorcontrib>Chu, Chia-Chi</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>CrossRef</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>Mechanical &amp; Transportation Engineering Abstracts</collection><collection>Technology Research Database</collection><collection>Engineering Research Database</collection><collection>Civil Engineering Abstracts</collection><collection>Advanced Technologies Database with Aerospace</collection><jtitle>IEEE transactions on power systems</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Avila, Nelson Fabian</au><au>Figueroa, Gerardo</au><au>Chu, Chia-Chi</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>NTL Detection in Electric Distribution Systems Using the Maximal Overlap Discrete Wavelet-Packet Transform and Random Undersampling Boosting</atitle><jtitle>IEEE transactions on power 
systems</jtitle><stitle>TPWRS</stitle><date>2018-11</date><risdate>2018</risdate><volume>33</volume><issue>6</issue><spage>7171</spage><epage>7180</epage><pages>7171-7180</pages><issn>0885-8950</issn><eissn>1558-0679</eissn><coden>ITPSEG</coden><abstract>The illegal use of electricity, defective meters, and a malfunctioning infrastructure are major causes of Non-technical losses (NTLs) in electric distribution systems. Although the use of supervised machine learning techniques to detect NTLs has been widely studied, further research is needed in order to address some significant challenges. (i) Given that non-fraudulent consumers remarkably outnumber fraudulent ones, the imbalanced nature of the dataset can have a major negative impact on the performance of supervised machine learning methods. (ii) Given the large number of dimensions present in the time series data used for training and testing classifiers, advanced signal processing techniques are required in order to extract the most relevant information. (iii) The effectiveness of classifiers must be evaluated using meaningful performance measures for imbalanced data. This paper proposes a framework that addresses the three previous challenges. The core of the proposed framework is the application of the maximal overlap discrete wavelet-packet transform (MODWPT) for feature extraction from time series data and the random undersampling boosting (RUSBoost) algorithm for NTL detection. Moreover, our framework is evaluated using an extensive list of performance metrics. Experiments show that the MODWPT combined with the RUSBoost algorithm can significantly improve the quality of NTL predictions.</abstract><cop>New York</cop><pub>IEEE</pub><doi>10.1109/TPWRS.2018.2853162</doi><tpages>10</tpages><orcidid>https://orcid.org/0000-0002-8341-2507</orcidid><orcidid>https://orcid.org/0000-0001-6403-6078</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 0885-8950
ispartof IEEE transactions on power systems, 2018-11, Vol.33 (6), p.7171-7180
issn 0885-8950
1558-0679
language eng
recordid cdi_crossref_primary_10_1109_TPWRS_2018_2853162
source IEEE Electronic Library (IEL) Journals
subjects Algorithms
Artificial intelligence
boosting methods
classification algorithms
Classifiers
Data mining
Discrete Wavelet Transform
Feature extraction
Machine learning
Machine learning algorithms
maximal overlap discrete wavelet packet transform
Measurement
Measuring instruments
Non-technical losses
outlier detection
Performance measurement
Signal classification
Signal processing
Time series
Wavelet packets
title NTL Detection in Electric Distribution Systems Using the Maximal Overlap Discrete Wavelet-Packet Transform and Random Undersampling Boosting
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-03T12%3A43%3A11IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=NTL%20Detection%20in%20Electric%20Distribution%20Systems%20Using%20the%20Maximal%20Overlap%20Discrete%20Wavelet-Packet%20Transform%20and%20Random%20Undersampling%20Boosting&rft.jtitle=IEEE%20transactions%20on%20power%20systems&rft.au=Avila,%20Nelson%20Fabian&rft.date=2018-11&rft.volume=33&rft.issue=6&rft.spage=7171&rft.epage=7180&rft.pages=7171-7180&rft.issn=0885-8950&rft.eissn=1558-0679&rft.coden=ITPSEG&rft_id=info:doi/10.1109/TPWRS.2018.2853162&rft_dat=%3Cproquest_cross%3E2121980728%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c344t-4b7da9b02ac230fc77d7328a5cb43d53296d92b304e26542de91313c5f8117163%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2121980728&rft_id=info:pmid/&rft_ieee_id=8404135&rfr_iscdi=true