Loading…

DDC: Deep Distribution Classifier, A Convolutional Neural Network-based Approach for Identifying Data Distributions

In domains such as the stock market and manufacturing, there’s a growing demand for faster and more accurate data distribution identification methods due to the rapid generation of vast volumes of data, highlighting the need for enhanced real-time decision-making capabilities. Traditional methods of...

Full description

Saved in:

Bibliographic Details
Published in:	Journal of the Indian Society of Agricultural Statistics 2024-09, Vol.78 (2), p.169-178
Main Authors:	Godara, Samarth, G, Avinash, Parsad, Rajender, Marwaha, Sudeep
Format:	Article
Language:	English
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

cited_by
cites
container_end_page	178
container_issue	2
container_start_page	169
container_title	Journal of the Indian Society of Agricultural Statistics
container_volume	78
creator	Godara, Samarth G, Avinash Parsad, Rajender Marwaha, Sudeep
description	In domains such as the stock market and manufacturing, there’s a growing demand for faster and more accurate data distribution identification methods due to the rapid generation of vast volumes of data, highlighting the need for enhanced real-time decision-making capabilities. Traditional methods of identifying data distributions often rely on manual inspection, limited statistical tests and time-consuming analysis, leading to inefficiencies and inaccuracies in classification. In this scenario, the presented research offers a novel approach leveraging Deep Learning (DL) models to automate the process. The presented methodology also enables faster and more accurate identification of data distributions by the generation of synthetic data points and training of the DL model for identifying different distribution types. The primary objective of this study is to develop a DL model that categorizes data points into specific distributions based on an input dataset. Moreover, for model training and evaluation, a total of 1000 datasets are generated,each comprising 1000 data points. The study considers five distributions (Normal, Uniform, Exponential, Log-normal and Beta distribution), with 200 datasets generated (with randomly selected parameters) for each distribution. In the study, the DL model is trained first, and later, the model is evaluated on a separate test (unseen) dataset. Then, its performance in classifying the distributions is assessed based on metrics such as accuracy and loss. The study results demonstrate the effectiveness of the proposed approach in accurately classifying the distribution of data points, providing valuable insights into the application of DL for distribution classification tasks. The proposed method enhances scalability, robustness and efficiency by harnessing the power of convolutional neural networks and advanced preprocessing techniques.
doi_str_mv	10.56093/jisas.v78i2.11
format	article
fullrecord	<record><control><sourceid>crossref</sourceid><recordid>TN_cdi_crossref_primary_10_56093_jisas_v78i2_11</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>10_56093_jisas_v78i2_11</sourcerecordid><originalsourceid>FETCH-crossref_primary_10_56093_jisas_v78i2_113</originalsourceid><addsrcrecordid>eNqVj7FOAzEQRF2ARASpafcDuIsdCyehi84gaFLRW5vEhg3H-bTrBOXvQRYNJdO8YjQjPaVujW7vnV7Z2YEEpT0tljRvjblQE63NqnHW2Ss1FTnon7j5wrrlRIn33QP4GEfwJIVpeyyUB-h6FKFEke9gDV0eTrmvDfawiUeuKF-ZP5otStzDehw54-4dUmZ42cehUDrT8AYeC_75lht1mbCXOP3ltZo9Pb52z82OswjHFEamT-RzMDpUpVCVQlUKxtj_L74BqcFaeg</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>DDC: Deep Distribution Classifier, A Convolutional Neural Network-based Approach for Identifying Data Distributions</title><source>EZB Electronic Journals Library</source><creator>Godara, Samarth ; G, Avinash ; Parsad, Rajender ; Marwaha, Sudeep</creator><creatorcontrib>Godara, Samarth ; G, Avinash ; Parsad, Rajender ; Marwaha, Sudeep</creatorcontrib><description>In domains such as the stock market and manufacturing, there’s a growing demand for faster and more accurate data distribution identification methods due to the rapid generation of vast volumes of data, highlighting the need for enhanced real-time decision-making capabilities. Traditional methods of identifying data distributions often rely on manual inspection, limited statistical tests and time-consuming analysis, leading to inefficiencies and inaccuracies in classification. In this scenario, the presented research offers a novel approach leveraging Deep Learning (DL) models to automate the process. The presented methodology also enables faster and more accurate identification of data distributions by the generation of synthetic data points and training of the DL model for identifying different distribution types. The primary objective of this study is to develop a DL model that categorizes data points into specific distributions based on an input dataset. Moreover, for model training and evaluation, a total of 1000 datasets are generated,each comprising 1000 data points. The study considers five distributions (Normal, Uniform, Exponential, Log-normal and Beta distribution), with 200 datasets generated (with randomly selected parameters) for each distribution. In the study, the DL model is trained first, and later, the model is evaluated on a separate test (unseen) dataset. Then, its performance in classifying the distributions is assessed based on metrics such as accuracy and loss. The study results demonstrate the effectiveness of the proposed approach in accurately classifying the distribution of data points, providing valuable insights into the application of DL for distribution classification tasks. The proposed method enhances scalability, robustness and efficiency by harnessing the power of convolutional neural networks and advanced preprocessing techniques.</description><identifier>ISSN: 0019-6363</identifier><identifier>DOI: 10.56093/jisas.v78i2.11</identifier><language>eng</language><ispartof>Journal of the Indian Society of Agricultural Statistics, 2024-09, Vol.78 (2), p.169-178</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids></links><search><creatorcontrib>Godara, Samarth</creatorcontrib><creatorcontrib>G, Avinash</creatorcontrib><creatorcontrib>Parsad, Rajender</creatorcontrib><creatorcontrib>Marwaha, Sudeep</creatorcontrib><title>DDC: Deep Distribution Classifier, A Convolutional Neural Network-based Approach for Identifying Data Distributions</title><title>Journal of the Indian Society of Agricultural Statistics</title><description>In domains such as the stock market and manufacturing, there’s a growing demand for faster and more accurate data distribution identification methods due to the rapid generation of vast volumes of data, highlighting the need for enhanced real-time decision-making capabilities. Traditional methods of identifying data distributions often rely on manual inspection, limited statistical tests and time-consuming analysis, leading to inefficiencies and inaccuracies in classification. In this scenario, the presented research offers a novel approach leveraging Deep Learning (DL) models to automate the process. The presented methodology also enables faster and more accurate identification of data distributions by the generation of synthetic data points and training of the DL model for identifying different distribution types. The primary objective of this study is to develop a DL model that categorizes data points into specific distributions based on an input dataset. Moreover, for model training and evaluation, a total of 1000 datasets are generated,each comprising 1000 data points. The study considers five distributions (Normal, Uniform, Exponential, Log-normal and Beta distribution), with 200 datasets generated (with randomly selected parameters) for each distribution. In the study, the DL model is trained first, and later, the model is evaluated on a separate test (unseen) dataset. Then, its performance in classifying the distributions is assessed based on metrics such as accuracy and loss. The study results demonstrate the effectiveness of the proposed approach in accurately classifying the distribution of data points, providing valuable insights into the application of DL for distribution classification tasks. The proposed method enhances scalability, robustness and efficiency by harnessing the power of convolutional neural networks and advanced preprocessing techniques.</description><issn>0019-6363</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNqVj7FOAzEQRF2ARASpafcDuIsdCyehi84gaFLRW5vEhg3H-bTrBOXvQRYNJdO8YjQjPaVujW7vnV7Z2YEEpT0tljRvjblQE63NqnHW2Ss1FTnon7j5wrrlRIn33QP4GEfwJIVpeyyUB-h6FKFEke9gDV0eTrmvDfawiUeuKF-ZP5otStzDehw54-4dUmZ42cehUDrT8AYeC_75lht1mbCXOP3ltZo9Pb52z82OswjHFEamT-RzMDpUpVCVQlUKxtj_L74BqcFaeg</recordid><startdate>20240910</startdate><enddate>20240910</enddate><creator>Godara, Samarth</creator><creator>G, Avinash</creator><creator>Parsad, Rajender</creator><creator>Marwaha, Sudeep</creator><scope>AAYXX</scope><scope>CITATION</scope></search><sort><creationdate>20240910</creationdate><title>DDC: Deep Distribution Classifier, A Convolutional Neural Network-based Approach for Identifying Data Distributions</title><author>Godara, Samarth ; G, Avinash ; Parsad, Rajender ; Marwaha, Sudeep</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-crossref_primary_10_56093_jisas_v78i2_113</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><toplevel>online_resources</toplevel><creatorcontrib>Godara, Samarth</creatorcontrib><creatorcontrib>G, Avinash</creatorcontrib><creatorcontrib>Parsad, Rajender</creatorcontrib><creatorcontrib>Marwaha, Sudeep</creatorcontrib><collection>CrossRef</collection><jtitle>Journal of the Indian Society of Agricultural Statistics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Godara, Samarth</au><au>G, Avinash</au><au>Parsad, Rajender</au><au>Marwaha, Sudeep</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>DDC: Deep Distribution Classifier, A Convolutional Neural Network-based Approach for Identifying Data Distributions</atitle><jtitle>Journal of the Indian Society of Agricultural Statistics</jtitle><date>2024-09-10</date><risdate>2024</risdate><volume>78</volume><issue>2</issue><spage>169</spage><epage>178</epage><pages>169-178</pages><issn>0019-6363</issn><abstract>In domains such as the stock market and manufacturing, there’s a growing demand for faster and more accurate data distribution identification methods due to the rapid generation of vast volumes of data, highlighting the need for enhanced real-time decision-making capabilities. Traditional methods of identifying data distributions often rely on manual inspection, limited statistical tests and time-consuming analysis, leading to inefficiencies and inaccuracies in classification. In this scenario, the presented research offers a novel approach leveraging Deep Learning (DL) models to automate the process. The presented methodology also enables faster and more accurate identification of data distributions by the generation of synthetic data points and training of the DL model for identifying different distribution types. The primary objective of this study is to develop a DL model that categorizes data points into specific distributions based on an input dataset. Moreover, for model training and evaluation, a total of 1000 datasets are generated,each comprising 1000 data points. The study considers five distributions (Normal, Uniform, Exponential, Log-normal and Beta distribution), with 200 datasets generated (with randomly selected parameters) for each distribution. In the study, the DL model is trained first, and later, the model is evaluated on a separate test (unseen) dataset. Then, its performance in classifying the distributions is assessed based on metrics such as accuracy and loss. The study results demonstrate the effectiveness of the proposed approach in accurately classifying the distribution of data points, providing valuable insights into the application of DL for distribution classification tasks. The proposed method enhances scalability, robustness and efficiency by harnessing the power of convolutional neural networks and advanced preprocessing techniques.</abstract><doi>10.56093/jisas.v78i2.11</doi></addata></record>
fulltext	fulltext
identifier	ISSN: 0019-6363
ispartof	Journal of the Indian Society of Agricultural Statistics, 2024-09, Vol.78 (2), p.169-178
issn	0019-6363
language	eng
recordid	cdi_crossref_primary_10_56093_jisas_v78i2_11
source	EZB Electronic Journals Library
title	DDC: Deep Distribution Classifier, A Convolutional Neural Network-based Approach for Identifying Data Distributions
url	http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-07T15%3A20%3A51IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-crossref&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=DDC:%20Deep%20Distribution%20Classifier,%20A%20Convolutional%20Neural%20Network-based%20Approach%20for%20Identifying%20Data%20Distributions&rft.jtitle=Journal%20of%20the%20Indian%20Society%20of%20Agricultural%20Statistics&rft.au=Godara,%20Samarth&rft.date=2024-09-10&rft.volume=78&rft.issue=2&rft.spage=169&rft.epage=178&rft.pages=169-178&rft.issn=0019-6363&rft_id=info:doi/10.56093/jisas.v78i2.11&rft_dat=%3Ccrossref%3E10_56093_jisas_v78i2_11%3C/crossref%3E%3Cgrp_id%3Ecdi_FETCH-crossref_primary_10_56093_jisas_v78i2_113%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true