Loading…

Convolution Neural Network Having Multiple Channels with Own Attention Layer for Depression Detection from Social Data

People share textual posts about their interests, routines, and moods on social platforms, which can be targeted to evaluate their mental state using diverse techniques such as lexical approaches, machine learning (ML), and deep learning (DL). Bigger grams (bi, tri, or quad) carry more contextual in...

Full description

Saved in:

Bibliographic Details
Published in:	New generation computing 2024-03, Vol.42 (1), p.135-155
Main Authors:	Dalal, Sumit, Jain, Sarika, Dave, Mayank
Format:	Article
Language:	English
Subjects:	Artificial Intelligence Artificial neural networks Channels Classification Computer Hardware Computer Science Computer Systems Organization and Communication Networks Context Deep learning Machine learning Model accuracy Neural networks Recall Recurrent neural networks Software Engineering/Programming and Operating Systems
Citations:	Items that this one cites Items that cite this one
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

cited_by	cdi_FETCH-LOGICAL-c319t-e2571b352bc7df2212cc44a478284e80589f439c0f061c5fb7bcb3921c54ec663
cites	cdi_FETCH-LOGICAL-c319t-e2571b352bc7df2212cc44a478284e80589f439c0f061c5fb7bcb3921c54ec663
container_end_page	155
container_issue	1
container_start_page	135
container_title	New generation computing
container_volume	42
creator	Dalal, Sumit Jain, Sarika Dave, Mayank
description	People share textual posts about their interests, routines, and moods on social platforms, which can be targeted to evaluate their mental state using diverse techniques such as lexical approaches, machine learning (ML), and deep learning (DL). Bigger grams (bi, tri, or quad) carry more contextual information than unigrams. However, most of the models used in the classification of depression include only unigrams. Moreover, the well-known depression classifiers, the recurrent neural networks (RNN), retain only the sequential information of the text and ignores the local features of postings. We suggest using a convolutional neural network of multiple channels (MCNN) to capture local features and larger context from user posts. Also, each channel has a dedicated dot-product attention layer to capture global features from local features of various context levels. The proposed model is tested on a depression dataset CLEF-eRisk 2018 with 214 depressed and 1493 non-depressed users’ posts. Experimental results show that our model achieved competitive accuracy, recall, and f-score of 91.00%, 76.50%, and 70.51%, respectively. Accuracy is up to 5.00% higher and recall is approximately 24% higher than multi-channel CNN without an attention layer. Significant grams highlighted by the attention mechanism can be employed to provide a user-level explanation for the depression classification results. However, directly incorporating the attention weights might not be helpful as attention highlightings are dense and entangled.
doi_str_mv	10.1007/s00354-023-00237-y
format	article
fullrecord	<record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_3048755888</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3048755888</sourcerecordid><originalsourceid>FETCH-LOGICAL-c319t-e2571b352bc7df2212cc44a478284e80589f439c0f061c5fb7bcb3921c54ec663</originalsourceid><addsrcrecordid>eNp9kM1OwzAQhC0EEqXwApwscQ44_kmcI0qBIhV6AM6WY5w2JbWD7bTK2-M2SNy47K52v5mVBoDrFN2mCOV3HiHCaIIwSVAseTKcgEnKOU5yxNgpmCDMeUIyws7BhfebiGeE4gnYldbsbNuHxhr4qnsn29jC3rovOJe7xqzgS9-Gpms1LNfSGN16uG_CGi73Bt6HoM1RupCDdrC2Ds5057T3h-VMB62O59rZLXyzqon2MxnkJTirZev11W-fgo_Hh_dyniyWT8_l_SJRJC1CojHL04owXKn8s8Y4xUpRKmnOMaeaI8aLmpJCoRplqWJ1lVeqIgWOM9Uqy8gU3Iy-nbPfvfZBbGzvTHwpCKI8Z4xzHik8UspZ752uReearXSDSJE45CvGfEWMVhzzFUMUkVHkI2xW2v1Z_6P6AUgdf0c</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3048755888</pqid></control><display><type>article</type><title>Convolution Neural Network Having Multiple Channels with Own Attention Layer for Depression Detection from Social Data</title><source>Springer Link</source><creator>Dalal, Sumit ; Jain, Sarika ; Dave, Mayank</creator><creatorcontrib>Dalal, Sumit ; Jain, Sarika ; Dave, Mayank</creatorcontrib><description>People share textual posts about their interests, routines, and moods on social platforms, which can be targeted to evaluate their mental state using diverse techniques such as lexical approaches, machine learning (ML), and deep learning (DL). Bigger grams (bi, tri, or quad) carry more contextual information than unigrams. However, most of the models used in the classification of depression include only unigrams. Moreover, the well-known depression classifiers, the recurrent neural networks (RNN), retain only the sequential information of the text and ignores the local features of postings. We suggest using a convolutional neural network of multiple channels (MCNN) to capture local features and larger context from user posts. Also, each channel has a dedicated dot-product attention layer to capture global features from local features of various context levels. The proposed model is tested on a depression dataset CLEF-eRisk 2018 with 214 depressed and 1493 non-depressed users’ posts. Experimental results show that our model achieved competitive accuracy, recall, and f-score of 91.00%, 76.50%, and 70.51%, respectively. Accuracy is up to 5.00% higher and recall is approximately 24% higher than multi-channel CNN without an attention layer. Significant grams highlighted by the attention mechanism can be employed to provide a user-level explanation for the depression classification results. However, directly incorporating the attention weights might not be helpful as attention highlightings are dense and entangled.</description><identifier>ISSN: 0288-3635</identifier><identifier>EISSN: 1882-7055</identifier><identifier>DOI: 10.1007/s00354-023-00237-y</identifier><language>eng</language><publisher>Tokyo: Springer Japan</publisher><subject>Artificial Intelligence ; Artificial neural networks ; Channels ; Classification ; Computer Hardware ; Computer Science ; Computer Systems Organization and Communication Networks ; Context ; Deep learning ; Machine learning ; Model accuracy ; Neural networks ; Recall ; Recurrent neural networks ; Software Engineering/Programming and Operating Systems</subject><ispartof>New generation computing, 2024-03, Vol.42 (1), p.135-155</ispartof><rights>The Author(s), under exclusive licence to The Japanese Society for Artificial Intelligence and Springer Nature Japan KK, part of Springer Nature 2023. Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.. corrected publication 2024</rights><rights>The Author(s), under exclusive licence to The Japanese Society for Artificial Intelligence and Springer Nature Japan KK, part of Springer Nature 2023. Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law. corrected publication 2024.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c319t-e2571b352bc7df2212cc44a478284e80589f439c0f061c5fb7bcb3921c54ec663</citedby><cites>FETCH-LOGICAL-c319t-e2571b352bc7df2212cc44a478284e80589f439c0f061c5fb7bcb3921c54ec663</cites><orcidid>0000-0002-8736-2148</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,27901,27902</link.rule.ids></links><search><creatorcontrib>Dalal, Sumit</creatorcontrib><creatorcontrib>Jain, Sarika</creatorcontrib><creatorcontrib>Dave, Mayank</creatorcontrib><title>Convolution Neural Network Having Multiple Channels with Own Attention Layer for Depression Detection from Social Data</title><title>New generation computing</title><addtitle>New Gener. Comput</addtitle><description>People share textual posts about their interests, routines, and moods on social platforms, which can be targeted to evaluate their mental state using diverse techniques such as lexical approaches, machine learning (ML), and deep learning (DL). Bigger grams (bi, tri, or quad) carry more contextual information than unigrams. However, most of the models used in the classification of depression include only unigrams. Moreover, the well-known depression classifiers, the recurrent neural networks (RNN), retain only the sequential information of the text and ignores the local features of postings. We suggest using a convolutional neural network of multiple channels (MCNN) to capture local features and larger context from user posts. Also, each channel has a dedicated dot-product attention layer to capture global features from local features of various context levels. The proposed model is tested on a depression dataset CLEF-eRisk 2018 with 214 depressed and 1493 non-depressed users’ posts. Experimental results show that our model achieved competitive accuracy, recall, and f-score of 91.00%, 76.50%, and 70.51%, respectively. Accuracy is up to 5.00% higher and recall is approximately 24% higher than multi-channel CNN without an attention layer. Significant grams highlighted by the attention mechanism can be employed to provide a user-level explanation for the depression classification results. However, directly incorporating the attention weights might not be helpful as attention highlightings are dense and entangled.</description><subject>Artificial Intelligence</subject><subject>Artificial neural networks</subject><subject>Channels</subject><subject>Classification</subject><subject>Computer Hardware</subject><subject>Computer Science</subject><subject>Computer Systems Organization and Communication Networks</subject><subject>Context</subject><subject>Deep learning</subject><subject>Machine learning</subject><subject>Model accuracy</subject><subject>Neural networks</subject><subject>Recall</subject><subject>Recurrent neural networks</subject><subject>Software Engineering/Programming and Operating Systems</subject><issn>0288-3635</issn><issn>1882-7055</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNp9kM1OwzAQhC0EEqXwApwscQ44_kmcI0qBIhV6AM6WY5w2JbWD7bTK2-M2SNy47K52v5mVBoDrFN2mCOV3HiHCaIIwSVAseTKcgEnKOU5yxNgpmCDMeUIyws7BhfebiGeE4gnYldbsbNuHxhr4qnsn29jC3rovOJe7xqzgS9-Gpms1LNfSGN16uG_CGi73Bt6HoM1RupCDdrC2Ds5057T3h-VMB62O59rZLXyzqon2MxnkJTirZev11W-fgo_Hh_dyniyWT8_l_SJRJC1CojHL04owXKn8s8Y4xUpRKmnOMaeaI8aLmpJCoRplqWJ1lVeqIgWOM9Uqy8gU3Iy-nbPfvfZBbGzvTHwpCKI8Z4xzHik8UspZ752uReearXSDSJE45CvGfEWMVhzzFUMUkVHkI2xW2v1Z_6P6AUgdf0c</recordid><startdate>20240301</startdate><enddate>20240301</enddate><creator>Dalal, Sumit</creator><creator>Jain, Sarika</creator><creator>Dave, Mayank</creator><general>Springer Japan</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><orcidid>https://orcid.org/0000-0002-8736-2148</orcidid></search><sort><creationdate>20240301</creationdate><title>Convolution Neural Network Having Multiple Channels with Own Attention Layer for Depression Detection from Social Data</title><author>Dalal, Sumit ; Jain, Sarika ; Dave, Mayank</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c319t-e2571b352bc7df2212cc44a478284e80589f439c0f061c5fb7bcb3921c54ec663</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Artificial Intelligence</topic><topic>Artificial neural networks</topic><topic>Channels</topic><topic>Classification</topic><topic>Computer Hardware</topic><topic>Computer Science</topic><topic>Computer Systems Organization and Communication Networks</topic><topic>Context</topic><topic>Deep learning</topic><topic>Machine learning</topic><topic>Model accuracy</topic><topic>Neural networks</topic><topic>Recall</topic><topic>Recurrent neural networks</topic><topic>Software Engineering/Programming and Operating Systems</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Dalal, Sumit</creatorcontrib><creatorcontrib>Jain, Sarika</creatorcontrib><creatorcontrib>Dave, Mayank</creatorcontrib><collection>CrossRef</collection><jtitle>New generation computing</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Dalal, Sumit</au><au>Jain, Sarika</au><au>Dave, Mayank</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Convolution Neural Network Having Multiple Channels with Own Attention Layer for Depression Detection from Social Data</atitle><jtitle>New generation computing</jtitle><stitle>New Gener. Comput</stitle><date>2024-03-01</date><risdate>2024</risdate><volume>42</volume><issue>1</issue><spage>135</spage><epage>155</epage><pages>135-155</pages><issn>0288-3635</issn><eissn>1882-7055</eissn><abstract>People share textual posts about their interests, routines, and moods on social platforms, which can be targeted to evaluate their mental state using diverse techniques such as lexical approaches, machine learning (ML), and deep learning (DL). Bigger grams (bi, tri, or quad) carry more contextual information than unigrams. However, most of the models used in the classification of depression include only unigrams. Moreover, the well-known depression classifiers, the recurrent neural networks (RNN), retain only the sequential information of the text and ignores the local features of postings. We suggest using a convolutional neural network of multiple channels (MCNN) to capture local features and larger context from user posts. Also, each channel has a dedicated dot-product attention layer to capture global features from local features of various context levels. The proposed model is tested on a depression dataset CLEF-eRisk 2018 with 214 depressed and 1493 non-depressed users’ posts. Experimental results show that our model achieved competitive accuracy, recall, and f-score of 91.00%, 76.50%, and 70.51%, respectively. Accuracy is up to 5.00% higher and recall is approximately 24% higher than multi-channel CNN without an attention layer. Significant grams highlighted by the attention mechanism can be employed to provide a user-level explanation for the depression classification results. However, directly incorporating the attention weights might not be helpful as attention highlightings are dense and entangled.</abstract><cop>Tokyo</cop><pub>Springer Japan</pub><doi>10.1007/s00354-023-00237-y</doi><tpages>21</tpages><orcidid>https://orcid.org/0000-0002-8736-2148</orcidid></addata></record>
fulltext	fulltext
identifier	ISSN: 0288-3635
ispartof	New generation computing, 2024-03, Vol.42 (1), p.135-155
issn	0288-3635 1882-7055
language	eng
recordid	cdi_proquest_journals_3048755888
source	Springer Link
subjects	Artificial Intelligence Artificial neural networks Channels Classification Computer Hardware Computer Science Computer Systems Organization and Communication Networks Context Deep learning Machine learning Model accuracy Neural networks Recall Recurrent neural networks Software Engineering/Programming and Operating Systems
title	Convolution Neural Network Having Multiple Channels with Own Attention Layer for Depression Detection from Social Data
url	http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-10T16%3A05%3A12IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Convolution%20Neural%20Network%20Having%20Multiple%20Channels%20with%20Own%20Attention%20Layer%20for%20Depression%20Detection%20from%20Social%20Data&rft.jtitle=New%20generation%20computing&rft.au=Dalal,%20Sumit&rft.date=2024-03-01&rft.volume=42&rft.issue=1&rft.spage=135&rft.epage=155&rft.pages=135-155&rft.issn=0288-3635&rft.eissn=1882-7055&rft_id=info:doi/10.1007/s00354-023-00237-y&rft_dat=%3Cproquest_cross%3E3048755888%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c319t-e2571b352bc7df2212cc44a478284e80589f439c0f061c5fb7bcb3921c54ec663%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=3048755888&rft_id=info:pmid/&rfr_iscdi=true