Distribution of sentence length and dependency distance in children's compositions: Characteristics of natural language and variations in language development
This study analyzed the distribution of the sentence length and mean of dependency distances (MDD) in Japanese sentences, comparing data from random sources with that obtained from children's compositions, and identifying changes in distribution according to grade level. Findings reveal that th...
Published in: | F1000 research 2023, Vol.12, p.379-379 |
---|---|
Main Author: | Imada, Mizuho |
Format: | Article |
Language: | eng ; jpn |
cited_by | |
---|---|
cites | cdi_FETCH-LOGICAL-c2851-c5cd32b67d5e1034c1a30cf1e4e7700e2d5241b6214aa5e3d948cda546e219bd3 |
container_end_page | 379 |
container_issue | |
container_start_page | 379 |
container_title | F1000 research |
container_volume | 12 |
creator | Imada, Mizuho |
description | This study analyzed the distributions of sentence length and mean dependency distance (MDD) in Japanese sentences, comparing random data with data obtained from children's compositions and identifying how the distributions change with grade level. Findings reveal that sentence length in random data is well fitted by a geometric distribution, whereas MDD is well fitted by a lognormal distribution. In contrast, in children's compositions the distribution of the number of clauses shifts from a lognormal to a gamma distribution depending on the school year, and MDD follows a gamma distribution. Mean MDD increases exponentially with the logarithm of the number of clauses in random data, whereas it increases linearly in composition data, generally supporting previous findings that dependency distances are optimized in natural language. However, MDD changes non-monotonically across grades, suggesting the complexity of children's language development. |
doi_str_mv | 10.12688/f1000research.132383.1 |
format | article |
eissn | 2046-1402 |
oa | free_for_read |
orcid | https://orcid.org/0000-0001-6505-3988 |
pmid | 37378085 |
publisher | England: F1000 Research Ltd |
rights | Copyright: © 2023 Imada M. |
fulltext | fulltext |
identifier | ISSN: 2046-1402 |
ispartof | F1000 research, 2023, Vol.12, p.379-379 |
issn | 2046-1402 2046-1402 |
language | eng ; jpn |
recordid | cdi_doaj_primary_oai_doaj_org_article_9fc999bf2ae24573915bd04838af882a |
source | Publicly Available Content Database (Proquest) (PQ_SDU_P3); PubMed Central(OpenAccess) |
subjects | Child ; children's compositions ; dependency distance ; Depressive Disorder, Major ; eng ; generalized linear mixed model ; Humans ; Language ; Language Development ; probability distribution ; sentence length ; 文長 (sentence length) |
title | Distribution of sentence length and dependency distance in children's compositions: Characteristics of natural language and variations in language development |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-21T13%3A39%3A02IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_doaj_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Distribution%20of%20sentence%20length%20and%20dependency%20distance%20in%20children's%20compositions:%20Characteristics%20of%20natural%20language%20and%20variations%20in%20language%20development&rft.jtitle=F1000%20research&rft.au=Imada,%20Mizuho&rft.date=2023&rft.volume=12&rft.spage=379&rft.epage=379&rft.pages=379-379&rft.issn=2046-1402&rft.eissn=2046-1402&rft_id=info:doi/10.12688/f1000research.132383.1&rft_dat=%3Cproquest_doaj_%3E2830668407%3C/proquest_doaj_%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c2851-c5cd32b67d5e1034c1a30cf1e4e7700e2d5241b6214aa5e3d948cda546e219bd3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2830668407&rft_id=info:pmid/37378085&rfr_iscdi=true |
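
The description above relies on the mean dependency distance (MDD) of a sentence. The record does not give the formula, so the following is only a minimal sketch that assumes the commonly used definition: the average absolute difference between the position of each unit and the position of its head, with the root excluded. The function name `mean_dependency_distance`, the head-index encoding, and the example sentence are hypothetical and are not taken from the article.

```python
from statistics import mean


def mean_dependency_distance(heads):
    """Return the mean dependency distance of one sentence.

    Assumed encoding (hypothetical, not from the article): heads[i] is the
    1-based position of the head of the unit at position i + 1, and 0 marks
    the root, which has no head and contributes no distance.
    """
    distances = [
        abs(position - head)
        for position, head in enumerate(heads, start=1)
        if head != 0
    ]
    if not distances:
        raise ValueError("a sentence needs at least one dependency link")
    return mean(distances)


# Hypothetical four-unit sentence: units 1 and 2 depend on unit 3,
# unit 3 depends on unit 4, and unit 4 is the root.
example_heads = [3, 3, 4, 0]
example_mdd = mean_dependency_distance(example_heads)
```

For the example, the three links have distances 2, 1, and 1, so the value is 4/3. In a head-final language such as Japanese, every head follows its dependent, which is why the root sits at the last position in this illustration.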