Loading…

Distribution of sentence length and dependency distance in children's compositions: Characteristics of natural language and variations in language development

This study analyzed the distribution of the sentence length and mean of dependency distances (MDD) in Japanese sentences, comparing data from random sources with that obtained from children's compositions, and identifying changes in distribution according to grade level. Findings reveal that th...

Full description

Saved in:
Bibliographic Details
Published in:F1000 research 2023, Vol.12, p.379-379
Main Author: Imada, Mizuho
Format: Article
Language:eng ; jpn
Subjects:
Citations: Items that this one cites
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites cdi_FETCH-LOGICAL-c2851-c5cd32b67d5e1034c1a30cf1e4e7700e2d5241b6214aa5e3d948cda546e219bd3
container_end_page 379
container_issue
container_start_page 379
container_title F1000 research
container_volume 12
creator Imada, Mizuho
description This study analyzed the distribution of the sentence length and mean of dependency distances (MDD) in Japanese sentences, comparing data from random sources with that obtained from children's compositions, and identifying changes in distribution according to grade level. Findings reveal that the sentence length in random data is well suited to a geometric distribution, whereas MDD is well suited to a lognormal distribution. In contrast, data from children's compositions show a shift in the distribution of the number of clauses from a lognormal to a gamma distribution, depending on the school year, with MDD suiting a gamma distribution. Mean MDD increases exponentially with the logarithm of the number of clauses in random data, while it increases linearly in composition data, thus generally supporting previous findings that dependency distances are optimized in natural language. However, MDDs exhibit non-monotonic changes with grades, suggesting the complexity of children's language development.
doi_str_mv 10.12688/f1000research.132383.1
format article
fullrecord <record><control><sourceid>proquest_doaj_</sourceid><recordid>TN_cdi_doaj_primary_oai_doaj_org_article_9fc999bf2ae24573915bd04838af882a</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><doaj_id>oai_doaj_org_article_9fc999bf2ae24573915bd04838af882a</doaj_id><sourcerecordid>2830668407</sourcerecordid><originalsourceid>FETCH-LOGICAL-c2851-c5cd32b67d5e1034c1a30cf1e4e7700e2d5241b6214aa5e3d948cda546e219bd3</originalsourceid><addsrcrecordid>eNpVkc9u1DAQxi1ERattXwF8g8su_pfE4YaWApUqcYGzNbEnu64SO9hJpb4Mz1rvblnBaayZ7_uNRx8h7zjbcFFr_bHnjLGEGSHZ_YZLIbXc8FfkSjBVr7li4vU_70tyk_NDcbC2lbVo3pBL2chGM11dkT9ffJ6T75bZx0BjTzOGGYNFOmDYzXsKwVGHEwZXuk_UFTkcxj5Qu_eDSxjeZ2rjOMXsD5D8iW73kMDOmIrY23zABpiXBAMdIOwW2OGR-wjJw9FzwJ1HDh9xiNNYfnJNLnoYMt681BX59fX25_b7-v7Ht7vt5_u1Fbria1tZJ0VXN65CzqSyHCSzPUeFTcMYClcJxbtacAVQoXSt0tZBpWoUvO2cXJG7E9dFeDBT8iOkJxPBm2Mjpp2BVG4Z0LS9bdu26wWgUFUjW151jiktNfRaCyisDyfWlOLvBfNsRp8tDuU-jEs2QktW11qxpkibk9SmmHPC_ryaM3PM2vyXtTllXcqKvH1ZsnQjurPvb7LyGZToqy0</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2830668407</pqid></control><display><type>article</type><title>Distribution of sentence length and dependency distance in children's compositions: Characteristics of natural language and variations in language development</title><source>Publicly Available Content Database (Proquest) (PQ_SDU_P3)</source><source>PubMed Central(OpenAccess)</source><creator>Imada, Mizuho</creator><creatorcontrib>Imada, Mizuho</creatorcontrib><description>This study analyzed the distribution of the sentence length and mean of dependency distances (MDD) in Japanese sentences, comparing data from random sources with that obtained from children's compositions, and identifying changes in distribution according to grade level. Findings reveal that the sentence length in random data is well suited to a geometric distribution, whereas MDD is well suited to a lognormal distribution. In contrast, data from children's compositions show a shift in the distribution of the number of clauses from a lognormal to a gamma distribution, depending on the school year, with MDD suiting a gamma distribution. Mean MDD increases exponentially with the logarithm of the number of clauses in random data, while it increases linearly in composition data, thus generally supporting previous findings that dependency distances are optimized in natural language. However, MDDs exhibit non-monotonic changes with grades, suggesting the complexity of children's language development.</description><identifier>ISSN: 2046-1402</identifier><identifier>EISSN: 2046-1402</identifier><identifier>DOI: 10.12688/f1000research.132383.1</identifier><identifier>PMID: 37378085</identifier><language>eng ; jpn</language><publisher>England: F1000 Research Ltd</publisher><subject>Child ; children's compositions ; dependency distance ; Depressive Disorder, Major ; eng ; generalized linear mixed model ; Humans ; Language ; Language Development ; probability distribution ; sentence length ; 文長</subject><ispartof>F1000 research, 2023, Vol.12, p.379-379</ispartof><rights>Copyright: © 2023 Imada M.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c2851-c5cd32b67d5e1034c1a30cf1e4e7700e2d5241b6214aa5e3d948cda546e219bd3</cites><orcidid>0000-0001-6505-3988</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,4009,27902,27903,27904,36992</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/37378085$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Imada, Mizuho</creatorcontrib><title>Distribution of sentence length and dependency distance in children's compositions: Characteristics of natural language and variations in language development</title><title>F1000 research</title><addtitle>F1000Res</addtitle><description>This study analyzed the distribution of the sentence length and mean of dependency distances (MDD) in Japanese sentences, comparing data from random sources with that obtained from children's compositions, and identifying changes in distribution according to grade level. Findings reveal that the sentence length in random data is well suited to a geometric distribution, whereas MDD is well suited to a lognormal distribution. In contrast, data from children's compositions show a shift in the distribution of the number of clauses from a lognormal to a gamma distribution, depending on the school year, with MDD suiting a gamma distribution. Mean MDD increases exponentially with the logarithm of the number of clauses in random data, while it increases linearly in composition data, thus generally supporting previous findings that dependency distances are optimized in natural language. However, MDDs exhibit non-monotonic changes with grades, suggesting the complexity of children's language development.</description><subject>Child</subject><subject>children's compositions</subject><subject>dependency distance</subject><subject>Depressive Disorder, Major</subject><subject>eng</subject><subject>generalized linear mixed model</subject><subject>Humans</subject><subject>Language</subject><subject>Language Development</subject><subject>probability distribution</subject><subject>sentence length</subject><subject>文長</subject><issn>2046-1402</issn><issn>2046-1402</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>DOA</sourceid><recordid>eNpVkc9u1DAQxi1ERattXwF8g8su_pfE4YaWApUqcYGzNbEnu64SO9hJpb4Mz1rvblnBaayZ7_uNRx8h7zjbcFFr_bHnjLGEGSHZ_YZLIbXc8FfkSjBVr7li4vU_70tyk_NDcbC2lbVo3pBL2chGM11dkT9ffJ6T75bZx0BjTzOGGYNFOmDYzXsKwVGHEwZXuk_UFTkcxj5Qu_eDSxjeZ2rjOMXsD5D8iW73kMDOmIrY23zABpiXBAMdIOwW2OGR-wjJw9FzwJ1HDh9xiNNYfnJNLnoYMt681BX59fX25_b7-v7Ht7vt5_u1Fbria1tZJ0VXN65CzqSyHCSzPUeFTcMYClcJxbtacAVQoXSt0tZBpWoUvO2cXJG7E9dFeDBT8iOkJxPBm2Mjpp2BVG4Z0LS9bdu26wWgUFUjW151jiktNfRaCyisDyfWlOLvBfNsRp8tDuU-jEs2QktW11qxpkibk9SmmHPC_ryaM3PM2vyXtTllXcqKvH1ZsnQjurPvb7LyGZToqy0</recordid><startdate>2023</startdate><enddate>2023</enddate><creator>Imada, Mizuho</creator><general>F1000 Research Ltd</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>DOA</scope><orcidid>https://orcid.org/0000-0001-6505-3988</orcidid></search><sort><creationdate>2023</creationdate><title>Distribution of sentence length and dependency distance in children's compositions: Characteristics of natural language and variations in language development</title><author>Imada, Mizuho</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c2851-c5cd32b67d5e1034c1a30cf1e4e7700e2d5241b6214aa5e3d948cda546e219bd3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng ; jpn</language><creationdate>2023</creationdate><topic>Child</topic><topic>children's compositions</topic><topic>dependency distance</topic><topic>Depressive Disorder, Major</topic><topic>eng</topic><topic>generalized linear mixed model</topic><topic>Humans</topic><topic>Language</topic><topic>Language Development</topic><topic>probability distribution</topic><topic>sentence length</topic><topic>文長</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Imada, Mizuho</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>F1000 research</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Imada, Mizuho</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Distribution of sentence length and dependency distance in children's compositions: Characteristics of natural language and variations in language development</atitle><jtitle>F1000 research</jtitle><addtitle>F1000Res</addtitle><date>2023</date><risdate>2023</risdate><volume>12</volume><spage>379</spage><epage>379</epage><pages>379-379</pages><issn>2046-1402</issn><eissn>2046-1402</eissn><abstract>This study analyzed the distribution of the sentence length and mean of dependency distances (MDD) in Japanese sentences, comparing data from random sources with that obtained from children's compositions, and identifying changes in distribution according to grade level. Findings reveal that the sentence length in random data is well suited to a geometric distribution, whereas MDD is well suited to a lognormal distribution. In contrast, data from children's compositions show a shift in the distribution of the number of clauses from a lognormal to a gamma distribution, depending on the school year, with MDD suiting a gamma distribution. Mean MDD increases exponentially with the logarithm of the number of clauses in random data, while it increases linearly in composition data, thus generally supporting previous findings that dependency distances are optimized in natural language. However, MDDs exhibit non-monotonic changes with grades, suggesting the complexity of children's language development.</abstract><cop>England</cop><pub>F1000 Research Ltd</pub><pmid>37378085</pmid><doi>10.12688/f1000research.132383.1</doi><tpages>1</tpages><orcidid>https://orcid.org/0000-0001-6505-3988</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 2046-1402
ispartof F1000 research, 2023, Vol.12, p.379-379
issn 2046-1402
2046-1402
language eng ; jpn
recordid cdi_doaj_primary_oai_doaj_org_article_9fc999bf2ae24573915bd04838af882a
source Publicly Available Content Database (Proquest) (PQ_SDU_P3); PubMed Central(OpenAccess)
subjects Child
children's compositions
dependency distance
Depressive Disorder, Major
eng
generalized linear mixed model
Humans
Language
Language Development
probability distribution
sentence length
文長
title Distribution of sentence length and dependency distance in children's compositions: Characteristics of natural language and variations in language development
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-21T13%3A39%3A02IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_doaj_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Distribution%20of%20sentence%20length%20and%20dependency%20distance%20in%20children's%20compositions:%20Characteristics%20of%20natural%20language%20and%20variations%20in%20language%20development&rft.jtitle=F1000%20research&rft.au=Imada,%20Mizuho&rft.date=2023&rft.volume=12&rft.spage=379&rft.epage=379&rft.pages=379-379&rft.issn=2046-1402&rft.eissn=2046-1402&rft_id=info:doi/10.12688/f1000research.132383.1&rft_dat=%3Cproquest_doaj_%3E2830668407%3C/proquest_doaj_%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c2851-c5cd32b67d5e1034c1a30cf1e4e7700e2d5241b6214aa5e3d948cda546e219bd3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2830668407&rft_id=info:pmid/37378085&rfr_iscdi=true