Loading…

On the Convergence of Upper Bound Techniques for the Average Length of Longest Common Subsequences

It has long been known [2] that the average length of the longest common subsequence of two random strings of length n over an alphabet of size k is asymptotic to γ^sub k^n for some constant γ^sub k^ depending on k. The value of these constants remains unknown, and a number of papers have proved upp...

Full description

Saved in:
Bibliographic Details
Main Author: Lueker, George S
Format: Conference Proceeding
Language:English
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites
container_end_page
container_issue
container_start_page 169
container_title
container_volume
creator Lueker, George S
description It has long been known [2] that the average length of the longest common subsequence of two random strings of length n over an alphabet of size k is asymptotic to γ^sub k^n for some constant γ^sub k^ depending on k. The value of these constants remains unknown, and a number of papers have proved upper and lower bounds on them. In particular, in [6] we used a modification of methods of [3, 4] for determining lower and upper bounds on γ^sub k^, combined with large computer computations, to obtain improved bounds on γ^sub 2^. The method of [6] involved a parameter h; empirically, increasing h increased the computation time but gave better upper bounds. Here we show, for arbitrary k, a sufficient condition for a parameterized method to produce a sequence of upper bounds approaching the true value of γ^sub k^, and show that a generalization of the method of [6] meets this condition for all k ≥ 2. While [3, 4] do not explicitly discuss how to parameterize their method, which is based on a concept they call domination, to trade off the tightness of the bound vs. the amount of computation, we discuss a very natural parameterization of their method; for the case of alphabet size k = 2 we conjecture but do not prove that it also meets the sufficient condition and hence also yields a sequence of bounds that converges to the correct value of γ^sub 2^. For k > 2, it does not meet our sufficient condition. Thus we leave open the question of whether some method based on the undominated collations of [3, 4] gives bounds converging to the correct value for any k ≥ 2. [PUBLICATION ABSTRACT]
format conference_proceeding
fullrecord <record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_962440068</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2618264621</sourcerecordid><originalsourceid>FETCH-proquest_journals_9624400683</originalsourceid><addsrcrecordid>eNqNjM0KgkAURocgSMp3uLQXRp2sliVFC6FFtRa1qyZ5x-an52-MHqDVt_jOORPmRWEiAh6LeMZ8rTvOechXaxFFHivPBKZFSCW9UTVIFYKs4TYMqGAvLd3hilVLj5dFDbVUX3rn2KJByJAa045CJqlBbVyn7yXBxZYaneJyesGmdfHU6P92zpbHwzU9BYOSY9XknbSK3JVvk0gIzpNN_Bf0AYw5RJw</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype><pqid>962440068</pqid></control><display><type>conference_proceeding</type><title>On the Convergence of Upper Bound Techniques for the Average Length of Longest Common Subsequences</title><source>ABI/INFORM Global</source><creator>Lueker, George S</creator><creatorcontrib>Lueker, George S</creatorcontrib><description>It has long been known [2] that the average length of the longest common subsequence of two random strings of length n over an alphabet of size k is asymptotic to γ^sub k^n for some constant γ^sub k^ depending on k. The value of these constants remains unknown, and a number of papers have proved upper and lower bounds on them. In particular, in [6] we used a modification of methods of [3, 4] for determining lower and upper bounds on γ^sub k^, combined with large computer computations, to obtain improved bounds on γ^sub 2^. The method of [6] involved a parameter h; empirically, increasing h increased the computation time but gave better upper bounds. Here we show, for arbitrary k, a sufficient condition for a parameterized method to produce a sequence of upper bounds approaching the true value of γ^sub k^, and show that a generalization of the method of [6] meets this condition for all k ≥ 2. While [3, 4] do not explicitly discuss how to parameterize their method, which is based on a concept they call domination, to trade off the tightness of the bound vs. the amount of computation, we discuss a very natural parameterization of their method; for the case of alphabet size k = 2 we conjecture but do not prove that it also meets the sufficient condition and hence also yields a sequence of bounds that converges to the correct value of γ^sub 2^. For k &gt; 2, it does not meet our sufficient condition. Thus we leave open the question of whether some method based on the undominated collations of [3, 4] gives bounds converging to the correct value for any k ≥ 2. [PUBLICATION ABSTRACT]</description><identifier>EISSN: 2164-0343</identifier><language>eng</language><publisher>Philadelphia: Society for Industrial and Applied Mathematics</publisher><subject>Codes</subject><ispartof>Proceedings of the ... Workshop on Analytic Algorithmics and Combinatorics, 2008, p.169</ispartof><rights>Copyright Society for Industrial and Applied Mathematics 2008</rights><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.proquest.com/docview/962440068?pq-origsite=primo$$EHTML$$P50$$Gproquest$$H</linktohtml><link.rule.ids>309,310,780,784,789,790,11688,23930,23931,25140,36060,44363</link.rule.ids></links><search><creatorcontrib>Lueker, George S</creatorcontrib><title>On the Convergence of Upper Bound Techniques for the Average Length of Longest Common Subsequences</title><title>Proceedings of the ... Workshop on Analytic Algorithmics and Combinatorics</title><description>It has long been known [2] that the average length of the longest common subsequence of two random strings of length n over an alphabet of size k is asymptotic to γ^sub k^n for some constant γ^sub k^ depending on k. The value of these constants remains unknown, and a number of papers have proved upper and lower bounds on them. In particular, in [6] we used a modification of methods of [3, 4] for determining lower and upper bounds on γ^sub k^, combined with large computer computations, to obtain improved bounds on γ^sub 2^. The method of [6] involved a parameter h; empirically, increasing h increased the computation time but gave better upper bounds. Here we show, for arbitrary k, a sufficient condition for a parameterized method to produce a sequence of upper bounds approaching the true value of γ^sub k^, and show that a generalization of the method of [6] meets this condition for all k ≥ 2. While [3, 4] do not explicitly discuss how to parameterize their method, which is based on a concept they call domination, to trade off the tightness of the bound vs. the amount of computation, we discuss a very natural parameterization of their method; for the case of alphabet size k = 2 we conjecture but do not prove that it also meets the sufficient condition and hence also yields a sequence of bounds that converges to the correct value of γ^sub 2^. For k &gt; 2, it does not meet our sufficient condition. Thus we leave open the question of whether some method based on the undominated collations of [3, 4] gives bounds converging to the correct value for any k ≥ 2. [PUBLICATION ABSTRACT]</description><subject>Codes</subject><issn>2164-0343</issn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2008</creationdate><recordtype>conference_proceeding</recordtype><sourceid>M0C</sourceid><recordid>eNqNjM0KgkAURocgSMp3uLQXRp2sliVFC6FFtRa1qyZ5x-an52-MHqDVt_jOORPmRWEiAh6LeMZ8rTvOechXaxFFHivPBKZFSCW9UTVIFYKs4TYMqGAvLd3hilVLj5dFDbVUX3rn2KJByJAa045CJqlBbVyn7yXBxZYaneJyesGmdfHU6P92zpbHwzU9BYOSY9XknbSK3JVvk0gIzpNN_Bf0AYw5RJw</recordid><startdate>20080101</startdate><enddate>20080101</enddate><creator>Lueker, George S</creator><general>Society for Industrial and Applied Mathematics</general><scope>3V.</scope><scope>7WY</scope><scope>7WZ</scope><scope>7X2</scope><scope>7XB</scope><scope>87Z</scope><scope>88A</scope><scope>88F</scope><scope>88I</scope><scope>88K</scope><scope>8AL</scope><scope>8FE</scope><scope>8FG</scope><scope>8FH</scope><scope>8FK</scope><scope>8FL</scope><scope>8G5</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>ATCPS</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BEZIV</scope><scope>BGLVJ</scope><scope>BHPHI</scope><scope>CCPQU</scope><scope>D1I</scope><scope>DWQXO</scope><scope>FRNLG</scope><scope>F~G</scope><scope>GNUQQ</scope><scope>GUQSH</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K60</scope><scope>K6~</scope><scope>K7-</scope><scope>KB.</scope><scope>L.-</scope><scope>L6V</scope><scope>LK8</scope><scope>M0C</scope><scope>M0K</scope><scope>M0N</scope><scope>M1Q</scope><scope>M2O</scope><scope>M2P</scope><scope>M2T</scope><scope>M7P</scope><scope>M7S</scope><scope>MBDVC</scope><scope>P5Z</scope><scope>P62</scope><scope>PATMY</scope><scope>PDBOC</scope><scope>PQBIZ</scope><scope>PQBZA</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PTHSS</scope><scope>PYCSY</scope><scope>Q9U</scope></search><sort><creationdate>20080101</creationdate><title>On the Convergence of Upper Bound Techniques for the Average Length of Longest Common Subsequences</title><author>Lueker, George S</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_9624400683</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2008</creationdate><topic>Codes</topic><toplevel>online_resources</toplevel><creatorcontrib>Lueker, George S</creatorcontrib><collection>ProQuest Central (Corporate)</collection><collection>ABI商业信息数据库</collection><collection>ABI/INFORM Global (PDF only)</collection><collection>Agricultural Science Collection</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>ABI/INFORM Collection</collection><collection>Biology Database (Alumni Edition)</collection><collection>Military Database (Alumni Edition)</collection><collection>Science Database (Alumni Edition)</collection><collection>Telecommunications (Alumni Edition)</collection><collection>Computing Database (Alumni Edition)</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ABI/INFORM Collection (Alumni Edition)</collection><collection>Research Library (Alumni Edition)</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central</collection><collection>Advanced Technologies &amp; Aerospace Collection</collection><collection>Agricultural &amp; Environmental Science Collection</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science Collection</collection><collection>ProQuest Central</collection><collection>ProQuest Business Premium Collection</collection><collection>Technology Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Materials Science Collection</collection><collection>ProQuest Central</collection><collection>Business Premium Collection (Alumni)</collection><collection>ABI/INFORM Global (Corporate)</collection><collection>ProQuest Central Student</collection><collection>Research Library Prep</collection><collection>SciTech Premium Collection (Proquest) (PQ_SDU_P3)</collection><collection>ProQuest Computer Science Collection</collection><collection>ProQuest Business Collection (Alumni Edition)</collection><collection>ProQuest Business Collection</collection><collection>Computer science database</collection><collection>Materials Science Database</collection><collection>ABI/INFORM Professional Advanced</collection><collection>ProQuest Engineering Collection</collection><collection>Biological Sciences</collection><collection>ABI/INFORM Global</collection><collection>Agriculture Science Database</collection><collection>Computing Database</collection><collection>Military Database (Proquest) (PQ_SDU_P3)</collection><collection>ProQuest research library</collection><collection>ProQuest Science Journals</collection><collection>Telecommunications Database</collection><collection>ProQuest Biological Science Journals</collection><collection>Engineering Database</collection><collection>Research Library (Corporate)</collection><collection>ProQuest advanced technologies &amp; aerospace journals</collection><collection>ProQuest Advanced Technologies &amp; Aerospace Collection</collection><collection>Environmental Science Database</collection><collection>Materials science collection</collection><collection>One Business (ProQuest)</collection><collection>ProQuest One Business (Alumni)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>Engineering collection</collection><collection>Environmental Science Collection</collection><collection>ProQuest Central Basic</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Lueker, George S</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>On the Convergence of Upper Bound Techniques for the Average Length of Longest Common Subsequences</atitle><btitle>Proceedings of the ... Workshop on Analytic Algorithmics and Combinatorics</btitle><date>2008-01-01</date><risdate>2008</risdate><spage>169</spage><pages>169-</pages><eissn>2164-0343</eissn><abstract>It has long been known [2] that the average length of the longest common subsequence of two random strings of length n over an alphabet of size k is asymptotic to γ^sub k^n for some constant γ^sub k^ depending on k. The value of these constants remains unknown, and a number of papers have proved upper and lower bounds on them. In particular, in [6] we used a modification of methods of [3, 4] for determining lower and upper bounds on γ^sub k^, combined with large computer computations, to obtain improved bounds on γ^sub 2^. The method of [6] involved a parameter h; empirically, increasing h increased the computation time but gave better upper bounds. Here we show, for arbitrary k, a sufficient condition for a parameterized method to produce a sequence of upper bounds approaching the true value of γ^sub k^, and show that a generalization of the method of [6] meets this condition for all k ≥ 2. While [3, 4] do not explicitly discuss how to parameterize their method, which is based on a concept they call domination, to trade off the tightness of the bound vs. the amount of computation, we discuss a very natural parameterization of their method; for the case of alphabet size k = 2 we conjecture but do not prove that it also meets the sufficient condition and hence also yields a sequence of bounds that converges to the correct value of γ^sub 2^. For k &gt; 2, it does not meet our sufficient condition. Thus we leave open the question of whether some method based on the undominated collations of [3, 4] gives bounds converging to the correct value for any k ≥ 2. [PUBLICATION ABSTRACT]</abstract><cop>Philadelphia</cop><pub>Society for Industrial and Applied Mathematics</pub></addata></record>
fulltext fulltext
identifier EISSN: 2164-0343
ispartof Proceedings of the ... Workshop on Analytic Algorithmics and Combinatorics, 2008, p.169
issn 2164-0343
language eng
recordid cdi_proquest_journals_962440068
source ABI/INFORM Global
subjects Codes
title On the Convergence of Upper Bound Techniques for the Average Length of Longest Common Subsequences
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-02T17%3A11%3A37IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=On%20the%20Convergence%20of%20Upper%20Bound%20Techniques%20for%20the%20Average%20Length%20of%20Longest%20Common%20Subsequences&rft.btitle=Proceedings%20of%20the%20...%20Workshop%20on%20Analytic%20Algorithmics%20and%20Combinatorics&rft.au=Lueker,%20George%20S&rft.date=2008-01-01&rft.spage=169&rft.pages=169-&rft.eissn=2164-0343&rft_id=info:doi/&rft_dat=%3Cproquest%3E2618264621%3C/proquest%3E%3Cgrp_id%3Ecdi_FETCH-proquest_journals_9624400683%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=962440068&rft_id=info:pmid/&rfr_iscdi=true