
An algorithm for random match probability calculation from peptide sequences


Bibliographic Details
Published in: Forensic Science International: Genetics, 2020-07, Vol. 47, Article 102295
Main Authors: Woerner, August E., Hewitt, F. Curtis, Gardner, Myles W., Freitas, Michael A., Schulte, Kathleen Q., LeSassier, Danielle S., Baniasad, Maryam, Reed, Andrew J., Powals, Megan E., Smith, Alan R., Albright, Nicolette C., Ludolph, Benjamin C., Zhang, Liwen, Allen, Leah W., Weber, Katharina, Budowle, Bruce
Format: Article
Language: English
Description
Summary:
• Random match probability from peptide sequences.
• Accommodates LD.
• Robust to drop-in events.
• Open source.

For the past three decades, forensic genetic investigations have focused on elucidating DNA signatures. While DNA has a number of desirable properties (e.g., presence in most biological materials, an amenable chemistry for analysis and well-developed statistics), DNA also has limitations. DNA may be in low quantity in some tissues, such as hair, and in some tissues it may degrade more readily than its protein counterparts. Recent research efforts have shown the feasibility of performing protein-based human identification in cases in which recovery of DNA is challenged; however, the methods involved in assessing the rarity of a given protein profile have not been addressed adequately. In this paper an algorithm is proposed that describes the computation of a random match probability (RMP) resulting from a genetically variable peptide signature. The approach described herein explicitly models proteomic error and genetic linkage, makes no assumptions as to allelic drop-out, and maps the observed proteomic alleles to their expected protein products from DNA which, in turn, permits standard corrections for population structure and finite database sizes. To assess the feasibility of this approach, RMPs were estimated from peptide profiles of skin samples from 25 individuals of European ancestry. 126 common peptide alleles were used in this approach, yielding a mean RMP of approximately 10⁻².
ISSN: 1872-4973, 1878-0326
DOI:10.1016/j.fsigen.2020.102295
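
Illustrative note: the summary above describes computing an RMP from detected peptide alleles without assuming anything about allelic drop-out. The Python sketch below is one simplified way to picture that idea; it is not the published algorithm. It assumes Hardy-Weinberg proportions, treats loci as independent (so it ignores the linkage disequilibrium the paper accommodates), and omits proteomic error, drop-in, and the population-structure and database-size corrections. The function names, locus labels, and allele frequencies are hypothetical.

```python
import math


def locus_match_probability(detected_freqs):
    """Probability that a random person's genotype is consistent with
    the peptide alleles detected at one locus (assumes Hardy-Weinberg).

    detected_freqs: population frequencies of the distinct alleles
    detected at the locus (one or two values).
    """
    for f in detected_freqs:
        if not 0.0 < f <= 1.0:
            raise ValueError("allele frequencies must lie in (0, 1]")
    if len(detected_freqs) == 1:
        # Only one allele was seen. Because a missing peptide is treated
        # as uninformative (no drop-out assumption), any genotype with
        # at least one copy of the allele is consistent: 1 - (1 - p)^2.
        p = detected_freqs[0]
        return 1.0 - (1.0 - p) ** 2
    if len(detected_freqs) == 2:
        # Both alleles were seen, so the donor must be heterozygous: 2pq.
        p, q = detected_freqs
        return 2.0 * p * q
    raise ValueError("expected one or two detected alleles per locus")


def random_match_probability(profile):
    """Multiply per-locus probabilities (assumes independent loci)."""
    return math.prod(locus_match_probability(f) for f in profile.values())


# Hypothetical profile: locus label -> frequencies of detected alleles.
example_profile = {
    "locus_A": [0.5],        # one allele detected
    "locus_B": [0.2, 0.8],   # both alleles detected
}
rmp = random_match_probability(example_profile)
```

In this simplified model, a locus where only one allele is detected contributes relatively little discriminating power, because homozygous and heterozygous carriers both remain consistent with the evidence.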