Loading…
Improvement of quantitative structure–retention relationship models for chromatographic retention prediction of peptides applying individual local partial least squares models
In Reversed-Phase Liquid Chromatography, Quantitative Structure–Retention Relationship (QSRR) models for retention prediction of peptides can be built, starting from large sets of theoretical molecular descriptors. Good predictive QSRR models can be obtained after selecting the most informative desc...
Saved in:
Published in: | Talanta (Oxford) 2020-11, Vol.219, p.121266-121266, Article 121266 |
---|---|
Main Authors: | , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | In Reversed-Phase Liquid Chromatography, Quantitative Structure–Retention Relationship (QSRR) models for retention prediction of peptides can be built, starting from large sets of theoretical molecular descriptors. Good predictive QSRR models can be obtained after selecting the most informative descriptors. Reliable retention prediction may be an aid in the correct identification of proteins/peptides in proteomics and in chromatographic method development. Traditionally, global QSRR models are built, using a calibration set containing a representative range of analytes. In this study, a strategy is presented to build individual local Partial Least Squares (PLS) models for peptides, based on selected local calibration samples, most similar to the specific query peptide to be predicted. Similar local calibration peptides are selected from a possible calibration set. The calibration samples with the lowest Euclidian distances to the query peptide are considered as most similar. Two Euclidian distances are investigated as similarity parameter, (i) in the autoscaled descriptor space and, (ii) in the PLS factor space of the global calibration samples, both after variable selection by the Final Complexity Adapted Models (FCAM) method. The predictive abilities of individual local QSRR PLS models for peptides, developed with both Euclidian distances, are found significantly better than those of two global models, i.e. before and after FCAM variable selection. The predictive abilities of the local models, developed with distances calculated in the PLS factor space, were best.
[Display omitted]
•Individual local QSRR PLS models are built for retention prediction of peptides.•Informative molecular descriptors are selected by the FCAM method.•Local calibration peptides are selected from a global set.•Predictivities of the local models are better than those of the global models. |
---|---|
ISSN: | 0039-9140 1873-3573 |
DOI: | 10.1016/j.talanta.2020.121266 |