Prediction in high‐dimensional linear models and application to genomic selection under imperfect linkage disequilibrium

Bibliographic Details
Published in: Journal of the Royal Statistical Society Series C: Applied Statistics 2021-08, Vol.70 (4), p.1001-1026
Main Authors: Rabier, Charles‐Elie, Grusea, Simona
Format: Article
Language: English
Description
Summary: Genomic selection (GS) consists of predicting the breeding values of selection candidates using a large number of genetic markers. An important question in GS is how many markers are required for a good prediction. To address it, we introduce new proxies for the accuracy of the prediction. These proxies are suitable under a sparse genetic map, where imperfect linkage disequilibrium is likely to be observed, that is, the situation where the alleles at a gene location and at a nearby marker differ. Moreover, the proposed proxies are helpful for designing cost‐effective SNP chips based on a moderate density of markers. We analyse rice data from Los Baños, Philippines, focusing on flowering time collected during the 2012 dry season. Using different marker densities, we show that at least 1553 markers are required to implement GS. Finding the optimal number of markers is crucial for optimizing the breeding program.
ISSN: 0035-9254, 1467-9876
DOI: 10.1111/rssc.12496