Loading…

Prediction of HIV status based on socio-behavioural characteristics in East and Southern Africa

High yield HIV testing strategies are critical to reach epidemic control in high prevalence and low-resource settings such as East and Southern Africa. In this study, we aimed to predict the HIV status of individuals living in Angola, Burundi, Ethiopia, Lesotho, Malawi, Mozambique, Namibia, Rwanda,...

Full description

Saved in:
Bibliographic Details
Published in:PloS one 2022-03, Vol.17 (3), p.e0264429
Main Authors: Orel, Erol, Esra, Rachel, Estill, Janne, Thiabaud, Amaury, Marchand-Maillet, Stéphane, Merzouki, Aziza, Keiser, Olivia
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c622t-929ddc8680e7605214abcfb491214b002b566cadeccea1bc865f35e85e1e57063
cites cdi_FETCH-LOGICAL-c622t-929ddc8680e7605214abcfb491214b002b566cadeccea1bc865f35e85e1e57063
container_end_page
container_issue 3
container_start_page e0264429
container_title PloS one
container_volume 17
creator Orel, Erol
Esra, Rachel
Estill, Janne
Thiabaud, Amaury
Marchand-Maillet, Stéphane
Merzouki, Aziza
Keiser, Olivia
description High yield HIV testing strategies are critical to reach epidemic control in high prevalence and low-resource settings such as East and Southern Africa. In this study, we aimed to predict the HIV status of individuals living in Angola, Burundi, Ethiopia, Lesotho, Malawi, Mozambique, Namibia, Rwanda, Zambia and Zimbabwe with the highest precision and sensitivity for different policy targets and constraints based on a minimal set of socio-behavioural characteristics. We analysed the most recent Demographic and Health Survey from these 10 countries to predict individual's HIV status using four different algorithms (a penalized logistic regression, a generalized additive model, a support vector machine, and a gradient boosting trees). The algorithms were trained and validated on 80% of the data, and tested on the remaining 20%. We compared the predictions based on the F1 score, the harmonic mean of sensitivity and positive predictive value (PPV), and we assessed the generalization of our models by testing them against an independent left-out country. The best performing algorithm was trained on a minimal subset of variables which were identified as the most predictive, and used to 1) identify 95% of people living with HIV (PLHIV) while maximising precision and 2) identify groups of individuals by adjusting the probability threshold of being HIV positive (90% in our scenario) for achieving specific testing strategies. Overall 55,151 males and 69,626 females were included in the analysis. The gradient boosting trees algorithm performed best in predicting HIV status with a mean F1 score of 76.8% [95% confidence interval (CI) 76.0%-77.6%] for males (vs [CI 67.8%-70.6%] for SVM) and 78.8% [CI 78.2%-79.4%] for females (vs [CI 73.4%-75.8%] for SVM). Among the ten most predictive variables for each sex, nine were identical: longitude, latitude and, altitude of place of residence, current age, age of most recent partner, total lifetime number of sexual partners, years lived in current place of residence, condom use during last intercourse and, wealth index. Only age at first sex for male (ranked 10th) and Rohrer's index for female (ranked 6th) were not similar for both sexes. Our large-scale scenario, which consisted in identifying 95% of all PLHIV, would have required testing 49.4% of males and 48.1% of females while achieving a precision of 15.4% for males and 22.7% for females. For the second scenario, only 4.6% of males and 6.0% of females would have had to be teste
doi_str_mv 10.1371/journal.pone.0264429
format article
fullrecord <record><control><sourceid>gale_plos_</sourceid><recordid>TN_cdi_plos_journals_2635488032</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><galeid>A695596877</galeid><doaj_id>oai_doaj_org_article_1fdab1f21ae34a4fa2312ec2237559f1</doaj_id><sourcerecordid>A695596877</sourcerecordid><originalsourceid>FETCH-LOGICAL-c622t-929ddc8680e7605214abcfb491214b002b566cadeccea1bc865f35e85e1e57063</originalsourceid><addsrcrecordid>eNqNk01v1DAQhiMEoqXwDxBEQkJw2MUfiZNckFZVoStVKqLQqzVxxhuvsvFiOxX99_V202qDeuBka_zMOx-eSZK3lMwpL-iXtR1cD918a3ucEyayjFXPkmNacTYTjPDnB_ej5JX3a0JyXgrxMjniOeOVqIrjRP5w2BgVjO1Tq9Pz5XXqA4TBpzV4bNJo9lYZO6uxhRsTY0KXqhYcqIDO-GCUT02fnoEPKfRNemWH0KLr04V2RsHr5IWGzuOb8TxJfn87-3V6Pru4_L48XVzMlGAszCpWNY0qRUmwECRnNINa6TqraLzWhLA6F0JBg0oh0DqSueY5ljlSzAsi-Enyfq-77ayXY2-8ZILnWVkSziKx3BONhbXcOrMBdystGHlvsG4lwcVyOpRUN1BTzSggzyDTwDhlqBjjRZ5Xmkatr2O0od5go7APsS8T0elLb1q5sjeyLCsuyiwKfBoFnP0zoA9yY7zCroMe7XCft6BZFlOP6Id_0KerG6kVxAJMr22Mq3aiciGqmLUoiyJS8yco2DV2Y1QcJG2ifeLweeIQmYB_wwoG7-Xy6uf_s5fXU_bjAdsidKH1tht2c-inYLYHlbPeO9SPTaZE7vbgoRtytwdy3IPo9u7wgx6dHgaf3wGlNQL8</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2635488032</pqid></control><display><type>article</type><title>Prediction of HIV status based on socio-behavioural characteristics in East and Southern Africa</title><source>Access via ProQuest (Open Access)</source><source>PubMed Central</source><creator>Orel, Erol ; Esra, Rachel ; Estill, Janne ; Thiabaud, Amaury ; Marchand-Maillet, Stéphane ; Merzouki, Aziza ; Keiser, Olivia</creator><contributor>Boateng, Daniel</contributor><creatorcontrib>Orel, Erol ; Esra, Rachel ; Estill, Janne ; Thiabaud, Amaury ; Marchand-Maillet, Stéphane ; Merzouki, Aziza ; Keiser, Olivia ; Boateng, Daniel</creatorcontrib><description>High yield HIV testing strategies are critical to reach epidemic control in high prevalence and low-resource settings such as East and Southern Africa. In this study, we aimed to predict the HIV status of individuals living in Angola, Burundi, Ethiopia, Lesotho, Malawi, Mozambique, Namibia, Rwanda, Zambia and Zimbabwe with the highest precision and sensitivity for different policy targets and constraints based on a minimal set of socio-behavioural characteristics. We analysed the most recent Demographic and Health Survey from these 10 countries to predict individual's HIV status using four different algorithms (a penalized logistic regression, a generalized additive model, a support vector machine, and a gradient boosting trees). The algorithms were trained and validated on 80% of the data, and tested on the remaining 20%. We compared the predictions based on the F1 score, the harmonic mean of sensitivity and positive predictive value (PPV), and we assessed the generalization of our models by testing them against an independent left-out country. The best performing algorithm was trained on a minimal subset of variables which were identified as the most predictive, and used to 1) identify 95% of people living with HIV (PLHIV) while maximising precision and 2) identify groups of individuals by adjusting the probability threshold of being HIV positive (90% in our scenario) for achieving specific testing strategies. Overall 55,151 males and 69,626 females were included in the analysis. The gradient boosting trees algorithm performed best in predicting HIV status with a mean F1 score of 76.8% [95% confidence interval (CI) 76.0%-77.6%] for males (vs [CI 67.8%-70.6%] for SVM) and 78.8% [CI 78.2%-79.4%] for females (vs [CI 73.4%-75.8%] for SVM). Among the ten most predictive variables for each sex, nine were identical: longitude, latitude and, altitude of place of residence, current age, age of most recent partner, total lifetime number of sexual partners, years lived in current place of residence, condom use during last intercourse and, wealth index. Only age at first sex for male (ranked 10th) and Rohrer's index for female (ranked 6th) were not similar for both sexes. Our large-scale scenario, which consisted in identifying 95% of all PLHIV, would have required testing 49.4% of males and 48.1% of females while achieving a precision of 15.4% for males and 22.7% for females. For the second scenario, only 4.6% of males and 6.0% of females would have had to be tested to find 55.7% of all males and 50.5% of all females living with HIV. We trained a gradient boosting trees algorithm to find 95% of PLHIV with a precision twice higher than with general population testing by using only a limited number of socio-behavioural characteristics. We also successfully identified people at high risk of infection who may be offered pre-exposure prophylaxis or voluntary medical male circumcision. These findings can inform the implementation of new high-yield HIV tests and help develop very precise strategies based on low-resource settings constraints.</description><identifier>ISSN: 1932-6203</identifier><identifier>EISSN: 1932-6203</identifier><identifier>DOI: 10.1371/journal.pone.0264429</identifier><identifier>PMID: 35239697</identifier><language>eng</language><publisher>United States: Public Library of Science</publisher><subject>Africa ; Africa, Southern - epidemiology ; Age ; Algorithms ; Biology and Life Sciences ; Care and treatment ; Circumcision, Male ; Computer and Information Sciences ; Confidence intervals ; Control ; Datasets ; Disease prevention ; Epidemics ; Female ; Females ; Health risks ; HIV ; HIV infection ; HIV Infections - diagnosis ; HIV Infections - drug therapy ; HIV Infections - epidemiology ; HIV patients ; HIV Testing ; Human immunodeficiency virus ; Humans ; Male ; Males ; Medicine and Health Sciences ; Methods ; Performance prediction ; Physical Sciences ; Pre-Exposure Prophylaxis ; Prevention ; Prophylaxis ; Regression models ; Research and Analysis Methods ; Risk factors ; Sex ; Sexual partners ; Sexually transmitted diseases ; Statistical analysis ; STD ; Support vector machines ; Trees ; Variables</subject><ispartof>PloS one, 2022-03, Vol.17 (3), p.e0264429</ispartof><rights>COPYRIGHT 2022 Public Library of Science</rights><rights>2022 Orel et al. This is an open access article distributed under the terms of the Creative Commons Attribution License: http://creativecommons.org/licenses/by/4.0/ (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><rights>2022 Orel et al 2022 Orel et al</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c622t-929ddc8680e7605214abcfb491214b002b566cadeccea1bc865f35e85e1e57063</citedby><cites>FETCH-LOGICAL-c622t-929ddc8680e7605214abcfb491214b002b566cadeccea1bc865f35e85e1e57063</cites><orcidid>0000-0003-1544-415X ; 0000-0002-4875-6101 ; 0000-0003-4652-4750</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.proquest.com/docview/2635488032/fulltextPDF?pq-origsite=primo$$EPDF$$P50$$Gproquest$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.proquest.com/docview/2635488032?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>230,314,727,780,784,885,25753,27924,27925,37012,37013,44590,53791,53793,75126</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/35239697$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><contributor>Boateng, Daniel</contributor><creatorcontrib>Orel, Erol</creatorcontrib><creatorcontrib>Esra, Rachel</creatorcontrib><creatorcontrib>Estill, Janne</creatorcontrib><creatorcontrib>Thiabaud, Amaury</creatorcontrib><creatorcontrib>Marchand-Maillet, Stéphane</creatorcontrib><creatorcontrib>Merzouki, Aziza</creatorcontrib><creatorcontrib>Keiser, Olivia</creatorcontrib><title>Prediction of HIV status based on socio-behavioural characteristics in East and Southern Africa</title><title>PloS one</title><addtitle>PLoS One</addtitle><description>High yield HIV testing strategies are critical to reach epidemic control in high prevalence and low-resource settings such as East and Southern Africa. In this study, we aimed to predict the HIV status of individuals living in Angola, Burundi, Ethiopia, Lesotho, Malawi, Mozambique, Namibia, Rwanda, Zambia and Zimbabwe with the highest precision and sensitivity for different policy targets and constraints based on a minimal set of socio-behavioural characteristics. We analysed the most recent Demographic and Health Survey from these 10 countries to predict individual's HIV status using four different algorithms (a penalized logistic regression, a generalized additive model, a support vector machine, and a gradient boosting trees). The algorithms were trained and validated on 80% of the data, and tested on the remaining 20%. We compared the predictions based on the F1 score, the harmonic mean of sensitivity and positive predictive value (PPV), and we assessed the generalization of our models by testing them against an independent left-out country. The best performing algorithm was trained on a minimal subset of variables which were identified as the most predictive, and used to 1) identify 95% of people living with HIV (PLHIV) while maximising precision and 2) identify groups of individuals by adjusting the probability threshold of being HIV positive (90% in our scenario) for achieving specific testing strategies. Overall 55,151 males and 69,626 females were included in the analysis. The gradient boosting trees algorithm performed best in predicting HIV status with a mean F1 score of 76.8% [95% confidence interval (CI) 76.0%-77.6%] for males (vs [CI 67.8%-70.6%] for SVM) and 78.8% [CI 78.2%-79.4%] for females (vs [CI 73.4%-75.8%] for SVM). Among the ten most predictive variables for each sex, nine were identical: longitude, latitude and, altitude of place of residence, current age, age of most recent partner, total lifetime number of sexual partners, years lived in current place of residence, condom use during last intercourse and, wealth index. Only age at first sex for male (ranked 10th) and Rohrer's index for female (ranked 6th) were not similar for both sexes. Our large-scale scenario, which consisted in identifying 95% of all PLHIV, would have required testing 49.4% of males and 48.1% of females while achieving a precision of 15.4% for males and 22.7% for females. For the second scenario, only 4.6% of males and 6.0% of females would have had to be tested to find 55.7% of all males and 50.5% of all females living with HIV. We trained a gradient boosting trees algorithm to find 95% of PLHIV with a precision twice higher than with general population testing by using only a limited number of socio-behavioural characteristics. We also successfully identified people at high risk of infection who may be offered pre-exposure prophylaxis or voluntary medical male circumcision. These findings can inform the implementation of new high-yield HIV tests and help develop very precise strategies based on low-resource settings constraints.</description><subject>Africa</subject><subject>Africa, Southern - epidemiology</subject><subject>Age</subject><subject>Algorithms</subject><subject>Biology and Life Sciences</subject><subject>Care and treatment</subject><subject>Circumcision, Male</subject><subject>Computer and Information Sciences</subject><subject>Confidence intervals</subject><subject>Control</subject><subject>Datasets</subject><subject>Disease prevention</subject><subject>Epidemics</subject><subject>Female</subject><subject>Females</subject><subject>Health risks</subject><subject>HIV</subject><subject>HIV infection</subject><subject>HIV Infections - diagnosis</subject><subject>HIV Infections - drug therapy</subject><subject>HIV Infections - epidemiology</subject><subject>HIV patients</subject><subject>HIV Testing</subject><subject>Human immunodeficiency virus</subject><subject>Humans</subject><subject>Male</subject><subject>Males</subject><subject>Medicine and Health Sciences</subject><subject>Methods</subject><subject>Performance prediction</subject><subject>Physical Sciences</subject><subject>Pre-Exposure Prophylaxis</subject><subject>Prevention</subject><subject>Prophylaxis</subject><subject>Regression models</subject><subject>Research and Analysis Methods</subject><subject>Risk factors</subject><subject>Sex</subject><subject>Sexual partners</subject><subject>Sexually transmitted diseases</subject><subject>Statistical analysis</subject><subject>STD</subject><subject>Support vector machines</subject><subject>Trees</subject><subject>Variables</subject><issn>1932-6203</issn><issn>1932-6203</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>PIMPY</sourceid><sourceid>DOA</sourceid><recordid>eNqNk01v1DAQhiMEoqXwDxBEQkJw2MUfiZNckFZVoStVKqLQqzVxxhuvsvFiOxX99_V202qDeuBka_zMOx-eSZK3lMwpL-iXtR1cD918a3ucEyayjFXPkmNacTYTjPDnB_ej5JX3a0JyXgrxMjniOeOVqIrjRP5w2BgVjO1Tq9Pz5XXqA4TBpzV4bNJo9lYZO6uxhRsTY0KXqhYcqIDO-GCUT02fnoEPKfRNemWH0KLr04V2RsHr5IWGzuOb8TxJfn87-3V6Pru4_L48XVzMlGAszCpWNY0qRUmwECRnNINa6TqraLzWhLA6F0JBg0oh0DqSueY5ljlSzAsi-Enyfq-77ayXY2-8ZILnWVkSziKx3BONhbXcOrMBdystGHlvsG4lwcVyOpRUN1BTzSggzyDTwDhlqBjjRZ5Xmkatr2O0od5go7APsS8T0elLb1q5sjeyLCsuyiwKfBoFnP0zoA9yY7zCroMe7XCft6BZFlOP6Id_0KerG6kVxAJMr22Mq3aiciGqmLUoiyJS8yco2DV2Y1QcJG2ifeLweeIQmYB_wwoG7-Xy6uf_s5fXU_bjAdsidKH1tht2c-inYLYHlbPeO9SPTaZE7vbgoRtytwdy3IPo9u7wgx6dHgaf3wGlNQL8</recordid><startdate>20220303</startdate><enddate>20220303</enddate><creator>Orel, Erol</creator><creator>Esra, Rachel</creator><creator>Estill, Janne</creator><creator>Thiabaud, Amaury</creator><creator>Marchand-Maillet, Stéphane</creator><creator>Merzouki, Aziza</creator><creator>Keiser, Olivia</creator><general>Public Library of Science</general><general>Public Library of Science (PLoS)</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>IOV</scope><scope>ISR</scope><scope>3V.</scope><scope>7QG</scope><scope>7QL</scope><scope>7QO</scope><scope>7RV</scope><scope>7SN</scope><scope>7SS</scope><scope>7T5</scope><scope>7TG</scope><scope>7TM</scope><scope>7U9</scope><scope>7X2</scope><scope>7X7</scope><scope>7XB</scope><scope>88E</scope><scope>8AO</scope><scope>8C1</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FH</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>ATCPS</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>BHPHI</scope><scope>C1K</scope><scope>CCPQU</scope><scope>D1I</scope><scope>DWQXO</scope><scope>FR3</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>H94</scope><scope>HCIFZ</scope><scope>K9.</scope><scope>KB.</scope><scope>KB0</scope><scope>KL.</scope><scope>L6V</scope><scope>LK8</scope><scope>M0K</scope><scope>M0S</scope><scope>M1P</scope><scope>M7N</scope><scope>M7P</scope><scope>M7S</scope><scope>NAPCQ</scope><scope>P5Z</scope><scope>P62</scope><scope>P64</scope><scope>PATMY</scope><scope>PDBOC</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PTHSS</scope><scope>PYCSY</scope><scope>RC3</scope><scope>7X8</scope><scope>5PM</scope><scope>DOA</scope><orcidid>https://orcid.org/0000-0003-1544-415X</orcidid><orcidid>https://orcid.org/0000-0002-4875-6101</orcidid><orcidid>https://orcid.org/0000-0003-4652-4750</orcidid></search><sort><creationdate>20220303</creationdate><title>Prediction of HIV status based on socio-behavioural characteristics in East and Southern Africa</title><author>Orel, Erol ; Esra, Rachel ; Estill, Janne ; Thiabaud, Amaury ; Marchand-Maillet, Stéphane ; Merzouki, Aziza ; Keiser, Olivia</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c622t-929ddc8680e7605214abcfb491214b002b566cadeccea1bc865f35e85e1e57063</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Africa</topic><topic>Africa, Southern - epidemiology</topic><topic>Age</topic><topic>Algorithms</topic><topic>Biology and Life Sciences</topic><topic>Care and treatment</topic><topic>Circumcision, Male</topic><topic>Computer and Information Sciences</topic><topic>Confidence intervals</topic><topic>Control</topic><topic>Datasets</topic><topic>Disease prevention</topic><topic>Epidemics</topic><topic>Female</topic><topic>Females</topic><topic>Health risks</topic><topic>HIV</topic><topic>HIV infection</topic><topic>HIV Infections - diagnosis</topic><topic>HIV Infections - drug therapy</topic><topic>HIV Infections - epidemiology</topic><topic>HIV patients</topic><topic>HIV Testing</topic><topic>Human immunodeficiency virus</topic><topic>Humans</topic><topic>Male</topic><topic>Males</topic><topic>Medicine and Health Sciences</topic><topic>Methods</topic><topic>Performance prediction</topic><topic>Physical Sciences</topic><topic>Pre-Exposure Prophylaxis</topic><topic>Prevention</topic><topic>Prophylaxis</topic><topic>Regression models</topic><topic>Research and Analysis Methods</topic><topic>Risk factors</topic><topic>Sex</topic><topic>Sexual partners</topic><topic>Sexually transmitted diseases</topic><topic>Statistical analysis</topic><topic>STD</topic><topic>Support vector machines</topic><topic>Trees</topic><topic>Variables</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Orel, Erol</creatorcontrib><creatorcontrib>Esra, Rachel</creatorcontrib><creatorcontrib>Estill, Janne</creatorcontrib><creatorcontrib>Thiabaud, Amaury</creatorcontrib><creatorcontrib>Marchand-Maillet, Stéphane</creatorcontrib><creatorcontrib>Merzouki, Aziza</creatorcontrib><creatorcontrib>Keiser, Olivia</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Gale In Context: Opposing Viewpoints</collection><collection>Gale In Context: Science</collection><collection>ProQuest Central (Corporate)</collection><collection>Animal Behavior Abstracts</collection><collection>Bacteriology Abstracts (Microbiology B)</collection><collection>Biotechnology Research Abstracts</collection><collection>Nursing &amp; Allied Health Database</collection><collection>Ecology Abstracts</collection><collection>Entomology Abstracts (Full archive)</collection><collection>Immunology Abstracts</collection><collection>Meteorological &amp; Geoastrophysical Abstracts</collection><collection>Nucleic Acids Abstracts</collection><collection>Virology and AIDS Abstracts</collection><collection>Agricultural Science Collection</collection><collection>Health &amp; Medical Collection</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Medical Database (Alumni Edition)</collection><collection>ProQuest Pharma Collection</collection><collection>Public Health Database</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central</collection><collection>Advanced Technologies &amp; Aerospace Collection</collection><collection>Agricultural &amp; Environmental Science Collection</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science Collection</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Environmental Sciences and Pollution Management</collection><collection>ProQuest One Community College</collection><collection>ProQuest Materials Science Collection</collection><collection>ProQuest Central</collection><collection>Engineering Research Database</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>AIDS and Cancer Research Abstracts</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Health &amp; Medical Complete (Alumni)</collection><collection>https://resources.nclive.org/materials</collection><collection>Nursing &amp; Allied Health Database (Alumni Edition)</collection><collection>Meteorological &amp; Geoastrophysical Abstracts - Academic</collection><collection>ProQuest Engineering Collection</collection><collection>ProQuest Biological Science Collection</collection><collection>Agriculture Science Database</collection><collection>Health &amp; Medical Collection (Alumni Edition)</collection><collection>Medical Database</collection><collection>Algology Mycology and Protozoology Abstracts (Microbiology C)</collection><collection>ProQuest Biological Science Journals</collection><collection>Engineering Database</collection><collection>Nursing &amp; Allied Health Premium</collection><collection>ProQuest advanced technologies &amp; aerospace journals</collection><collection>ProQuest Advanced Technologies &amp; Aerospace Collection</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Environmental Science Database</collection><collection>Materials science collection</collection><collection>Access via ProQuest (Open Access)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>Engineering collection</collection><collection>Environmental Science Collection</collection><collection>Genetics Abstracts</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>PloS one</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Orel, Erol</au><au>Esra, Rachel</au><au>Estill, Janne</au><au>Thiabaud, Amaury</au><au>Marchand-Maillet, Stéphane</au><au>Merzouki, Aziza</au><au>Keiser, Olivia</au><au>Boateng, Daniel</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Prediction of HIV status based on socio-behavioural characteristics in East and Southern Africa</atitle><jtitle>PloS one</jtitle><addtitle>PLoS One</addtitle><date>2022-03-03</date><risdate>2022</risdate><volume>17</volume><issue>3</issue><spage>e0264429</spage><pages>e0264429-</pages><issn>1932-6203</issn><eissn>1932-6203</eissn><abstract>High yield HIV testing strategies are critical to reach epidemic control in high prevalence and low-resource settings such as East and Southern Africa. In this study, we aimed to predict the HIV status of individuals living in Angola, Burundi, Ethiopia, Lesotho, Malawi, Mozambique, Namibia, Rwanda, Zambia and Zimbabwe with the highest precision and sensitivity for different policy targets and constraints based on a minimal set of socio-behavioural characteristics. We analysed the most recent Demographic and Health Survey from these 10 countries to predict individual's HIV status using four different algorithms (a penalized logistic regression, a generalized additive model, a support vector machine, and a gradient boosting trees). The algorithms were trained and validated on 80% of the data, and tested on the remaining 20%. We compared the predictions based on the F1 score, the harmonic mean of sensitivity and positive predictive value (PPV), and we assessed the generalization of our models by testing them against an independent left-out country. The best performing algorithm was trained on a minimal subset of variables which were identified as the most predictive, and used to 1) identify 95% of people living with HIV (PLHIV) while maximising precision and 2) identify groups of individuals by adjusting the probability threshold of being HIV positive (90% in our scenario) for achieving specific testing strategies. Overall 55,151 males and 69,626 females were included in the analysis. The gradient boosting trees algorithm performed best in predicting HIV status with a mean F1 score of 76.8% [95% confidence interval (CI) 76.0%-77.6%] for males (vs [CI 67.8%-70.6%] for SVM) and 78.8% [CI 78.2%-79.4%] for females (vs [CI 73.4%-75.8%] for SVM). Among the ten most predictive variables for each sex, nine were identical: longitude, latitude and, altitude of place of residence, current age, age of most recent partner, total lifetime number of sexual partners, years lived in current place of residence, condom use during last intercourse and, wealth index. Only age at first sex for male (ranked 10th) and Rohrer's index for female (ranked 6th) were not similar for both sexes. Our large-scale scenario, which consisted in identifying 95% of all PLHIV, would have required testing 49.4% of males and 48.1% of females while achieving a precision of 15.4% for males and 22.7% for females. For the second scenario, only 4.6% of males and 6.0% of females would have had to be tested to find 55.7% of all males and 50.5% of all females living with HIV. We trained a gradient boosting trees algorithm to find 95% of PLHIV with a precision twice higher than with general population testing by using only a limited number of socio-behavioural characteristics. We also successfully identified people at high risk of infection who may be offered pre-exposure prophylaxis or voluntary medical male circumcision. These findings can inform the implementation of new high-yield HIV tests and help develop very precise strategies based on low-resource settings constraints.</abstract><cop>United States</cop><pub>Public Library of Science</pub><pmid>35239697</pmid><doi>10.1371/journal.pone.0264429</doi><tpages>e0264429</tpages><orcidid>https://orcid.org/0000-0003-1544-415X</orcidid><orcidid>https://orcid.org/0000-0002-4875-6101</orcidid><orcidid>https://orcid.org/0000-0003-4652-4750</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1932-6203
ispartof PloS one, 2022-03, Vol.17 (3), p.e0264429
issn 1932-6203
1932-6203
language eng
recordid cdi_plos_journals_2635488032
source Access via ProQuest (Open Access); PubMed Central
subjects Africa
Africa, Southern - epidemiology
Age
Algorithms
Biology and Life Sciences
Care and treatment
Circumcision, Male
Computer and Information Sciences
Confidence intervals
Control
Datasets
Disease prevention
Epidemics
Female
Females
Health risks
HIV
HIV infection
HIV Infections - diagnosis
HIV Infections - drug therapy
HIV Infections - epidemiology
HIV patients
HIV Testing
Human immunodeficiency virus
Humans
Male
Males
Medicine and Health Sciences
Methods
Performance prediction
Physical Sciences
Pre-Exposure Prophylaxis
Prevention
Prophylaxis
Regression models
Research and Analysis Methods
Risk factors
Sex
Sexual partners
Sexually transmitted diseases
Statistical analysis
STD
Support vector machines
Trees
Variables
title Prediction of HIV status based on socio-behavioural characteristics in East and Southern Africa
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-01T13%3A41%3A08IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-gale_plos_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Prediction%20of%20HIV%20status%20based%20on%20socio-behavioural%20characteristics%20in%20East%20and%20Southern%20Africa&rft.jtitle=PloS%20one&rft.au=Orel,%20Erol&rft.date=2022-03-03&rft.volume=17&rft.issue=3&rft.spage=e0264429&rft.pages=e0264429-&rft.issn=1932-6203&rft.eissn=1932-6203&rft_id=info:doi/10.1371/journal.pone.0264429&rft_dat=%3Cgale_plos_%3EA695596877%3C/gale_plos_%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c622t-929ddc8680e7605214abcfb491214b002b566cadeccea1bc865f35e85e1e57063%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2635488032&rft_id=info:pmid/35239697&rft_galeid=A695596877&rfr_iscdi=true