Loading…
Prediction of HIV status based on socio-behavioural characteristics in East and Southern Africa
High yield HIV testing strategies are critical to reach epidemic control in high prevalence and low-resource settings such as East and Southern Africa. In this study, we aimed to predict the HIV status of individuals living in Angola, Burundi, Ethiopia, Lesotho, Malawi, Mozambique, Namibia, Rwanda,...
Saved in:
Published in: | PloS one 2022-03, Vol.17 (3), p.e0264429 |
---|---|
Main Authors: | , , , , , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | cdi_FETCH-LOGICAL-c622t-929ddc8680e7605214abcfb491214b002b566cadeccea1bc865f35e85e1e57063 |
---|---|
cites | cdi_FETCH-LOGICAL-c622t-929ddc8680e7605214abcfb491214b002b566cadeccea1bc865f35e85e1e57063 |
container_end_page | |
container_issue | 3 |
container_start_page | e0264429 |
container_title | PloS one |
container_volume | 17 |
creator | Orel, Erol Esra, Rachel Estill, Janne Thiabaud, Amaury Marchand-Maillet, Stéphane Merzouki, Aziza Keiser, Olivia |
description | High yield HIV testing strategies are critical to reach epidemic control in high prevalence and low-resource settings such as East and Southern Africa. In this study, we aimed to predict the HIV status of individuals living in Angola, Burundi, Ethiopia, Lesotho, Malawi, Mozambique, Namibia, Rwanda, Zambia and Zimbabwe with the highest precision and sensitivity for different policy targets and constraints based on a minimal set of socio-behavioural characteristics.
We analysed the most recent Demographic and Health Survey from these 10 countries to predict individual's HIV status using four different algorithms (a penalized logistic regression, a generalized additive model, a support vector machine, and a gradient boosting trees). The algorithms were trained and validated on 80% of the data, and tested on the remaining 20%. We compared the predictions based on the F1 score, the harmonic mean of sensitivity and positive predictive value (PPV), and we assessed the generalization of our models by testing them against an independent left-out country. The best performing algorithm was trained on a minimal subset of variables which were identified as the most predictive, and used to 1) identify 95% of people living with HIV (PLHIV) while maximising precision and 2) identify groups of individuals by adjusting the probability threshold of being HIV positive (90% in our scenario) for achieving specific testing strategies.
Overall 55,151 males and 69,626 females were included in the analysis. The gradient boosting trees algorithm performed best in predicting HIV status with a mean F1 score of 76.8% [95% confidence interval (CI) 76.0%-77.6%] for males (vs [CI 67.8%-70.6%] for SVM) and 78.8% [CI 78.2%-79.4%] for females (vs [CI 73.4%-75.8%] for SVM). Among the ten most predictive variables for each sex, nine were identical: longitude, latitude and, altitude of place of residence, current age, age of most recent partner, total lifetime number of sexual partners, years lived in current place of residence, condom use during last intercourse and, wealth index. Only age at first sex for male (ranked 10th) and Rohrer's index for female (ranked 6th) were not similar for both sexes. Our large-scale scenario, which consisted in identifying 95% of all PLHIV, would have required testing 49.4% of males and 48.1% of females while achieving a precision of 15.4% for males and 22.7% for females. For the second scenario, only 4.6% of males and 6.0% of females would have had to be teste |
doi_str_mv | 10.1371/journal.pone.0264429 |
format | article |
fullrecord | <record><control><sourceid>gale_plos_</sourceid><recordid>TN_cdi_plos_journals_2635488032</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><galeid>A695596877</galeid><doaj_id>oai_doaj_org_article_1fdab1f21ae34a4fa2312ec2237559f1</doaj_id><sourcerecordid>A695596877</sourcerecordid><originalsourceid>FETCH-LOGICAL-c622t-929ddc8680e7605214abcfb491214b002b566cadeccea1bc865f35e85e1e57063</originalsourceid><addsrcrecordid>eNqNk01v1DAQhiMEoqXwDxBEQkJw2MUfiZNckFZVoStVKqLQqzVxxhuvsvFiOxX99_V202qDeuBka_zMOx-eSZK3lMwpL-iXtR1cD918a3ucEyayjFXPkmNacTYTjPDnB_ej5JX3a0JyXgrxMjniOeOVqIrjRP5w2BgVjO1Tq9Pz5XXqA4TBpzV4bNJo9lYZO6uxhRsTY0KXqhYcqIDO-GCUT02fnoEPKfRNemWH0KLr04V2RsHr5IWGzuOb8TxJfn87-3V6Pru4_L48XVzMlGAszCpWNY0qRUmwECRnNINa6TqraLzWhLA6F0JBg0oh0DqSueY5ljlSzAsi-Enyfq-77ayXY2-8ZILnWVkSziKx3BONhbXcOrMBdystGHlvsG4lwcVyOpRUN1BTzSggzyDTwDhlqBjjRZ5Xmkatr2O0od5go7APsS8T0elLb1q5sjeyLCsuyiwKfBoFnP0zoA9yY7zCroMe7XCft6BZFlOP6Id_0KerG6kVxAJMr22Mq3aiciGqmLUoiyJS8yco2DV2Y1QcJG2ifeLweeIQmYB_wwoG7-Xy6uf_s5fXU_bjAdsidKH1tht2c-inYLYHlbPeO9SPTaZE7vbgoRtytwdy3IPo9u7wgx6dHgaf3wGlNQL8</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2635488032</pqid></control><display><type>article</type><title>Prediction of HIV status based on socio-behavioural characteristics in East and Southern Africa</title><source>Access via ProQuest (Open Access)</source><source>PubMed Central</source><creator>Orel, Erol ; Esra, Rachel ; Estill, Janne ; Thiabaud, Amaury ; Marchand-Maillet, Stéphane ; Merzouki, Aziza ; Keiser, Olivia</creator><contributor>Boateng, Daniel</contributor><creatorcontrib>Orel, Erol ; Esra, Rachel ; Estill, Janne ; Thiabaud, Amaury ; Marchand-Maillet, Stéphane ; Merzouki, Aziza ; Keiser, Olivia ; Boateng, Daniel</creatorcontrib><description>High yield HIV testing strategies are critical to reach epidemic control in high prevalence and low-resource settings such as East and Southern Africa. In this study, we aimed to predict the HIV status of individuals living in Angola, Burundi, Ethiopia, Lesotho, Malawi, Mozambique, Namibia, Rwanda, Zambia and Zimbabwe with the highest precision and sensitivity for different policy targets and constraints based on a minimal set of socio-behavioural characteristics.
We analysed the most recent Demographic and Health Survey from these 10 countries to predict individual's HIV status using four different algorithms (a penalized logistic regression, a generalized additive model, a support vector machine, and a gradient boosting trees). The algorithms were trained and validated on 80% of the data, and tested on the remaining 20%. We compared the predictions based on the F1 score, the harmonic mean of sensitivity and positive predictive value (PPV), and we assessed the generalization of our models by testing them against an independent left-out country. The best performing algorithm was trained on a minimal subset of variables which were identified as the most predictive, and used to 1) identify 95% of people living with HIV (PLHIV) while maximising precision and 2) identify groups of individuals by adjusting the probability threshold of being HIV positive (90% in our scenario) for achieving specific testing strategies.
Overall 55,151 males and 69,626 females were included in the analysis. The gradient boosting trees algorithm performed best in predicting HIV status with a mean F1 score of 76.8% [95% confidence interval (CI) 76.0%-77.6%] for males (vs [CI 67.8%-70.6%] for SVM) and 78.8% [CI 78.2%-79.4%] for females (vs [CI 73.4%-75.8%] for SVM). Among the ten most predictive variables for each sex, nine were identical: longitude, latitude and, altitude of place of residence, current age, age of most recent partner, total lifetime number of sexual partners, years lived in current place of residence, condom use during last intercourse and, wealth index. Only age at first sex for male (ranked 10th) and Rohrer's index for female (ranked 6th) were not similar for both sexes. Our large-scale scenario, which consisted in identifying 95% of all PLHIV, would have required testing 49.4% of males and 48.1% of females while achieving a precision of 15.4% for males and 22.7% for females. For the second scenario, only 4.6% of males and 6.0% of females would have had to be tested to find 55.7% of all males and 50.5% of all females living with HIV.
We trained a gradient boosting trees algorithm to find 95% of PLHIV with a precision twice higher than with general population testing by using only a limited number of socio-behavioural characteristics. We also successfully identified people at high risk of infection who may be offered pre-exposure prophylaxis or voluntary medical male circumcision. These findings can inform the implementation of new high-yield HIV tests and help develop very precise strategies based on low-resource settings constraints.</description><identifier>ISSN: 1932-6203</identifier><identifier>EISSN: 1932-6203</identifier><identifier>DOI: 10.1371/journal.pone.0264429</identifier><identifier>PMID: 35239697</identifier><language>eng</language><publisher>United States: Public Library of Science</publisher><subject>Africa ; Africa, Southern - epidemiology ; Age ; Algorithms ; Biology and Life Sciences ; Care and treatment ; Circumcision, Male ; Computer and Information Sciences ; Confidence intervals ; Control ; Datasets ; Disease prevention ; Epidemics ; Female ; Females ; Health risks ; HIV ; HIV infection ; HIV Infections - diagnosis ; HIV Infections - drug therapy ; HIV Infections - epidemiology ; HIV patients ; HIV Testing ; Human immunodeficiency virus ; Humans ; Male ; Males ; Medicine and Health Sciences ; Methods ; Performance prediction ; Physical Sciences ; Pre-Exposure Prophylaxis ; Prevention ; Prophylaxis ; Regression models ; Research and Analysis Methods ; Risk factors ; Sex ; Sexual partners ; Sexually transmitted diseases ; Statistical analysis ; STD ; Support vector machines ; Trees ; Variables</subject><ispartof>PloS one, 2022-03, Vol.17 (3), p.e0264429</ispartof><rights>COPYRIGHT 2022 Public Library of Science</rights><rights>2022 Orel et al. This is an open access article distributed under the terms of the Creative Commons Attribution License: http://creativecommons.org/licenses/by/4.0/ (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><rights>2022 Orel et al 2022 Orel et al</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c622t-929ddc8680e7605214abcfb491214b002b566cadeccea1bc865f35e85e1e57063</citedby><cites>FETCH-LOGICAL-c622t-929ddc8680e7605214abcfb491214b002b566cadeccea1bc865f35e85e1e57063</cites><orcidid>0000-0003-1544-415X ; 0000-0002-4875-6101 ; 0000-0003-4652-4750</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.proquest.com/docview/2635488032/fulltextPDF?pq-origsite=primo$$EPDF$$P50$$Gproquest$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.proquest.com/docview/2635488032?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>230,314,727,780,784,885,25753,27924,27925,37012,37013,44590,53791,53793,75126</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/35239697$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><contributor>Boateng, Daniel</contributor><creatorcontrib>Orel, Erol</creatorcontrib><creatorcontrib>Esra, Rachel</creatorcontrib><creatorcontrib>Estill, Janne</creatorcontrib><creatorcontrib>Thiabaud, Amaury</creatorcontrib><creatorcontrib>Marchand-Maillet, Stéphane</creatorcontrib><creatorcontrib>Merzouki, Aziza</creatorcontrib><creatorcontrib>Keiser, Olivia</creatorcontrib><title>Prediction of HIV status based on socio-behavioural characteristics in East and Southern Africa</title><title>PloS one</title><addtitle>PLoS One</addtitle><description>High yield HIV testing strategies are critical to reach epidemic control in high prevalence and low-resource settings such as East and Southern Africa. In this study, we aimed to predict the HIV status of individuals living in Angola, Burundi, Ethiopia, Lesotho, Malawi, Mozambique, Namibia, Rwanda, Zambia and Zimbabwe with the highest precision and sensitivity for different policy targets and constraints based on a minimal set of socio-behavioural characteristics.
We analysed the most recent Demographic and Health Survey from these 10 countries to predict individual's HIV status using four different algorithms (a penalized logistic regression, a generalized additive model, a support vector machine, and a gradient boosting trees). The algorithms were trained and validated on 80% of the data, and tested on the remaining 20%. We compared the predictions based on the F1 score, the harmonic mean of sensitivity and positive predictive value (PPV), and we assessed the generalization of our models by testing them against an independent left-out country. The best performing algorithm was trained on a minimal subset of variables which were identified as the most predictive, and used to 1) identify 95% of people living with HIV (PLHIV) while maximising precision and 2) identify groups of individuals by adjusting the probability threshold of being HIV positive (90% in our scenario) for achieving specific testing strategies.
Overall 55,151 males and 69,626 females were included in the analysis. The gradient boosting trees algorithm performed best in predicting HIV status with a mean F1 score of 76.8% [95% confidence interval (CI) 76.0%-77.6%] for males (vs [CI 67.8%-70.6%] for SVM) and 78.8% [CI 78.2%-79.4%] for females (vs [CI 73.4%-75.8%] for SVM). Among the ten most predictive variables for each sex, nine were identical: longitude, latitude and, altitude of place of residence, current age, age of most recent partner, total lifetime number of sexual partners, years lived in current place of residence, condom use during last intercourse and, wealth index. Only age at first sex for male (ranked 10th) and Rohrer's index for female (ranked 6th) were not similar for both sexes. Our large-scale scenario, which consisted in identifying 95% of all PLHIV, would have required testing 49.4% of males and 48.1% of females while achieving a precision of 15.4% for males and 22.7% for females. For the second scenario, only 4.6% of males and 6.0% of females would have had to be tested to find 55.7% of all males and 50.5% of all females living with HIV.
We trained a gradient boosting trees algorithm to find 95% of PLHIV with a precision twice higher than with general population testing by using only a limited number of socio-behavioural characteristics. We also successfully identified people at high risk of infection who may be offered pre-exposure prophylaxis or voluntary medical male circumcision. These findings can inform the implementation of new high-yield HIV tests and help develop very precise strategies based on low-resource settings constraints.</description><subject>Africa</subject><subject>Africa, Southern - epidemiology</subject><subject>Age</subject><subject>Algorithms</subject><subject>Biology and Life Sciences</subject><subject>Care and treatment</subject><subject>Circumcision, Male</subject><subject>Computer and Information Sciences</subject><subject>Confidence intervals</subject><subject>Control</subject><subject>Datasets</subject><subject>Disease prevention</subject><subject>Epidemics</subject><subject>Female</subject><subject>Females</subject><subject>Health risks</subject><subject>HIV</subject><subject>HIV infection</subject><subject>HIV Infections - diagnosis</subject><subject>HIV Infections - drug therapy</subject><subject>HIV Infections - epidemiology</subject><subject>HIV patients</subject><subject>HIV Testing</subject><subject>Human immunodeficiency virus</subject><subject>Humans</subject><subject>Male</subject><subject>Males</subject><subject>Medicine and Health Sciences</subject><subject>Methods</subject><subject>Performance prediction</subject><subject>Physical Sciences</subject><subject>Pre-Exposure Prophylaxis</subject><subject>Prevention</subject><subject>Prophylaxis</subject><subject>Regression models</subject><subject>Research and Analysis Methods</subject><subject>Risk factors</subject><subject>Sex</subject><subject>Sexual partners</subject><subject>Sexually transmitted diseases</subject><subject>Statistical analysis</subject><subject>STD</subject><subject>Support vector machines</subject><subject>Trees</subject><subject>Variables</subject><issn>1932-6203</issn><issn>1932-6203</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>PIMPY</sourceid><sourceid>DOA</sourceid><recordid>eNqNk01v1DAQhiMEoqXwDxBEQkJw2MUfiZNckFZVoStVKqLQqzVxxhuvsvFiOxX99_V202qDeuBka_zMOx-eSZK3lMwpL-iXtR1cD918a3ucEyayjFXPkmNacTYTjPDnB_ej5JX3a0JyXgrxMjniOeOVqIrjRP5w2BgVjO1Tq9Pz5XXqA4TBpzV4bNJo9lYZO6uxhRsTY0KXqhYcqIDO-GCUT02fnoEPKfRNemWH0KLr04V2RsHr5IWGzuOb8TxJfn87-3V6Pru4_L48XVzMlGAszCpWNY0qRUmwECRnNINa6TqraLzWhLA6F0JBg0oh0DqSueY5ljlSzAsi-Enyfq-77ayXY2-8ZILnWVkSziKx3BONhbXcOrMBdystGHlvsG4lwcVyOpRUN1BTzSggzyDTwDhlqBjjRZ5Xmkatr2O0od5go7APsS8T0elLb1q5sjeyLCsuyiwKfBoFnP0zoA9yY7zCroMe7XCft6BZFlOP6Id_0KerG6kVxAJMr22Mq3aiciGqmLUoiyJS8yco2DV2Y1QcJG2ifeLweeIQmYB_wwoG7-Xy6uf_s5fXU_bjAdsidKH1tht2c-inYLYHlbPeO9SPTaZE7vbgoRtytwdy3IPo9u7wgx6dHgaf3wGlNQL8</recordid><startdate>20220303</startdate><enddate>20220303</enddate><creator>Orel, Erol</creator><creator>Esra, Rachel</creator><creator>Estill, Janne</creator><creator>Thiabaud, Amaury</creator><creator>Marchand-Maillet, Stéphane</creator><creator>Merzouki, Aziza</creator><creator>Keiser, Olivia</creator><general>Public Library of Science</general><general>Public Library of Science (PLoS)</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>IOV</scope><scope>ISR</scope><scope>3V.</scope><scope>7QG</scope><scope>7QL</scope><scope>7QO</scope><scope>7RV</scope><scope>7SN</scope><scope>7SS</scope><scope>7T5</scope><scope>7TG</scope><scope>7TM</scope><scope>7U9</scope><scope>7X2</scope><scope>7X7</scope><scope>7XB</scope><scope>88E</scope><scope>8AO</scope><scope>8C1</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FH</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>ATCPS</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>BHPHI</scope><scope>C1K</scope><scope>CCPQU</scope><scope>D1I</scope><scope>DWQXO</scope><scope>FR3</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>H94</scope><scope>HCIFZ</scope><scope>K9.</scope><scope>KB.</scope><scope>KB0</scope><scope>KL.</scope><scope>L6V</scope><scope>LK8</scope><scope>M0K</scope><scope>M0S</scope><scope>M1P</scope><scope>M7N</scope><scope>M7P</scope><scope>M7S</scope><scope>NAPCQ</scope><scope>P5Z</scope><scope>P62</scope><scope>P64</scope><scope>PATMY</scope><scope>PDBOC</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PTHSS</scope><scope>PYCSY</scope><scope>RC3</scope><scope>7X8</scope><scope>5PM</scope><scope>DOA</scope><orcidid>https://orcid.org/0000-0003-1544-415X</orcidid><orcidid>https://orcid.org/0000-0002-4875-6101</orcidid><orcidid>https://orcid.org/0000-0003-4652-4750</orcidid></search><sort><creationdate>20220303</creationdate><title>Prediction of HIV status based on socio-behavioural characteristics in East and Southern Africa</title><author>Orel, Erol ; Esra, Rachel ; Estill, Janne ; Thiabaud, Amaury ; Marchand-Maillet, Stéphane ; Merzouki, Aziza ; Keiser, Olivia</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c622t-929ddc8680e7605214abcfb491214b002b566cadeccea1bc865f35e85e1e57063</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Africa</topic><topic>Africa, Southern - epidemiology</topic><topic>Age</topic><topic>Algorithms</topic><topic>Biology and Life Sciences</topic><topic>Care and treatment</topic><topic>Circumcision, Male</topic><topic>Computer and Information Sciences</topic><topic>Confidence intervals</topic><topic>Control</topic><topic>Datasets</topic><topic>Disease prevention</topic><topic>Epidemics</topic><topic>Female</topic><topic>Females</topic><topic>Health risks</topic><topic>HIV</topic><topic>HIV infection</topic><topic>HIV Infections - diagnosis</topic><topic>HIV Infections - drug therapy</topic><topic>HIV Infections - epidemiology</topic><topic>HIV patients</topic><topic>HIV Testing</topic><topic>Human immunodeficiency virus</topic><topic>Humans</topic><topic>Male</topic><topic>Males</topic><topic>Medicine and Health Sciences</topic><topic>Methods</topic><topic>Performance prediction</topic><topic>Physical Sciences</topic><topic>Pre-Exposure Prophylaxis</topic><topic>Prevention</topic><topic>Prophylaxis</topic><topic>Regression models</topic><topic>Research and Analysis Methods</topic><topic>Risk factors</topic><topic>Sex</topic><topic>Sexual partners</topic><topic>Sexually transmitted diseases</topic><topic>Statistical analysis</topic><topic>STD</topic><topic>Support vector machines</topic><topic>Trees</topic><topic>Variables</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Orel, Erol</creatorcontrib><creatorcontrib>Esra, Rachel</creatorcontrib><creatorcontrib>Estill, Janne</creatorcontrib><creatorcontrib>Thiabaud, Amaury</creatorcontrib><creatorcontrib>Marchand-Maillet, Stéphane</creatorcontrib><creatorcontrib>Merzouki, Aziza</creatorcontrib><creatorcontrib>Keiser, Olivia</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Gale In Context: Opposing Viewpoints</collection><collection>Gale In Context: Science</collection><collection>ProQuest Central (Corporate)</collection><collection>Animal Behavior Abstracts</collection><collection>Bacteriology Abstracts (Microbiology B)</collection><collection>Biotechnology Research Abstracts</collection><collection>Nursing & Allied Health Database</collection><collection>Ecology Abstracts</collection><collection>Entomology Abstracts (Full archive)</collection><collection>Immunology Abstracts</collection><collection>Meteorological & Geoastrophysical Abstracts</collection><collection>Nucleic Acids Abstracts</collection><collection>Virology and AIDS Abstracts</collection><collection>Agricultural Science Collection</collection><collection>Health & Medical Collection</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Medical Database (Alumni Edition)</collection><collection>ProQuest Pharma Collection</collection><collection>Public Health Database</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central</collection><collection>Advanced Technologies & Aerospace Collection</collection><collection>Agricultural & Environmental Science Collection</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science Collection</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Environmental Sciences and Pollution Management</collection><collection>ProQuest One Community College</collection><collection>ProQuest Materials Science Collection</collection><collection>ProQuest Central</collection><collection>Engineering Research Database</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>AIDS and Cancer Research Abstracts</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Health & Medical Complete (Alumni)</collection><collection>https://resources.nclive.org/materials</collection><collection>Nursing & Allied Health Database (Alumni Edition)</collection><collection>Meteorological & Geoastrophysical Abstracts - Academic</collection><collection>ProQuest Engineering Collection</collection><collection>ProQuest Biological Science Collection</collection><collection>Agriculture Science Database</collection><collection>Health & Medical Collection (Alumni Edition)</collection><collection>Medical Database</collection><collection>Algology Mycology and Protozoology Abstracts (Microbiology C)</collection><collection>ProQuest Biological Science Journals</collection><collection>Engineering Database</collection><collection>Nursing & Allied Health Premium</collection><collection>ProQuest advanced technologies & aerospace journals</collection><collection>ProQuest Advanced Technologies & Aerospace Collection</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Environmental Science Database</collection><collection>Materials science collection</collection><collection>Access via ProQuest (Open Access)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>Engineering collection</collection><collection>Environmental Science Collection</collection><collection>Genetics Abstracts</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>PloS one</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Orel, Erol</au><au>Esra, Rachel</au><au>Estill, Janne</au><au>Thiabaud, Amaury</au><au>Marchand-Maillet, Stéphane</au><au>Merzouki, Aziza</au><au>Keiser, Olivia</au><au>Boateng, Daniel</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Prediction of HIV status based on socio-behavioural characteristics in East and Southern Africa</atitle><jtitle>PloS one</jtitle><addtitle>PLoS One</addtitle><date>2022-03-03</date><risdate>2022</risdate><volume>17</volume><issue>3</issue><spage>e0264429</spage><pages>e0264429-</pages><issn>1932-6203</issn><eissn>1932-6203</eissn><abstract>High yield HIV testing strategies are critical to reach epidemic control in high prevalence and low-resource settings such as East and Southern Africa. In this study, we aimed to predict the HIV status of individuals living in Angola, Burundi, Ethiopia, Lesotho, Malawi, Mozambique, Namibia, Rwanda, Zambia and Zimbabwe with the highest precision and sensitivity for different policy targets and constraints based on a minimal set of socio-behavioural characteristics.
We analysed the most recent Demographic and Health Survey from these 10 countries to predict individual's HIV status using four different algorithms (a penalized logistic regression, a generalized additive model, a support vector machine, and a gradient boosting trees). The algorithms were trained and validated on 80% of the data, and tested on the remaining 20%. We compared the predictions based on the F1 score, the harmonic mean of sensitivity and positive predictive value (PPV), and we assessed the generalization of our models by testing them against an independent left-out country. The best performing algorithm was trained on a minimal subset of variables which were identified as the most predictive, and used to 1) identify 95% of people living with HIV (PLHIV) while maximising precision and 2) identify groups of individuals by adjusting the probability threshold of being HIV positive (90% in our scenario) for achieving specific testing strategies.
Overall 55,151 males and 69,626 females were included in the analysis. The gradient boosting trees algorithm performed best in predicting HIV status with a mean F1 score of 76.8% [95% confidence interval (CI) 76.0%-77.6%] for males (vs [CI 67.8%-70.6%] for SVM) and 78.8% [CI 78.2%-79.4%] for females (vs [CI 73.4%-75.8%] for SVM). Among the ten most predictive variables for each sex, nine were identical: longitude, latitude and, altitude of place of residence, current age, age of most recent partner, total lifetime number of sexual partners, years lived in current place of residence, condom use during last intercourse and, wealth index. Only age at first sex for male (ranked 10th) and Rohrer's index for female (ranked 6th) were not similar for both sexes. Our large-scale scenario, which consisted in identifying 95% of all PLHIV, would have required testing 49.4% of males and 48.1% of females while achieving a precision of 15.4% for males and 22.7% for females. For the second scenario, only 4.6% of males and 6.0% of females would have had to be tested to find 55.7% of all males and 50.5% of all females living with HIV.
We trained a gradient boosting trees algorithm to find 95% of PLHIV with a precision twice higher than with general population testing by using only a limited number of socio-behavioural characteristics. We also successfully identified people at high risk of infection who may be offered pre-exposure prophylaxis or voluntary medical male circumcision. These findings can inform the implementation of new high-yield HIV tests and help develop very precise strategies based on low-resource settings constraints.</abstract><cop>United States</cop><pub>Public Library of Science</pub><pmid>35239697</pmid><doi>10.1371/journal.pone.0264429</doi><tpages>e0264429</tpages><orcidid>https://orcid.org/0000-0003-1544-415X</orcidid><orcidid>https://orcid.org/0000-0002-4875-6101</orcidid><orcidid>https://orcid.org/0000-0003-4652-4750</orcidid><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1932-6203 |
ispartof | PloS one, 2022-03, Vol.17 (3), p.e0264429 |
issn | 1932-6203 1932-6203 |
language | eng |
recordid | cdi_plos_journals_2635488032 |
source | Access via ProQuest (Open Access); PubMed Central |
subjects | Africa Africa, Southern - epidemiology Age Algorithms Biology and Life Sciences Care and treatment Circumcision, Male Computer and Information Sciences Confidence intervals Control Datasets Disease prevention Epidemics Female Females Health risks HIV HIV infection HIV Infections - diagnosis HIV Infections - drug therapy HIV Infections - epidemiology HIV patients HIV Testing Human immunodeficiency virus Humans Male Males Medicine and Health Sciences Methods Performance prediction Physical Sciences Pre-Exposure Prophylaxis Prevention Prophylaxis Regression models Research and Analysis Methods Risk factors Sex Sexual partners Sexually transmitted diseases Statistical analysis STD Support vector machines Trees Variables |
title | Prediction of HIV status based on socio-behavioural characteristics in East and Southern Africa |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-01T13%3A41%3A08IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-gale_plos_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Prediction%20of%20HIV%20status%20based%20on%20socio-behavioural%20characteristics%20in%20East%20and%20Southern%20Africa&rft.jtitle=PloS%20one&rft.au=Orel,%20Erol&rft.date=2022-03-03&rft.volume=17&rft.issue=3&rft.spage=e0264429&rft.pages=e0264429-&rft.issn=1932-6203&rft.eissn=1932-6203&rft_id=info:doi/10.1371/journal.pone.0264429&rft_dat=%3Cgale_plos_%3EA695596877%3C/gale_plos_%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c622t-929ddc8680e7605214abcfb491214b002b566cadeccea1bc865f35e85e1e57063%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2635488032&rft_id=info:pmid/35239697&rft_galeid=A695596877&rfr_iscdi=true |