How Molecules Impact Cells: Unlocking Contrastive PhenoMolecular Retrieval
Predicting molecular impact on cellular function is a core challenge in therapeutic design. Phenomic experiments, designed to capture cellular morphology, utilize microscopy based techniques and demonstrate a high throughput solution for uncovering molecular impact on the cell. In this work, we lear...
Published in: | arXiv.org 2024-09 |
---|---|
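Main Authors: | Fradkin, Philip ; Puria Azadi ; Karush Suri ; Wenkel, Frederik ; Bashashati, Ali ; Sypetkowski, Maciej ; Beaini, Dominique |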
Format: | Article |
Language: | English |
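Subjects: | Cellular structure ; Machine learning ; Microscopy ; Molecular structure ; Perturbation ; Retrieval |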
Online Access: | Get full text |
cited_by | |
---|---|
cites | |
container_end_page | |
container_issue | |
container_start_page | |
container_title | arXiv.org |
container_volume | |
creator | Fradkin, Philip ; Puria Azadi ; Karush Suri ; Wenkel, Frederik ; Bashashati, Ali ; Sypetkowski, Maciej ; Beaini, Dominique |
description | Predicting molecular impact on cellular function is a core challenge in therapeutic design. Phenomic experiments, designed to capture cellular morphology, utilize microscopy-based techniques and demonstrate a high-throughput solution for uncovering molecular impact on the cell. In this work, we learn a joint latent space between molecular structures and microscopy phenomic experiments, aligning paired samples with contrastive learning. Specifically, we study the problem of Contrastive PhenoMolecular Retrieval, which consists of zero-shot molecular structure identification conditioned on phenomic experiments. We assess challenges in multi-modal learning of phenomics and molecular modalities such as experimental batch effect, inactive molecule perturbations, and encoding perturbation concentration. We demonstrate improved multi-modal learner retrieval through (1) a uni-modal pre-trained phenomics model, (2) a novel inter-sample similarity-aware loss, and (3) models conditioned on a representation of molecular concentration. Following this recipe, we propose MolPhenix, a molecular phenomics model. MolPhenix leverages a pre-trained phenomics model to demonstrate significant performance gains across perturbation concentrations, molecular scaffolds, and activity thresholds. In particular, we demonstrate an 8.1x improvement in zero-shot molecular retrieval of active molecules over the previous state-of-the-art, reaching 77.33% in top-1% accuracy. These results open the door for machine learning to be applied in virtual phenomics screening, which can significantly benefit drug discovery applications. |
format | article |
publisher | Ithaca: Cornell University Library, arXiv.org |
date | 2024-09-10 |
rights | 2024. This work is published under http://creativecommons.org/licenses/by-nc-sa/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. |
oa | free_for_read |
fulltext | fulltext |
identifier | EISSN: 2331-8422 |
ispartof | arXiv.org, 2024-09 |
issn | 2331-8422 |
language | eng |
recordid | cdi_proquest_journals_3105553199 |
source | Publicly Available Content Database (Proquest) (PQ_SDU_P3) |
subjects | Cellular structure ; Machine learning ; Microscopy ; Molecular structure ; Perturbation ; Retrieval |
title | How Molecules Impact Cells: Unlocking Contrastive PhenoMolecular Retrieval |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-26T16%3A18%3A53IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=How%20Molecules%20Impact%20Cells:%20Unlocking%20Contrastive%20PhenoMolecular%20Retrieval&rft.jtitle=arXiv.org&rft.au=Fradkin,%20Philip&rft.date=2024-09-10&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E3105553199%3C/proquest%3E%3Cgrp_id%3Ecdi_FETCH-proquest_journals_31055531993%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=3105553199&rft_id=info:pmid/&rfr_iscdi=true |
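
The abstract describes aligning paired phenomic and molecular embeddings with contrastive learning and scoring zero-shot retrieval by top-1% accuracy. The sketch below illustrates those two ideas under stated assumptions: it uses the standard symmetric InfoNCE (CLIP-style) loss as a stand-in, not the paper's inter-sample similarity-aware loss, and it uses random vectors in place of real embeddings. All function names (`similarity_matrix`, `info_nce`, `top_percent_accuracy`) and the temperature value are hypothetical and are not taken from MolPhenix.

```python
import math
import random


def l2_normalize(v):
    """Scale a vector to unit length so dot products become cosine similarities."""
    norm = math.sqrt(sum(x * x for x in v)) or 1.0
    return [x / norm for x in v]


def similarity_matrix(pheno, mol):
    """Cosine similarity of every phenomic embedding (rows) against every molecule embedding (columns)."""
    p = [l2_normalize(v) for v in pheno]
    m = [l2_normalize(v) for v in mol]
    return [[sum(a * b for a, b in zip(pi, mj)) for mj in m] for pi in p]


def info_nce(sim, temperature=0.1):
    """Symmetric InfoNCE loss (assumed stand-in); the true pair for row i is column i."""
    n = len(sim)

    def one_direction(s):
        total = 0.0
        for i in range(n):
            logits = [x / temperature for x in s[i]]
            mx = max(logits)  # subtract the max for a numerically stable log-sum-exp
            log_z = mx + math.log(sum(math.exp(x - mx) for x in logits))
            total += log_z - logits[i]
        return total / n

    transposed = [list(col) for col in zip(*sim)]
    return 0.5 * (one_direction(sim) + one_direction(transposed))


def top_percent_accuracy(sim, percent=1.0):
    """Fraction of phenomic queries whose paired molecule ranks within the top `percent` of candidates."""
    n = len(sim)
    k = max(1, math.ceil(n * percent / 100.0))
    hits = 0
    for i in range(n):
        # Rank of the true pair = number of candidates scoring strictly higher.
        rank = sum(1 for x in sim[i] if x > sim[i][i])
        hits += rank < k
    return hits / n


# Toy data (assumption): each "phenomic" vector is its paired "molecule" vector plus small noise.
random.seed(0)
n, d = 200, 16
mol = [[random.gauss(0, 1) for _ in range(d)] for _ in range(n)]
pheno = [[x + random.gauss(0, 0.05) for x in v] for v in mol]
sim = similarity_matrix(pheno, mol)
print("loss:", info_nce(sim), "top-1% accuracy:", top_percent_accuracy(sim, percent=1.0))
```

With 200 candidates, top-1% means the paired molecule must rank among the 2 highest-scoring candidates for a query to count as a hit. A lower loss and a higher accuracy both indicate that paired samples sit closer together in the joint space than unpaired ones.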