Loading…

How Molecules Impact Cells: Unlocking Contrastive PhenoMolecular Retrieval

Predicting molecular impact on cellular function is a core challenge in therapeutic design. Phenomic experiments, designed to capture cellular morphology, utilize microscopy based techniques and demonstrate a high throughput solution for uncovering molecular impact on the cell. In this work, we lear...

Full description

Saved in:
Bibliographic Details
Published in:arXiv.org 2024-09
Main Authors: Fradkin, Philip, Puria Azadi, Karush Suri, Wenkel, Frederik, Bashashati, Ali, Sypetkowski, Maciej, Beaini, Dominique
Format: Article
Language:English
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites
container_end_page
container_issue
container_start_page
container_title arXiv.org
container_volume
creator Fradkin, Philip
Puria Azadi
Karush Suri
Wenkel, Frederik
Bashashati, Ali
Sypetkowski, Maciej
Beaini, Dominique
description Predicting molecular impact on cellular function is a core challenge in therapeutic design. Phenomic experiments, designed to capture cellular morphology, utilize microscopy based techniques and demonstrate a high throughput solution for uncovering molecular impact on the cell. In this work, we learn a joint latent space between molecular structures and microscopy phenomic experiments, aligning paired samples with contrastive learning. Specifically, we study the problem ofContrastive PhenoMolecular Retrieval, which consists of zero-shot molecular structure identification conditioned on phenomic experiments. We assess challenges in multi-modal learning of phenomics and molecular modalities such as experimental batch effect, inactive molecule perturbations, and encoding perturbation concentration. We demonstrate improved multi-modal learner retrieval through (1) a uni-modal pre-trained phenomics model, (2) a novel inter sample similarity aware loss, and (3) models conditioned on a representation of molecular concentration. Following this recipe, we propose MolPhenix, a molecular phenomics model. MolPhenix leverages a pre-trained phenomics model to demonstrate significant performance gains across perturbation concentrations, molecular scaffolds, and activity thresholds. In particular, we demonstrate an 8.1x improvement in zero shot molecular retrieval of active molecules over the previous state-of-the-art, reaching 77.33% in top-1% accuracy. These results open the door for machine learning to be applied in virtual phenomics screening, which can significantly benefit drug discovery applications.
format article
fullrecord <record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_3105553199</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3105553199</sourcerecordid><originalsourceid>FETCH-proquest_journals_31055531993</originalsourceid><addsrcrecordid>eNqNjcEKgkAURYcgSMp_eNBaGGeysq0UFgQRtpZBXqVNMzZvtN_PhR_Q6i7OOdwJC4SUcbRdCTFjIVHDORfrjUgSGbBTbr9wthqrTiPB8d2qykOGWtMObkbb6lWbB2TWeKfI1z3C5YnGjolycEXvauyVXrDpXWnCcNw5Wx72RZZHrbOfDsmXje2cGVApY54M93Gayv-sH8pkPOs</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3105553199</pqid></control><display><type>article</type><title>How Molecules Impact Cells: Unlocking Contrastive PhenoMolecular Retrieval</title><source>Publicly Available Content Database (Proquest) (PQ_SDU_P3)</source><creator>Fradkin, Philip ; Puria Azadi ; Karush Suri ; Wenkel, Frederik ; Bashashati, Ali ; Sypetkowski, Maciej ; Beaini, Dominique</creator><creatorcontrib>Fradkin, Philip ; Puria Azadi ; Karush Suri ; Wenkel, Frederik ; Bashashati, Ali ; Sypetkowski, Maciej ; Beaini, Dominique</creatorcontrib><description>Predicting molecular impact on cellular function is a core challenge in therapeutic design. Phenomic experiments, designed to capture cellular morphology, utilize microscopy based techniques and demonstrate a high throughput solution for uncovering molecular impact on the cell. In this work, we learn a joint latent space between molecular structures and microscopy phenomic experiments, aligning paired samples with contrastive learning. Specifically, we study the problem ofContrastive PhenoMolecular Retrieval, which consists of zero-shot molecular structure identification conditioned on phenomic experiments. We assess challenges in multi-modal learning of phenomics and molecular modalities such as experimental batch effect, inactive molecule perturbations, and encoding perturbation concentration. We demonstrate improved multi-modal learner retrieval through (1) a uni-modal pre-trained phenomics model, (2) a novel inter sample similarity aware loss, and (3) models conditioned on a representation of molecular concentration. Following this recipe, we propose MolPhenix, a molecular phenomics model. MolPhenix leverages a pre-trained phenomics model to demonstrate significant performance gains across perturbation concentrations, molecular scaffolds, and activity thresholds. In particular, we demonstrate an 8.1x improvement in zero shot molecular retrieval of active molecules over the previous state-of-the-art, reaching 77.33% in top-1% accuracy. These results open the door for machine learning to be applied in virtual phenomics screening, which can significantly benefit drug discovery applications.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Cellular structure ; Machine learning ; Microscopy ; Molecular structure ; Perturbation ; Retrieval</subject><ispartof>arXiv.org, 2024-09</ispartof><rights>2024. This work is published under http://creativecommons.org/licenses/by-nc-sa/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.proquest.com/docview/3105553199?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>780,784,25753,37012,44590</link.rule.ids></links><search><creatorcontrib>Fradkin, Philip</creatorcontrib><creatorcontrib>Puria Azadi</creatorcontrib><creatorcontrib>Karush Suri</creatorcontrib><creatorcontrib>Wenkel, Frederik</creatorcontrib><creatorcontrib>Bashashati, Ali</creatorcontrib><creatorcontrib>Sypetkowski, Maciej</creatorcontrib><creatorcontrib>Beaini, Dominique</creatorcontrib><title>How Molecules Impact Cells: Unlocking Contrastive PhenoMolecular Retrieval</title><title>arXiv.org</title><description>Predicting molecular impact on cellular function is a core challenge in therapeutic design. Phenomic experiments, designed to capture cellular morphology, utilize microscopy based techniques and demonstrate a high throughput solution for uncovering molecular impact on the cell. In this work, we learn a joint latent space between molecular structures and microscopy phenomic experiments, aligning paired samples with contrastive learning. Specifically, we study the problem ofContrastive PhenoMolecular Retrieval, which consists of zero-shot molecular structure identification conditioned on phenomic experiments. We assess challenges in multi-modal learning of phenomics and molecular modalities such as experimental batch effect, inactive molecule perturbations, and encoding perturbation concentration. We demonstrate improved multi-modal learner retrieval through (1) a uni-modal pre-trained phenomics model, (2) a novel inter sample similarity aware loss, and (3) models conditioned on a representation of molecular concentration. Following this recipe, we propose MolPhenix, a molecular phenomics model. MolPhenix leverages a pre-trained phenomics model to demonstrate significant performance gains across perturbation concentrations, molecular scaffolds, and activity thresholds. In particular, we demonstrate an 8.1x improvement in zero shot molecular retrieval of active molecules over the previous state-of-the-art, reaching 77.33% in top-1% accuracy. These results open the door for machine learning to be applied in virtual phenomics screening, which can significantly benefit drug discovery applications.</description><subject>Cellular structure</subject><subject>Machine learning</subject><subject>Microscopy</subject><subject>Molecular structure</subject><subject>Perturbation</subject><subject>Retrieval</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>PIMPY</sourceid><recordid>eNqNjcEKgkAURYcgSMp_eNBaGGeysq0UFgQRtpZBXqVNMzZvtN_PhR_Q6i7OOdwJC4SUcbRdCTFjIVHDORfrjUgSGbBTbr9wthqrTiPB8d2qykOGWtMObkbb6lWbB2TWeKfI1z3C5YnGjolycEXvauyVXrDpXWnCcNw5Wx72RZZHrbOfDsmXje2cGVApY54M93Gayv-sH8pkPOs</recordid><startdate>20240910</startdate><enddate>20240910</enddate><creator>Fradkin, Philip</creator><creator>Puria Azadi</creator><creator>Karush Suri</creator><creator>Wenkel, Frederik</creator><creator>Bashashati, Ali</creator><creator>Sypetkowski, Maciej</creator><creator>Beaini, Dominique</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20240910</creationdate><title>How Molecules Impact Cells: Unlocking Contrastive PhenoMolecular Retrieval</title><author>Fradkin, Philip ; Puria Azadi ; Karush Suri ; Wenkel, Frederik ; Bashashati, Ali ; Sypetkowski, Maciej ; Beaini, Dominique</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_31055531993</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Cellular structure</topic><topic>Machine learning</topic><topic>Microscopy</topic><topic>Molecular structure</topic><topic>Perturbation</topic><topic>Retrieval</topic><toplevel>online_resources</toplevel><creatorcontrib>Fradkin, Philip</creatorcontrib><creatorcontrib>Puria Azadi</creatorcontrib><creatorcontrib>Karush Suri</creatorcontrib><creatorcontrib>Wenkel, Frederik</creatorcontrib><creatorcontrib>Bashashati, Ali</creatorcontrib><creatorcontrib>Sypetkowski, Maciej</creatorcontrib><creatorcontrib>Beaini, Dominique</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database (Proquest) (PQ_SDU_P3)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Fradkin, Philip</au><au>Puria Azadi</au><au>Karush Suri</au><au>Wenkel, Frederik</au><au>Bashashati, Ali</au><au>Sypetkowski, Maciej</au><au>Beaini, Dominique</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>How Molecules Impact Cells: Unlocking Contrastive PhenoMolecular Retrieval</atitle><jtitle>arXiv.org</jtitle><date>2024-09-10</date><risdate>2024</risdate><eissn>2331-8422</eissn><abstract>Predicting molecular impact on cellular function is a core challenge in therapeutic design. Phenomic experiments, designed to capture cellular morphology, utilize microscopy based techniques and demonstrate a high throughput solution for uncovering molecular impact on the cell. In this work, we learn a joint latent space between molecular structures and microscopy phenomic experiments, aligning paired samples with contrastive learning. Specifically, we study the problem ofContrastive PhenoMolecular Retrieval, which consists of zero-shot molecular structure identification conditioned on phenomic experiments. We assess challenges in multi-modal learning of phenomics and molecular modalities such as experimental batch effect, inactive molecule perturbations, and encoding perturbation concentration. We demonstrate improved multi-modal learner retrieval through (1) a uni-modal pre-trained phenomics model, (2) a novel inter sample similarity aware loss, and (3) models conditioned on a representation of molecular concentration. Following this recipe, we propose MolPhenix, a molecular phenomics model. MolPhenix leverages a pre-trained phenomics model to demonstrate significant performance gains across perturbation concentrations, molecular scaffolds, and activity thresholds. In particular, we demonstrate an 8.1x improvement in zero shot molecular retrieval of active molecules over the previous state-of-the-art, reaching 77.33% in top-1% accuracy. These results open the door for machine learning to be applied in virtual phenomics screening, which can significantly benefit drug discovery applications.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier EISSN: 2331-8422
ispartof arXiv.org, 2024-09
issn 2331-8422
language eng
recordid cdi_proquest_journals_3105553199
source Publicly Available Content Database (Proquest) (PQ_SDU_P3)
subjects Cellular structure
Machine learning
Microscopy
Molecular structure
Perturbation
Retrieval
title How Molecules Impact Cells: Unlocking Contrastive PhenoMolecular Retrieval
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-26T16%3A18%3A53IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=How%20Molecules%20Impact%20Cells:%20Unlocking%20Contrastive%20PhenoMolecular%20Retrieval&rft.jtitle=arXiv.org&rft.au=Fradkin,%20Philip&rft.date=2024-09-10&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E3105553199%3C/proquest%3E%3Cgrp_id%3Ecdi_FETCH-proquest_journals_31055531993%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=3105553199&rft_id=info:pmid/&rfr_iscdi=true