Loading…

PaveSAM Segment Anything for Pavement Distress

Automated pavement monitoring using computer vision can analyze pavement conditions more efficiently and accurately than manual methods. Accurate segmentation is essential for quantifying the severity and extent of pavement defects and consequently, the overall condition index used for prioritizing...

Full description

Saved in:
Bibliographic Details
Published in:arXiv.org 2024-09
Main Authors: Owor, Neema Jakisa, Adu-Gyamfi, Yaw, Armstrong Aboah, Amo-Boateng, Mark
Format: Article
Language:English
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites
container_end_page
container_issue
container_start_page
container_title arXiv.org
container_volume
creator Owor, Neema Jakisa
Adu-Gyamfi, Yaw
Armstrong Aboah
Amo-Boateng, Mark
description Automated pavement monitoring using computer vision can analyze pavement conditions more efficiently and accurately than manual methods. Accurate segmentation is essential for quantifying the severity and extent of pavement defects and consequently, the overall condition index used for prioritizing rehabilitation and maintenance activities. Deep learning-based segmentation models are however, often supervised and require pixel-level annotations, which can be costly and time-consuming. While the recent evolution of zero-shot segmentation models can generate pixel-wise labels for unseen classes without any training data, they struggle with irregularities of cracks and textured pavement backgrounds. This research proposes a zero-shot segmentation model, PaveSAM, that can segment pavement distresses using bounding box prompts. By retraining SAM's mask decoder with just 180 images, pavement distress segmentation is revolutionized, enabling efficient distress segmentation using bounding box prompts, a capability not found in current segmentation models. This not only drastically reduces labeling efforts and costs but also showcases our model's high performance with minimal input, establishing the pioneering use of SAM in pavement distress segmentation. Furthermore, researchers can use existing open-source pavement distress images annotated with bounding boxes to create segmentation masks, which increases the availability and diversity of segmentation pavement distress datasets.
doi_str_mv 10.48550/arxiv.2409.07295
format article
fullrecord <record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_3103645386</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3103645386</sourcerecordid><originalsourceid>FETCH-LOGICAL-a526-31291476e3e5e943babf70bf58629e3d70322dcb587ee5dc9576748c4f3a11eb3</originalsourceid><addsrcrecordid>eNotjUtLw0AURgdBsNT-AHcB14kz986dxzLUV6HSQrsvk-SmpmiiM2nRf-9z9cE5cD4hrpQstCOSNyF-dKcCtPSFtODpTEwAUeVOA1yIWUoHKSUYC0Q4EcU6nHhTPmUb3r9yP2Zl_zk-d_0-a4eY_chfetulMXJKl-K8DS-JZ_87Fdv7u-38MV-uHhbzcpkHApOjAq-0NYxM7DVWoWqtrFpyBjxjYyUCNHVFzjJTU3uyxmpX6xaDUlzhVFz_Zd_i8H7kNO4OwzH23487VBKNJnQGvwALX0Nn</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3103645386</pqid></control><display><type>article</type><title>PaveSAM Segment Anything for Pavement Distress</title><source>Publicly Available Content Database</source><creator>Owor, Neema Jakisa ; Adu-Gyamfi, Yaw ; Armstrong Aboah ; Amo-Boateng, Mark</creator><creatorcontrib>Owor, Neema Jakisa ; Adu-Gyamfi, Yaw ; Armstrong Aboah ; Amo-Boateng, Mark</creatorcontrib><description>Automated pavement monitoring using computer vision can analyze pavement conditions more efficiently and accurately than manual methods. Accurate segmentation is essential for quantifying the severity and extent of pavement defects and consequently, the overall condition index used for prioritizing rehabilitation and maintenance activities. Deep learning-based segmentation models are however, often supervised and require pixel-level annotations, which can be costly and time-consuming. While the recent evolution of zero-shot segmentation models can generate pixel-wise labels for unseen classes without any training data, they struggle with irregularities of cracks and textured pavement backgrounds. This research proposes a zero-shot segmentation model, PaveSAM, that can segment pavement distresses using bounding box prompts. By retraining SAM's mask decoder with just 180 images, pavement distress segmentation is revolutionized, enabling efficient distress segmentation using bounding box prompts, a capability not found in current segmentation models. This not only drastically reduces labeling efforts and costs but also showcases our model's high performance with minimal input, establishing the pioneering use of SAM in pavement distress segmentation. Furthermore, researchers can use existing open-source pavement distress images annotated with bounding boxes to create segmentation masks, which increases the availability and diversity of segmentation pavement distress datasets.</description><identifier>EISSN: 2331-8422</identifier><identifier>DOI: 10.48550/arxiv.2409.07295</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Annotations ; Computer vision ; Condition monitoring ; Cost analysis ; Image segmentation ; Labels ; Pavements ; Pixels</subject><ispartof>arXiv.org, 2024-09</ispartof><rights>2024. This work is published under http://creativecommons.org/licenses/by-nc-sa/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.proquest.com/docview/3103645386?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>780,784,25753,27925,37012,44590</link.rule.ids></links><search><creatorcontrib>Owor, Neema Jakisa</creatorcontrib><creatorcontrib>Adu-Gyamfi, Yaw</creatorcontrib><creatorcontrib>Armstrong Aboah</creatorcontrib><creatorcontrib>Amo-Boateng, Mark</creatorcontrib><title>PaveSAM Segment Anything for Pavement Distress</title><title>arXiv.org</title><description>Automated pavement monitoring using computer vision can analyze pavement conditions more efficiently and accurately than manual methods. Accurate segmentation is essential for quantifying the severity and extent of pavement defects and consequently, the overall condition index used for prioritizing rehabilitation and maintenance activities. Deep learning-based segmentation models are however, often supervised and require pixel-level annotations, which can be costly and time-consuming. While the recent evolution of zero-shot segmentation models can generate pixel-wise labels for unseen classes without any training data, they struggle with irregularities of cracks and textured pavement backgrounds. This research proposes a zero-shot segmentation model, PaveSAM, that can segment pavement distresses using bounding box prompts. By retraining SAM's mask decoder with just 180 images, pavement distress segmentation is revolutionized, enabling efficient distress segmentation using bounding box prompts, a capability not found in current segmentation models. This not only drastically reduces labeling efforts and costs but also showcases our model's high performance with minimal input, establishing the pioneering use of SAM in pavement distress segmentation. Furthermore, researchers can use existing open-source pavement distress images annotated with bounding boxes to create segmentation masks, which increases the availability and diversity of segmentation pavement distress datasets.</description><subject>Annotations</subject><subject>Computer vision</subject><subject>Condition monitoring</subject><subject>Cost analysis</subject><subject>Image segmentation</subject><subject>Labels</subject><subject>Pavements</subject><subject>Pixels</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>PIMPY</sourceid><recordid>eNotjUtLw0AURgdBsNT-AHcB14kz986dxzLUV6HSQrsvk-SmpmiiM2nRf-9z9cE5cD4hrpQstCOSNyF-dKcCtPSFtODpTEwAUeVOA1yIWUoHKSUYC0Q4EcU6nHhTPmUb3r9yP2Zl_zk-d_0-a4eY_chfetulMXJKl-K8DS-JZ_87Fdv7u-38MV-uHhbzcpkHApOjAq-0NYxM7DVWoWqtrFpyBjxjYyUCNHVFzjJTU3uyxmpX6xaDUlzhVFz_Zd_i8H7kNO4OwzH23487VBKNJnQGvwALX0Nn</recordid><startdate>20240911</startdate><enddate>20240911</enddate><creator>Owor, Neema Jakisa</creator><creator>Adu-Gyamfi, Yaw</creator><creator>Armstrong Aboah</creator><creator>Amo-Boateng, Mark</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20240911</creationdate><title>PaveSAM Segment Anything for Pavement Distress</title><author>Owor, Neema Jakisa ; Adu-Gyamfi, Yaw ; Armstrong Aboah ; Amo-Boateng, Mark</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a526-31291476e3e5e943babf70bf58629e3d70322dcb587ee5dc9576748c4f3a11eb3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Annotations</topic><topic>Computer vision</topic><topic>Condition monitoring</topic><topic>Cost analysis</topic><topic>Image segmentation</topic><topic>Labels</topic><topic>Pavements</topic><topic>Pixels</topic><toplevel>online_resources</toplevel><creatorcontrib>Owor, Neema Jakisa</creatorcontrib><creatorcontrib>Adu-Gyamfi, Yaw</creatorcontrib><creatorcontrib>Armstrong Aboah</creatorcontrib><creatorcontrib>Amo-Boateng, Mark</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central</collection><collection>ProQuest Central Essentials</collection><collection>AUTh Library subscriptions: ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering collection</collection><jtitle>arXiv.org</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Owor, Neema Jakisa</au><au>Adu-Gyamfi, Yaw</au><au>Armstrong Aboah</au><au>Amo-Boateng, Mark</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>PaveSAM Segment Anything for Pavement Distress</atitle><jtitle>arXiv.org</jtitle><date>2024-09-11</date><risdate>2024</risdate><eissn>2331-8422</eissn><abstract>Automated pavement monitoring using computer vision can analyze pavement conditions more efficiently and accurately than manual methods. Accurate segmentation is essential for quantifying the severity and extent of pavement defects and consequently, the overall condition index used for prioritizing rehabilitation and maintenance activities. Deep learning-based segmentation models are however, often supervised and require pixel-level annotations, which can be costly and time-consuming. While the recent evolution of zero-shot segmentation models can generate pixel-wise labels for unseen classes without any training data, they struggle with irregularities of cracks and textured pavement backgrounds. This research proposes a zero-shot segmentation model, PaveSAM, that can segment pavement distresses using bounding box prompts. By retraining SAM's mask decoder with just 180 images, pavement distress segmentation is revolutionized, enabling efficient distress segmentation using bounding box prompts, a capability not found in current segmentation models. This not only drastically reduces labeling efforts and costs but also showcases our model's high performance with minimal input, establishing the pioneering use of SAM in pavement distress segmentation. Furthermore, researchers can use existing open-source pavement distress images annotated with bounding boxes to create segmentation masks, which increases the availability and diversity of segmentation pavement distress datasets.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><doi>10.48550/arxiv.2409.07295</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier EISSN: 2331-8422
ispartof arXiv.org, 2024-09
issn 2331-8422
language eng
recordid cdi_proquest_journals_3103645386
source Publicly Available Content Database
subjects Annotations
Computer vision
Condition monitoring
Cost analysis
Image segmentation
Labels
Pavements
Pixels
title PaveSAM Segment Anything for Pavement Distress
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-26T08%3A17%3A48IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=PaveSAM%20Segment%20Anything%20for%20Pavement%20Distress&rft.jtitle=arXiv.org&rft.au=Owor,%20Neema%20Jakisa&rft.date=2024-09-11&rft.eissn=2331-8422&rft_id=info:doi/10.48550/arxiv.2409.07295&rft_dat=%3Cproquest%3E3103645386%3C/proquest%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-a526-31291476e3e5e943babf70bf58629e3d70322dcb587ee5dc9576748c4f3a11eb3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=3103645386&rft_id=info:pmid/&rfr_iscdi=true