Loading…

PaveSAM Segment Anything for Pavement Distress

Automated pavement monitoring using computer vision can analyze pavement conditions more efficiently and accurately than manual methods. Accurate segmentation is essential for quantifying the severity and extent of pavement defects and consequently, the overall condition index used for prioritizing...

Full description

Saved in:

Bibliographic Details
Published in:	arXiv.org 2024-09
Main Authors:	Owor, Neema Jakisa, Adu-Gyamfi, Yaw, Armstrong Aboah, Amo-Boateng, Mark
Format:	Article
Language:	English
Subjects:	Annotations Computer vision Condition monitoring Cost analysis Image segmentation Labels Pavements Pixels
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

cited_by
cites
container_end_page
container_issue
container_start_page
container_title	arXiv.org
container_volume
creator	Owor, Neema Jakisa Adu-Gyamfi, Yaw Armstrong Aboah Amo-Boateng, Mark
description	Automated pavement monitoring using computer vision can analyze pavement conditions more efficiently and accurately than manual methods. Accurate segmentation is essential for quantifying the severity and extent of pavement defects and consequently, the overall condition index used for prioritizing rehabilitation and maintenance activities. Deep learning-based segmentation models are however, often supervised and require pixel-level annotations, which can be costly and time-consuming. While the recent evolution of zero-shot segmentation models can generate pixel-wise labels for unseen classes without any training data, they struggle with irregularities of cracks and textured pavement backgrounds. This research proposes a zero-shot segmentation model, PaveSAM, that can segment pavement distresses using bounding box prompts. By retraining SAM's mask decoder with just 180 images, pavement distress segmentation is revolutionized, enabling efficient distress segmentation using bounding box prompts, a capability not found in current segmentation models. This not only drastically reduces labeling efforts and costs but also showcases our model's high performance with minimal input, establishing the pioneering use of SAM in pavement distress segmentation. Furthermore, researchers can use existing open-source pavement distress images annotated with bounding boxes to create segmentation masks, which increases the availability and diversity of segmentation pavement distress datasets.
doi_str_mv	10.48550/arxiv.2409.07295
format	article
fullrecord	<record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_3103645386</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3103645386</sourcerecordid><originalsourceid>FETCH-LOGICAL-a526-31291476e3e5e943babf70bf58629e3d70322dcb587ee5dc9576748c4f3a11eb3</originalsourceid><addsrcrecordid>eNotjUtLw0AURgdBsNT-AHcB14kz986dxzLUV6HSQrsvk-SmpmiiM2nRf-9z9cE5cD4hrpQstCOSNyF-dKcCtPSFtODpTEwAUeVOA1yIWUoHKSUYC0Q4EcU6nHhTPmUb3r9yP2Zl_zk-d_0-a4eY_chfetulMXJKl-K8DS-JZ_87Fdv7u-38MV-uHhbzcpkHApOjAq-0NYxM7DVWoWqtrFpyBjxjYyUCNHVFzjJTU3uyxmpX6xaDUlzhVFz_Zd_i8H7kNO4OwzH23487VBKNJnQGvwALX0Nn</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3103645386</pqid></control><display><type>article</type><title>PaveSAM Segment Anything for Pavement Distress</title><source>Publicly Available Content Database</source><creator>Owor, Neema Jakisa ; Adu-Gyamfi, Yaw ; Armstrong Aboah ; Amo-Boateng, Mark</creator><creatorcontrib>Owor, Neema Jakisa ; Adu-Gyamfi, Yaw ; Armstrong Aboah ; Amo-Boateng, Mark</creatorcontrib><description>Automated pavement monitoring using computer vision can analyze pavement conditions more efficiently and accurately than manual methods. Accurate segmentation is essential for quantifying the severity and extent of pavement defects and consequently, the overall condition index used for prioritizing rehabilitation and maintenance activities. Deep learning-based segmentation models are however, often supervised and require pixel-level annotations, which can be costly and time-consuming. While the recent evolution of zero-shot segmentation models can generate pixel-wise labels for unseen classes without any training data, they struggle with irregularities of cracks and textured pavement backgrounds. This research proposes a zero-shot segmentation model, PaveSAM, that can segment pavement distresses using bounding box prompts. By retraining SAM's mask decoder with just 180 images, pavement distress segmentation is revolutionized, enabling efficient distress segmentation using bounding box prompts, a capability not found in current segmentation models. This not only drastically reduces labeling efforts and costs but also showcases our model's high performance with minimal input, establishing the pioneering use of SAM in pavement distress segmentation. Furthermore, researchers can use existing open-source pavement distress images annotated with bounding boxes to create segmentation masks, which increases the availability and diversity of segmentation pavement distress datasets.</description><identifier>EISSN: 2331-8422</identifier><identifier>DOI: 10.48550/arxiv.2409.07295</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Annotations ; Computer vision ; Condition monitoring ; Cost analysis ; Image segmentation ; Labels ; Pavements ; Pixels</subject><ispartof>arXiv.org, 2024-09</ispartof><rights>2024. This work is published under http://creativecommons.org/licenses/by-nc-sa/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.proquest.com/docview/3103645386?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>780,784,25753,27925,37012,44590</link.rule.ids></links><search><creatorcontrib>Owor, Neema Jakisa</creatorcontrib><creatorcontrib>Adu-Gyamfi, Yaw</creatorcontrib><creatorcontrib>Armstrong Aboah</creatorcontrib><creatorcontrib>Amo-Boateng, Mark</creatorcontrib><title>PaveSAM Segment Anything for Pavement Distress</title><title>arXiv.org</title><description>Automated pavement monitoring using computer vision can analyze pavement conditions more efficiently and accurately than manual methods. Accurate segmentation is essential for quantifying the severity and extent of pavement defects and consequently, the overall condition index used for prioritizing rehabilitation and maintenance activities. Deep learning-based segmentation models are however, often supervised and require pixel-level annotations, which can be costly and time-consuming. While the recent evolution of zero-shot segmentation models can generate pixel-wise labels for unseen classes without any training data, they struggle with irregularities of cracks and textured pavement backgrounds. This research proposes a zero-shot segmentation model, PaveSAM, that can segment pavement distresses using bounding box prompts. By retraining SAM's mask decoder with just 180 images, pavement distress segmentation is revolutionized, enabling efficient distress segmentation using bounding box prompts, a capability not found in current segmentation models. This not only drastically reduces labeling efforts and costs but also showcases our model's high performance with minimal input, establishing the pioneering use of SAM in pavement distress segmentation. Furthermore, researchers can use existing open-source pavement distress images annotated with bounding boxes to create segmentation masks, which increases the availability and diversity of segmentation pavement distress datasets.</description><subject>Annotations</subject><subject>Computer vision</subject><subject>Condition monitoring</subject><subject>Cost analysis</subject><subject>Image segmentation</subject><subject>Labels</subject><subject>Pavements</subject><subject>Pixels</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>PIMPY</sourceid><recordid>eNotjUtLw0AURgdBsNT-AHcB14kz986dxzLUV6HSQrsvk-SmpmiiM2nRf-9z9cE5cD4hrpQstCOSNyF-dKcCtPSFtODpTEwAUeVOA1yIWUoHKSUYC0Q4EcU6nHhTPmUb3r9yP2Zl_zk-d_0-a4eY_chfetulMXJKl-K8DS-JZ_87Fdv7u-38MV-uHhbzcpkHApOjAq-0NYxM7DVWoWqtrFpyBjxjYyUCNHVFzjJTU3uyxmpX6xaDUlzhVFz_Zd_i8H7kNO4OwzH23487VBKNJnQGvwALX0Nn</recordid><startdate>20240911</startdate><enddate>20240911</enddate><creator>Owor, Neema Jakisa</creator><creator>Adu-Gyamfi, Yaw</creator><creator>Armstrong Aboah</creator><creator>Amo-Boateng, Mark</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20240911</creationdate><title>PaveSAM Segment Anything for Pavement Distress</title><author>Owor, Neema Jakisa ; Adu-Gyamfi, Yaw ; Armstrong Aboah ; Amo-Boateng, Mark</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a526-31291476e3e5e943babf70bf58629e3d70322dcb587ee5dc9576748c4f3a11eb3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Annotations</topic><topic>Computer vision</topic><topic>Condition monitoring</topic><topic>Cost analysis</topic><topic>Image segmentation</topic><topic>Labels</topic><topic>Pavements</topic><topic>Pixels</topic><toplevel>online_resources</toplevel><creatorcontrib>Owor, Neema Jakisa</creatorcontrib><creatorcontrib>Adu-Gyamfi, Yaw</creatorcontrib><creatorcontrib>Armstrong Aboah</creatorcontrib><creatorcontrib>Amo-Boateng, Mark</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central</collection><collection>ProQuest Central Essentials</collection><collection>AUTh Library subscriptions: ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering collection</collection><jtitle>arXiv.org</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Owor, Neema Jakisa</au><au>Adu-Gyamfi, Yaw</au><au>Armstrong Aboah</au><au>Amo-Boateng, Mark</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>PaveSAM Segment Anything for Pavement Distress</atitle><jtitle>arXiv.org</jtitle><date>2024-09-11</date><risdate>2024</risdate><eissn>2331-8422</eissn><abstract>Automated pavement monitoring using computer vision can analyze pavement conditions more efficiently and accurately than manual methods. Accurate segmentation is essential for quantifying the severity and extent of pavement defects and consequently, the overall condition index used for prioritizing rehabilitation and maintenance activities. Deep learning-based segmentation models are however, often supervised and require pixel-level annotations, which can be costly and time-consuming. While the recent evolution of zero-shot segmentation models can generate pixel-wise labels for unseen classes without any training data, they struggle with irregularities of cracks and textured pavement backgrounds. This research proposes a zero-shot segmentation model, PaveSAM, that can segment pavement distresses using bounding box prompts. By retraining SAM's mask decoder with just 180 images, pavement distress segmentation is revolutionized, enabling efficient distress segmentation using bounding box prompts, a capability not found in current segmentation models. This not only drastically reduces labeling efforts and costs but also showcases our model's high performance with minimal input, establishing the pioneering use of SAM in pavement distress segmentation. Furthermore, researchers can use existing open-source pavement distress images annotated with bounding boxes to create segmentation masks, which increases the availability and diversity of segmentation pavement distress datasets.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><doi>10.48550/arxiv.2409.07295</doi><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	EISSN: 2331-8422
ispartof	arXiv.org, 2024-09
issn	2331-8422
language	eng
recordid	cdi_proquest_journals_3103645386
source	Publicly Available Content Database
subjects	Annotations Computer vision Condition monitoring Cost analysis Image segmentation Labels Pavements Pixels
title	PaveSAM Segment Anything for Pavement Distress
url	http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-26T08%3A17%3A48IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=PaveSAM%20Segment%20Anything%20for%20Pavement%20Distress&rft.jtitle=arXiv.org&rft.au=Owor,%20Neema%20Jakisa&rft.date=2024-09-11&rft.eissn=2331-8422&rft_id=info:doi/10.48550/arxiv.2409.07295&rft_dat=%3Cproquest%3E3103645386%3C/proquest%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-a526-31291476e3e5e943babf70bf58629e3d70322dcb587ee5dc9576748c4f3a11eb3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=3103645386&rft_id=info:pmid/&rfr_iscdi=true