Loading…
PaveSAM Segment Anything for Pavement Distress
Automated pavement monitoring using computer vision can analyze pavement conditions more efficiently and accurately than manual methods. Accurate segmentation is essential for quantifying the severity and extent of pavement defects and consequently, the overall condition index used for prioritizing...
Saved in:
Published in: | arXiv.org 2024-09 |
---|---|
Main Authors: | , , , |
Format: | Article |
Language: | English |
Subjects: | |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | |
---|---|
cites | |
container_end_page | |
container_issue | |
container_start_page | |
container_title | arXiv.org |
container_volume | |
creator | Owor, Neema Jakisa Adu-Gyamfi, Yaw Armstrong Aboah Amo-Boateng, Mark |
description | Automated pavement monitoring using computer vision can analyze pavement conditions more efficiently and accurately than manual methods. Accurate segmentation is essential for quantifying the severity and extent of pavement defects and consequently, the overall condition index used for prioritizing rehabilitation and maintenance activities. Deep learning-based segmentation models are however, often supervised and require pixel-level annotations, which can be costly and time-consuming. While the recent evolution of zero-shot segmentation models can generate pixel-wise labels for unseen classes without any training data, they struggle with irregularities of cracks and textured pavement backgrounds. This research proposes a zero-shot segmentation model, PaveSAM, that can segment pavement distresses using bounding box prompts. By retraining SAM's mask decoder with just 180 images, pavement distress segmentation is revolutionized, enabling efficient distress segmentation using bounding box prompts, a capability not found in current segmentation models. This not only drastically reduces labeling efforts and costs but also showcases our model's high performance with minimal input, establishing the pioneering use of SAM in pavement distress segmentation. Furthermore, researchers can use existing open-source pavement distress images annotated with bounding boxes to create segmentation masks, which increases the availability and diversity of segmentation pavement distress datasets. |
doi_str_mv | 10.48550/arxiv.2409.07295 |
format | article |
fullrecord | <record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_3103645386</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3103645386</sourcerecordid><originalsourceid>FETCH-LOGICAL-a526-31291476e3e5e943babf70bf58629e3d70322dcb587ee5dc9576748c4f3a11eb3</originalsourceid><addsrcrecordid>eNotjUtLw0AURgdBsNT-AHcB14kz986dxzLUV6HSQrsvk-SmpmiiM2nRf-9z9cE5cD4hrpQstCOSNyF-dKcCtPSFtODpTEwAUeVOA1yIWUoHKSUYC0Q4EcU6nHhTPmUb3r9yP2Zl_zk-d_0-a4eY_chfetulMXJKl-K8DS-JZ_87Fdv7u-38MV-uHhbzcpkHApOjAq-0NYxM7DVWoWqtrFpyBjxjYyUCNHVFzjJTU3uyxmpX6xaDUlzhVFz_Zd_i8H7kNO4OwzH23487VBKNJnQGvwALX0Nn</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3103645386</pqid></control><display><type>article</type><title>PaveSAM Segment Anything for Pavement Distress</title><source>Publicly Available Content Database</source><creator>Owor, Neema Jakisa ; Adu-Gyamfi, Yaw ; Armstrong Aboah ; Amo-Boateng, Mark</creator><creatorcontrib>Owor, Neema Jakisa ; Adu-Gyamfi, Yaw ; Armstrong Aboah ; Amo-Boateng, Mark</creatorcontrib><description>Automated pavement monitoring using computer vision can analyze pavement conditions more efficiently and accurately than manual methods. Accurate segmentation is essential for quantifying the severity and extent of pavement defects and consequently, the overall condition index used for prioritizing rehabilitation and maintenance activities. Deep learning-based segmentation models are however, often supervised and require pixel-level annotations, which can be costly and time-consuming. While the recent evolution of zero-shot segmentation models can generate pixel-wise labels for unseen classes without any training data, they struggle with irregularities of cracks and textured pavement backgrounds. This research proposes a zero-shot segmentation model, PaveSAM, that can segment pavement distresses using bounding box prompts. By retraining SAM's mask decoder with just 180 images, pavement distress segmentation is revolutionized, enabling efficient distress segmentation using bounding box prompts, a capability not found in current segmentation models. This not only drastically reduces labeling efforts and costs but also showcases our model's high performance with minimal input, establishing the pioneering use of SAM in pavement distress segmentation. Furthermore, researchers can use existing open-source pavement distress images annotated with bounding boxes to create segmentation masks, which increases the availability and diversity of segmentation pavement distress datasets.</description><identifier>EISSN: 2331-8422</identifier><identifier>DOI: 10.48550/arxiv.2409.07295</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Annotations ; Computer vision ; Condition monitoring ; Cost analysis ; Image segmentation ; Labels ; Pavements ; Pixels</subject><ispartof>arXiv.org, 2024-09</ispartof><rights>2024. This work is published under http://creativecommons.org/licenses/by-nc-sa/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.proquest.com/docview/3103645386?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>780,784,25753,27925,37012,44590</link.rule.ids></links><search><creatorcontrib>Owor, Neema Jakisa</creatorcontrib><creatorcontrib>Adu-Gyamfi, Yaw</creatorcontrib><creatorcontrib>Armstrong Aboah</creatorcontrib><creatorcontrib>Amo-Boateng, Mark</creatorcontrib><title>PaveSAM Segment Anything for Pavement Distress</title><title>arXiv.org</title><description>Automated pavement monitoring using computer vision can analyze pavement conditions more efficiently and accurately than manual methods. Accurate segmentation is essential for quantifying the severity and extent of pavement defects and consequently, the overall condition index used for prioritizing rehabilitation and maintenance activities. Deep learning-based segmentation models are however, often supervised and require pixel-level annotations, which can be costly and time-consuming. While the recent evolution of zero-shot segmentation models can generate pixel-wise labels for unseen classes without any training data, they struggle with irregularities of cracks and textured pavement backgrounds. This research proposes a zero-shot segmentation model, PaveSAM, that can segment pavement distresses using bounding box prompts. By retraining SAM's mask decoder with just 180 images, pavement distress segmentation is revolutionized, enabling efficient distress segmentation using bounding box prompts, a capability not found in current segmentation models. This not only drastically reduces labeling efforts and costs but also showcases our model's high performance with minimal input, establishing the pioneering use of SAM in pavement distress segmentation. Furthermore, researchers can use existing open-source pavement distress images annotated with bounding boxes to create segmentation masks, which increases the availability and diversity of segmentation pavement distress datasets.</description><subject>Annotations</subject><subject>Computer vision</subject><subject>Condition monitoring</subject><subject>Cost analysis</subject><subject>Image segmentation</subject><subject>Labels</subject><subject>Pavements</subject><subject>Pixels</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>PIMPY</sourceid><recordid>eNotjUtLw0AURgdBsNT-AHcB14kz986dxzLUV6HSQrsvk-SmpmiiM2nRf-9z9cE5cD4hrpQstCOSNyF-dKcCtPSFtODpTEwAUeVOA1yIWUoHKSUYC0Q4EcU6nHhTPmUb3r9yP2Zl_zk-d_0-a4eY_chfetulMXJKl-K8DS-JZ_87Fdv7u-38MV-uHhbzcpkHApOjAq-0NYxM7DVWoWqtrFpyBjxjYyUCNHVFzjJTU3uyxmpX6xaDUlzhVFz_Zd_i8H7kNO4OwzH23487VBKNJnQGvwALX0Nn</recordid><startdate>20240911</startdate><enddate>20240911</enddate><creator>Owor, Neema Jakisa</creator><creator>Adu-Gyamfi, Yaw</creator><creator>Armstrong Aboah</creator><creator>Amo-Boateng, Mark</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20240911</creationdate><title>PaveSAM Segment Anything for Pavement Distress</title><author>Owor, Neema Jakisa ; Adu-Gyamfi, Yaw ; Armstrong Aboah ; Amo-Boateng, Mark</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a526-31291476e3e5e943babf70bf58629e3d70322dcb587ee5dc9576748c4f3a11eb3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Annotations</topic><topic>Computer vision</topic><topic>Condition monitoring</topic><topic>Cost analysis</topic><topic>Image segmentation</topic><topic>Labels</topic><topic>Pavements</topic><topic>Pixels</topic><toplevel>online_resources</toplevel><creatorcontrib>Owor, Neema Jakisa</creatorcontrib><creatorcontrib>Adu-Gyamfi, Yaw</creatorcontrib><creatorcontrib>Armstrong Aboah</creatorcontrib><creatorcontrib>Amo-Boateng, Mark</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central</collection><collection>ProQuest Central Essentials</collection><collection>AUTh Library subscriptions: ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering collection</collection><jtitle>arXiv.org</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Owor, Neema Jakisa</au><au>Adu-Gyamfi, Yaw</au><au>Armstrong Aboah</au><au>Amo-Boateng, Mark</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>PaveSAM Segment Anything for Pavement Distress</atitle><jtitle>arXiv.org</jtitle><date>2024-09-11</date><risdate>2024</risdate><eissn>2331-8422</eissn><abstract>Automated pavement monitoring using computer vision can analyze pavement conditions more efficiently and accurately than manual methods. Accurate segmentation is essential for quantifying the severity and extent of pavement defects and consequently, the overall condition index used for prioritizing rehabilitation and maintenance activities. Deep learning-based segmentation models are however, often supervised and require pixel-level annotations, which can be costly and time-consuming. While the recent evolution of zero-shot segmentation models can generate pixel-wise labels for unseen classes without any training data, they struggle with irregularities of cracks and textured pavement backgrounds. This research proposes a zero-shot segmentation model, PaveSAM, that can segment pavement distresses using bounding box prompts. By retraining SAM's mask decoder with just 180 images, pavement distress segmentation is revolutionized, enabling efficient distress segmentation using bounding box prompts, a capability not found in current segmentation models. This not only drastically reduces labeling efforts and costs but also showcases our model's high performance with minimal input, establishing the pioneering use of SAM in pavement distress segmentation. Furthermore, researchers can use existing open-source pavement distress images annotated with bounding boxes to create segmentation masks, which increases the availability and diversity of segmentation pavement distress datasets.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><doi>10.48550/arxiv.2409.07295</doi><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | EISSN: 2331-8422 |
ispartof | arXiv.org, 2024-09 |
issn | 2331-8422 |
language | eng |
recordid | cdi_proquest_journals_3103645386 |
source | Publicly Available Content Database |
subjects | Annotations Computer vision Condition monitoring Cost analysis Image segmentation Labels Pavements Pixels |
title | PaveSAM Segment Anything for Pavement Distress |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-26T08%3A17%3A48IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=PaveSAM%20Segment%20Anything%20for%20Pavement%20Distress&rft.jtitle=arXiv.org&rft.au=Owor,%20Neema%20Jakisa&rft.date=2024-09-11&rft.eissn=2331-8422&rft_id=info:doi/10.48550/arxiv.2409.07295&rft_dat=%3Cproquest%3E3103645386%3C/proquest%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-a526-31291476e3e5e943babf70bf58629e3d70322dcb587ee5dc9576748c4f3a11eb3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=3103645386&rft_id=info:pmid/&rfr_iscdi=true |