Loading…
SegMind: Semisupervised Remote Sensing Image Semantic Segmentation With Masked Image Modeling and Contrastive Learning Method
Remote sensing (RS) image semantic segmentation has attracted much attention due to its wide applications. However, deep learning-based RS image semantic segmentation methods usually require substantial manual pixelwise annotations, which are expensive and hard to obtain in practice. Although the ex...
Saved in:
Published in: | IEEE transactions on geoscience and remote sensing 2023, Vol.61, p.1-17 |
---|---|
Main Authors: | , , , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | cdi_FETCH-LOGICAL-c294t-4295ea4148c76acdceecbb22fa261a4247ec581c3b73561049825c00efe5c7613 |
---|---|
cites | cdi_FETCH-LOGICAL-c294t-4295ea4148c76acdceecbb22fa261a4247ec581c3b73561049825c00efe5c7613 |
container_end_page | 17 |
container_issue | |
container_start_page | 1 |
container_title | IEEE transactions on geoscience and remote sensing |
container_volume | 61 |
creator | Li, Zhenghong Chen, Hao Wu, Jiangjiang Li, Jun Jing, Ning |
description | Remote sensing (RS) image semantic segmentation has attracted much attention due to its wide applications. However, deep learning-based RS image semantic segmentation methods usually require substantial manual pixelwise annotations, which are expensive and hard to obtain in practice. Although the existing semisupervised RS semantic segmentation methods effectively reduce dependence on labeled data, they generally focus on information consistency between labeled and unlabeled images, but ignore the potential context information between different areas of the RS image. In fact, the objects contained in an RS image usually have some long-range dependence between each other, since trees are usually on both sides of a road, and the middle of two rows of houses is commonly a road. Therefore, we believe that the potential dependencies between different areas of the RS image should be beneficial to reduce the label dependence of RS semantic segmentation. Based on this point, we propose a novel semisupervised RS image semantic segmentation network named SegMind, which is based on mean-teacher (MT) architecture and adopts masked image modeling (MIM) to enhance information interactions of different areas. Moreover, contrastive learning (CL) and entropy loss are introduced to SegMind framework to further improve the linear separability and prediction confidence of the proposed model. Experiments on three datasets have demonstrated the superiority of the proposed method over the state-of-the-art methods. The code is available at https://github.com/lzh-ggs-ddu/SegMind . |
doi_str_mv | 10.1109/TGRS.2023.3321041 |
format | article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_crossref_primary_10_1109_TGRS_2023_3321041</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>10268382</ieee_id><sourcerecordid>2879377641</sourcerecordid><originalsourceid>FETCH-LOGICAL-c294t-4295ea4148c76acdceecbb22fa261a4247ec581c3b73561049825c00efe5c7613</originalsourceid><addsrcrecordid>eNpNkMtKw0AUhgdRsFYfQHARcJ0611zcSdFaaBDaisthOjlJp5pJnZkWXPjuJqQLV-f2f-dwfoRuCZ4QgvOH9Wy5mlBM2YQxSjAnZ2hEhMhinHB-jkaY5ElMs5xeoivvdxgTLkg6Qr8rqAtjy8doBY3xhz24o_FQRkto2gBd13pj62jeqLqvGmWD0V1SN2CDCqa10YcJ26hQ_rPDBl3RlvDVY8qW0bS1wSkfzBGiBShn-0EBYduW1-iiUl8ebk5xjN5fntfT13jxNptPnxaxpjkPMae5AMUJz3SaKF1qAL3ZUFopmhDFKU9Bi4xotkmZSLrv84wKjTFUIDqCsDG6H_buXft9AB_krj04252UNEtzlqYJ71VkUGnXeu-gkntnGuV-JMGyd1n2LsveZXlyuWPuBsYAwD89TTKWUfYH5dB6Bg</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2879377641</pqid></control><display><type>article</type><title>SegMind: Semisupervised Remote Sensing Image Semantic Segmentation With Masked Image Modeling and Contrastive Learning Method</title><source>IEEE Electronic Library (IEL) Journals</source><creator>Li, Zhenghong ; Chen, Hao ; Wu, Jiangjiang ; Li, Jun ; Jing, Ning</creator><creatorcontrib>Li, Zhenghong ; Chen, Hao ; Wu, Jiangjiang ; Li, Jun ; Jing, Ning</creatorcontrib><description>Remote sensing (RS) image semantic segmentation has attracted much attention due to its wide applications. However, deep learning-based RS image semantic segmentation methods usually require substantial manual pixelwise annotations, which are expensive and hard to obtain in practice. Although the existing semisupervised RS semantic segmentation methods effectively reduce dependence on labeled data, they generally focus on information consistency between labeled and unlabeled images, but ignore the potential context information between different areas of the RS image. In fact, the objects contained in an RS image usually have some long-range dependence between each other, since trees are usually on both sides of a road, and the middle of two rows of houses is commonly a road. Therefore, we believe that the potential dependencies between different areas of the RS image should be beneficial to reduce the label dependence of RS semantic segmentation. Based on this point, we propose a novel semisupervised RS image semantic segmentation network named SegMind, which is based on mean-teacher (MT) architecture and adopts masked image modeling (MIM) to enhance information interactions of different areas. Moreover, contrastive learning (CL) and entropy loss are introduced to SegMind framework to further improve the linear separability and prediction confidence of the proposed model. Experiments on three datasets have demonstrated the superiority of the proposed method over the state-of-the-art methods. The code is available at https://github.com/lzh-ggs-ddu/SegMind .</description><identifier>ISSN: 0196-2892</identifier><identifier>EISSN: 1558-0644</identifier><identifier>DOI: 10.1109/TGRS.2023.3321041</identifier><identifier>CODEN: IGRSD2</identifier><language>eng</language><publisher>New York: IEEE</publisher><subject>Annotations ; Contrastive learning (CL) ; Deep learning ; Image enhancement ; Image processing ; Image segmentation ; Learning systems ; masked image modeling (MIM) ; Modelling ; Remote sensing ; remote sensing (RS) image semantic segmentation ; Roads ; Semantic segmentation ; Semantics ; semisupervised learning ; Training ; Transformers</subject><ispartof>IEEE transactions on geoscience and remote sensing, 2023, Vol.61, p.1-17</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2023</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c294t-4295ea4148c76acdceecbb22fa261a4247ec581c3b73561049825c00efe5c7613</citedby><cites>FETCH-LOGICAL-c294t-4295ea4148c76acdceecbb22fa261a4247ec581c3b73561049825c00efe5c7613</cites><orcidid>0000-0001-8111-6803 ; 0000-0002-7880-3394</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/10268382$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,780,784,4023,27922,27923,27924,54795</link.rule.ids></links><search><creatorcontrib>Li, Zhenghong</creatorcontrib><creatorcontrib>Chen, Hao</creatorcontrib><creatorcontrib>Wu, Jiangjiang</creatorcontrib><creatorcontrib>Li, Jun</creatorcontrib><creatorcontrib>Jing, Ning</creatorcontrib><title>SegMind: Semisupervised Remote Sensing Image Semantic Segmentation With Masked Image Modeling and Contrastive Learning Method</title><title>IEEE transactions on geoscience and remote sensing</title><addtitle>TGRS</addtitle><description>Remote sensing (RS) image semantic segmentation has attracted much attention due to its wide applications. However, deep learning-based RS image semantic segmentation methods usually require substantial manual pixelwise annotations, which are expensive and hard to obtain in practice. Although the existing semisupervised RS semantic segmentation methods effectively reduce dependence on labeled data, they generally focus on information consistency between labeled and unlabeled images, but ignore the potential context information between different areas of the RS image. In fact, the objects contained in an RS image usually have some long-range dependence between each other, since trees are usually on both sides of a road, and the middle of two rows of houses is commonly a road. Therefore, we believe that the potential dependencies between different areas of the RS image should be beneficial to reduce the label dependence of RS semantic segmentation. Based on this point, we propose a novel semisupervised RS image semantic segmentation network named SegMind, which is based on mean-teacher (MT) architecture and adopts masked image modeling (MIM) to enhance information interactions of different areas. Moreover, contrastive learning (CL) and entropy loss are introduced to SegMind framework to further improve the linear separability and prediction confidence of the proposed model. Experiments on three datasets have demonstrated the superiority of the proposed method over the state-of-the-art methods. The code is available at https://github.com/lzh-ggs-ddu/SegMind .</description><subject>Annotations</subject><subject>Contrastive learning (CL)</subject><subject>Deep learning</subject><subject>Image enhancement</subject><subject>Image processing</subject><subject>Image segmentation</subject><subject>Learning systems</subject><subject>masked image modeling (MIM)</subject><subject>Modelling</subject><subject>Remote sensing</subject><subject>remote sensing (RS) image semantic segmentation</subject><subject>Roads</subject><subject>Semantic segmentation</subject><subject>Semantics</subject><subject>semisupervised learning</subject><subject>Training</subject><subject>Transformers</subject><issn>0196-2892</issn><issn>1558-0644</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><recordid>eNpNkMtKw0AUhgdRsFYfQHARcJ0611zcSdFaaBDaisthOjlJp5pJnZkWXPjuJqQLV-f2f-dwfoRuCZ4QgvOH9Wy5mlBM2YQxSjAnZ2hEhMhinHB-jkaY5ElMs5xeoivvdxgTLkg6Qr8rqAtjy8doBY3xhz24o_FQRkto2gBd13pj62jeqLqvGmWD0V1SN2CDCqa10YcJ26hQ_rPDBl3RlvDVY8qW0bS1wSkfzBGiBShn-0EBYduW1-iiUl8ebk5xjN5fntfT13jxNptPnxaxpjkPMae5AMUJz3SaKF1qAL3ZUFopmhDFKU9Bi4xotkmZSLrv84wKjTFUIDqCsDG6H_buXft9AB_krj04252UNEtzlqYJ71VkUGnXeu-gkntnGuV-JMGyd1n2LsveZXlyuWPuBsYAwD89TTKWUfYH5dB6Bg</recordid><startdate>2023</startdate><enddate>2023</enddate><creator>Li, Zhenghong</creator><creator>Chen, Hao</creator><creator>Wu, Jiangjiang</creator><creator>Li, Jun</creator><creator>Jing, Ning</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7UA</scope><scope>8FD</scope><scope>C1K</scope><scope>F1W</scope><scope>FR3</scope><scope>H8D</scope><scope>H96</scope><scope>KR7</scope><scope>L.G</scope><scope>L7M</scope><orcidid>https://orcid.org/0000-0001-8111-6803</orcidid><orcidid>https://orcid.org/0000-0002-7880-3394</orcidid></search><sort><creationdate>2023</creationdate><title>SegMind: Semisupervised Remote Sensing Image Semantic Segmentation With Masked Image Modeling and Contrastive Learning Method</title><author>Li, Zhenghong ; Chen, Hao ; Wu, Jiangjiang ; Li, Jun ; Jing, Ning</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c294t-4295ea4148c76acdceecbb22fa261a4247ec581c3b73561049825c00efe5c7613</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Annotations</topic><topic>Contrastive learning (CL)</topic><topic>Deep learning</topic><topic>Image enhancement</topic><topic>Image processing</topic><topic>Image segmentation</topic><topic>Learning systems</topic><topic>masked image modeling (MIM)</topic><topic>Modelling</topic><topic>Remote sensing</topic><topic>remote sensing (RS) image semantic segmentation</topic><topic>Roads</topic><topic>Semantic segmentation</topic><topic>Semantics</topic><topic>semisupervised learning</topic><topic>Training</topic><topic>Transformers</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Li, Zhenghong</creatorcontrib><creatorcontrib>Chen, Hao</creatorcontrib><creatorcontrib>Wu, Jiangjiang</creatorcontrib><creatorcontrib>Li, Jun</creatorcontrib><creatorcontrib>Jing, Ning</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Xplore</collection><collection>CrossRef</collection><collection>Water Resources Abstracts</collection><collection>Technology Research Database</collection><collection>Environmental Sciences and Pollution Management</collection><collection>ASFA: Aquatic Sciences and Fisheries Abstracts</collection><collection>Engineering Research Database</collection><collection>Aerospace Database</collection><collection>Aquatic Science & Fisheries Abstracts (ASFA) 2: Ocean Technology, Policy & Non-Living Resources</collection><collection>Civil Engineering Abstracts</collection><collection>Aquatic Science & Fisheries Abstracts (ASFA) Professional</collection><collection>Advanced Technologies Database with Aerospace</collection><jtitle>IEEE transactions on geoscience and remote sensing</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Li, Zhenghong</au><au>Chen, Hao</au><au>Wu, Jiangjiang</au><au>Li, Jun</au><au>Jing, Ning</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>SegMind: Semisupervised Remote Sensing Image Semantic Segmentation With Masked Image Modeling and Contrastive Learning Method</atitle><jtitle>IEEE transactions on geoscience and remote sensing</jtitle><stitle>TGRS</stitle><date>2023</date><risdate>2023</risdate><volume>61</volume><spage>1</spage><epage>17</epage><pages>1-17</pages><issn>0196-2892</issn><eissn>1558-0644</eissn><coden>IGRSD2</coden><abstract>Remote sensing (RS) image semantic segmentation has attracted much attention due to its wide applications. However, deep learning-based RS image semantic segmentation methods usually require substantial manual pixelwise annotations, which are expensive and hard to obtain in practice. Although the existing semisupervised RS semantic segmentation methods effectively reduce dependence on labeled data, they generally focus on information consistency between labeled and unlabeled images, but ignore the potential context information between different areas of the RS image. In fact, the objects contained in an RS image usually have some long-range dependence between each other, since trees are usually on both sides of a road, and the middle of two rows of houses is commonly a road. Therefore, we believe that the potential dependencies between different areas of the RS image should be beneficial to reduce the label dependence of RS semantic segmentation. Based on this point, we propose a novel semisupervised RS image semantic segmentation network named SegMind, which is based on mean-teacher (MT) architecture and adopts masked image modeling (MIM) to enhance information interactions of different areas. Moreover, contrastive learning (CL) and entropy loss are introduced to SegMind framework to further improve the linear separability and prediction confidence of the proposed model. Experiments on three datasets have demonstrated the superiority of the proposed method over the state-of-the-art methods. The code is available at https://github.com/lzh-ggs-ddu/SegMind .</abstract><cop>New York</cop><pub>IEEE</pub><doi>10.1109/TGRS.2023.3321041</doi><tpages>17</tpages><orcidid>https://orcid.org/0000-0001-8111-6803</orcidid><orcidid>https://orcid.org/0000-0002-7880-3394</orcidid></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0196-2892 |
ispartof | IEEE transactions on geoscience and remote sensing, 2023, Vol.61, p.1-17 |
issn | 0196-2892 1558-0644 |
language | eng |
recordid | cdi_crossref_primary_10_1109_TGRS_2023_3321041 |
source | IEEE Electronic Library (IEL) Journals |
subjects | Annotations Contrastive learning (CL) Deep learning Image enhancement Image processing Image segmentation Learning systems masked image modeling (MIM) Modelling Remote sensing remote sensing (RS) image semantic segmentation Roads Semantic segmentation Semantics semisupervised learning Training Transformers |
title | SegMind: Semisupervised Remote Sensing Image Semantic Segmentation With Masked Image Modeling and Contrastive Learning Method |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-08T20%3A24%3A43IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=SegMind:%20Semisupervised%20Remote%20Sensing%20Image%20Semantic%20Segmentation%20With%20Masked%20Image%20Modeling%20and%20Contrastive%20Learning%20Method&rft.jtitle=IEEE%20transactions%20on%20geoscience%20and%20remote%20sensing&rft.au=Li,%20Zhenghong&rft.date=2023&rft.volume=61&rft.spage=1&rft.epage=17&rft.pages=1-17&rft.issn=0196-2892&rft.eissn=1558-0644&rft.coden=IGRSD2&rft_id=info:doi/10.1109/TGRS.2023.3321041&rft_dat=%3Cproquest_cross%3E2879377641%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c294t-4295ea4148c76acdceecbb22fa261a4247ec581c3b73561049825c00efe5c7613%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2879377641&rft_id=info:pmid/&rft_ieee_id=10268382&rfr_iscdi=true |