Learning to segment images using region-based perceptual features

Bibliographic Details
Main Authors: Kaufhold, J., Hoogs, A.
Format: Conference Proceeding
Language: English
cited_by
cites
container_end_page II
container_issue
container_start_page II
container_title
container_volume 2
creator Kaufhold, J.
Hoogs, A.
description The recent establishment of a large-scale ground-truth database of image segmentations [D. Martin et al., 2001] has enabled the development of learning approaches to the general segmentation problem. Using this database, we present an algorithm that learns how to segment images using region-based, perceptual features. The image is first densely segmented into regions and the edges between them using a variant of the Mumford-Shah functional. Each edge is classified as a boundary or non-boundary using a classifier trained on the ground-truth, resulting in an edge image estimating human-designated boundaries. This novel approach has a few distinct advantages over filter-based methods such as local gradient operators. First, the same perceptual features can represent texture as well as regular structure. Second, the features can measure relationships between image elements at arbitrary distances in the image, enabling the detection of Gestalt properties at any scale. Third, texture boundaries can be precisely localized, which is difficult when using filter banks. Finally, the learning system outputs a relatively small set of intuitive perceptual rules for detecting boundaries. The classifier is trained on 200 images in the ground-truth database, and tested on another 100 images according to the benchmark evaluation methods. Edge classification improves the benchmark F-score from 0.54, for the initial Mumford-Shah-variant segmentation, to 0.61 on grayscale images. This increase of 13% demonstrates the versatility and representational power of our perceptual features, as the score exceeds published results for any algorithm restricted to one type of image feature such as texture or brightness gradient.
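The quoted 13% figure is a relative gain in benchmark F-score, which can be checked with a few lines of arithmetic. The sketch below assumes the usual definition of the F-score as the harmonic mean of precision and recall; the record does not report precision or recall values, so the function names and the sample inputs in the comments are illustrative only.

```python
def f_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (assumed benchmark definition)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


def relative_gain(before: float, after: float) -> float:
    """Relative improvement of `after` over `before`, as a fraction."""
    return (after - before) / before


# F-scores quoted in the description: initial segmentation vs. classified edges.
baseline, learned = 0.54, 0.61
gain = relative_gain(baseline, learned)
print(f"relative gain: {gain:.1%}")

# Hypothetical precision/recall pair, only to show how an F-score is formed.
print(f"example F-score: {f_score(0.60, 0.62):.2f}")
```

The relative gain (0.61 - 0.54) / 0.54 is roughly 0.13, which matches the 13% stated in the description.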
doi_str_mv 10.1109/CVPR.2004.1315268
format conference_proceeding
fullrecord <record><control><sourceid>pascalfrancis_6IE</sourceid><recordid>TN_cdi_pascalfrancis_primary_17623410</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>1315268</ieee_id><sourcerecordid>17623410</sourcerecordid><originalsourceid>FETCH-LOGICAL-i500-315d7d8c6f15b1c6ee19df993915247181204861aac2376b81ed295dc2086983</originalsourceid><addsrcrecordid>eNpFkE9LxDAQxQMquKz9AOKlF4-tM0mTJsel-A8KiorXJU2npdLtlqQ9-O2NrOBcHsx7PH48xq4RckQwd9Xn61vOAYocBUqu9BlLTKmhVEZylNqcsw2CEpkyaC5ZEsIXxJMQfb1hu5qsn4apT5djGqg_0LSkw8H2FNI1_P499cNxyhobqE1n8o7mZbVj2pFdVk_hil10dgyU_OmWvT_cf1RPWf3y-Fzt6myQAFlka8tWO9WhbNApIjRtZ4wwEbooUSOHQiu01nFRqkYjtdzI1nHQymixZben1tkGZ8fO28kNYT_7yOq_91gqLgqEmLs55QYi-rdP04gf4n5V4Q</addsrcrecordid><sourcetype>Index Database</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>Learning to segment images using region-based perceptual features</title><source>IEEE Electronic Library (IEL) Conference Proceedings</source><creator>Kaufhold, J. ; Hoogs, A.</creator><creatorcontrib>Kaufhold, J. ; Hoogs, A.</creatorcontrib><description>The recent establishment of a large-scale ground-truth database of image segmentations [D. Martin et al., 2001] has enabled the development of learning approaches to the general segmentation problem. Using this database, we present an algorithm that learns how to segment images using region-based, perceptual features. The image is first densely segmented into regions and the edges between them using a variant of the Mumford-Shah functional. Each edge is classified as a boundary or non-boundary using a classifier trained on the ground-truth, resulting in an edge image estimating human-designated boundaries. This novel approach has a few distinct advantages over filter-based methods such as local gradient operators. First, the same perceptual features can represent texture as well as regular structure. 
Second, the features can measure relationships between image elements at arbitrary distances in the image, enabling the detection of Gestalt properties at any scale. Third, texture boundaries can be precisely localized, which is difficult when using filter banks. Finally, the learning system outputs a relatively small set of intuitive perceptual rules for detecting boundaries. The classifier is trained on 200 images in the ground-truth database, and tested on another 100 images according to the benchmark evaluation methods. Edge classification improves the benchmark F-score from 0.54, for the initial Mumford-Shah-variant segmentation, to 0.61 on grayscale images. This increase of 13% demonstrates the versatility and representational power of our perceptual features, as the score exceeds published results for any algorithm restricted to one type of image feature such as texture or brightness gradient.</description><identifier>ISSN: 1063-6919</identifier><identifier>ISBN: 9780769521589</identifier><identifier>ISBN: 0769521584</identifier><identifier>DOI: 10.1109/CVPR.2004.1315268</identifier><language>eng</language><publisher>Los Alamitos, California: IEEE</publisher><subject>Applied sciences ; Artificial intelligence ; Benchmark testing ; Brightness ; Computer science; control theory; systems ; Exact sciences and technology ; Filter bank ; Gray-scale ; Image databases ; Image edge detection ; Image segmentation ; Information systems. Data bases ; Large-scale systems ; Learning systems ; Memory organisation. Data processing ; Pattern recognition. Digital image processing. Computational geometry ; Software ; Spatial databases</subject><ispartof>Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. 
CVPR 2004, 2004, Vol.2, p.II-II</ispartof><rights>2006 INIST-CNRS</rights><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/1315268$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,776,780,785,786,2052,4036,4037,27902,54895</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/1315268$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&amp;idt=17623410$$DView record in Pascal Francis$$Hfree_for_read</backlink></links><search><creatorcontrib>Kaufhold, J.</creatorcontrib><creatorcontrib>Hoogs, A.</creatorcontrib><title>Learning to segment images using region-based perceptual features</title><title>Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004</title><addtitle>CVPR</addtitle><description>The recent establishment of a large-scale ground-truth database of image segmentations [D. Martin et al., 2001] has enabled the development of learning approaches to the general segmentation problem. Using this database, we present an algorithm that learns how to segment images using region-based, perceptual features. The image is first densely segmented into regions and the edges between them using a variant of the Mumford-Shah functional. Each edge is classified as a boundary or non-boundary using a classifier trained on the ground-truth, resulting in an edge image estimating human-designated boundaries. This novel approach has a few distinct advantages over filter-based methods such as local gradient operators. First, the same perceptual features can represent texture as well as regular structure. 
Second, the features can measure relationships between image elements at arbitrary distances in the image, enabling the detection of Gestalt properties at any scale. Third, texture boundaries can be precisely localized, which is difficult when using filter banks. Finally, the learning system outputs a relatively small set of intuitive perceptual rules for detecting boundaries. The classifier is trained on 200 images in the ground-truth database, and tested on another 100 images according to the benchmark evaluation methods. Edge classification improves the benchmark F-score from 0.54, for the initial Mumford-Shah-variant segmentation, to 0.61 on grayscale images. This increase of 13% demonstrates the versatility and representational power of our perceptual features, as the score exceeds published results for any algorithm restricted to one type of image feature such as texture or brightness gradient.</description><subject>Applied sciences</subject><subject>Artificial intelligence</subject><subject>Benchmark testing</subject><subject>Brightness</subject><subject>Computer science; control theory; systems</subject><subject>Exact sciences and technology</subject><subject>Filter bank</subject><subject>Gray-scale</subject><subject>Image databases</subject><subject>Image edge detection</subject><subject>Image segmentation</subject><subject>Information systems. Data bases</subject><subject>Large-scale systems</subject><subject>Learning systems</subject><subject>Memory organisation. Data processing</subject><subject>Pattern recognition. Digital image processing. 
Computational geometry</subject><subject>Software</subject><subject>Spatial databases</subject><issn>1063-6919</issn><isbn>9780769521589</isbn><isbn>0769521584</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2004</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><recordid>eNpFkE9LxDAQxQMquKz9AOKlF4-tM0mTJsel-A8KiorXJU2npdLtlqQ9-O2NrOBcHsx7PH48xq4RckQwd9Xn61vOAYocBUqu9BlLTKmhVEZylNqcsw2CEpkyaC5ZEsIXxJMQfb1hu5qsn4apT5djGqg_0LSkw8H2FNI1_P499cNxyhobqE1n8o7mZbVj2pFdVk_hil10dgyU_OmWvT_cf1RPWf3y-Fzt6myQAFlka8tWO9WhbNApIjRtZ4wwEbooUSOHQiu01nFRqkYjtdzI1nHQymixZben1tkGZ8fO28kNYT_7yOq_91gqLgqEmLs55QYi-rdP04gf4n5V4Q</recordid><startdate>2004</startdate><enddate>2004</enddate><creator>Kaufhold, J.</creator><creator>Hoogs, A.</creator><general>IEEE</general><general>IEEE Computer Society</general><scope>6IE</scope><scope>6IH</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIO</scope><scope>IQODW</scope></search><sort><creationdate>2004</creationdate><title>Learning to segment images using region-based perceptual features</title><author>Kaufhold, J. ; Hoogs, A.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i500-315d7d8c6f15b1c6ee19df993915247181204861aac2376b81ed295dc2086983</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2004</creationdate><topic>Applied sciences</topic><topic>Artificial intelligence</topic><topic>Benchmark testing</topic><topic>Brightness</topic><topic>Computer science; control theory; systems</topic><topic>Exact sciences and technology</topic><topic>Filter bank</topic><topic>Gray-scale</topic><topic>Image databases</topic><topic>Image edge detection</topic><topic>Image segmentation</topic><topic>Information systems. Data bases</topic><topic>Large-scale systems</topic><topic>Learning systems</topic><topic>Memory organisation. 
Data processing</topic><topic>Pattern recognition. Digital image processing. Computational geometry</topic><topic>Software</topic><topic>Spatial databases</topic><toplevel>online_resources</toplevel><creatorcontrib>Kaufhold, J.</creatorcontrib><creatorcontrib>Hoogs, A.</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan (POP) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Electronic Library (IEL)</collection><collection>IEEE Proceedings Order Plans (POP) 1998-present</collection><collection>Pascal-Francis</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Kaufhold, J.</au><au>Hoogs, A.</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Learning to segment images using region-based perceptual features</atitle><btitle>Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004</btitle><stitle>CVPR</stitle><date>2004</date><risdate>2004</risdate><volume>2</volume><spage>II</spage><epage>II</epage><pages>II-II</pages><issn>1063-6919</issn><isbn>9780769521589</isbn><isbn>0769521584</isbn><abstract>The recent establishment of a large-scale ground-truth database of image segmentations [D. Martin et al., 2001] has enabled the development of learning approaches to the general segmentation problem. Using this database, we present an algorithm that learns how to segment images using region-based, perceptual features. The image is first densely segmented into regions and the edges between them using a variant of the Mumford-Shah functional. Each edge is classified as a boundary or non-boundary using a classifier trained on the ground-truth, resulting in an edge image estimating human-designated boundaries. 
This novel approach has a few distinct advantages over filter-based methods such as local gradient operators. First, the same perceptual features can represent texture as well as regular structure. Second, the features can measure relationships between image elements at arbitrary distances in the image, enabling the detection of Gestalt properties at any scale. Third, texture boundaries can be precisely localized, which is difficult when using filter banks. Finally, the learning system outputs a relatively small set of intuitive perceptual rules for detecting boundaries. The classifier is trained on 200 images in the ground-truth database, and tested on another 100 images according to the benchmark evaluation methods. Edge classification improves the benchmark F-score from 0.54, for the initial Mumford-Shah-variant segmentation, to 0.61 on grayscale images. This increase of 13% demonstrates the versatility and representational power of our perceptual features, as the score exceeds published results for any algorithm restricted to one type of image feature such as texture or brightness gradient.</abstract><cop>Los Alamitos, California</cop><pub>IEEE</pub><doi>10.1109/CVPR.2004.1315268</doi></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 1063-6919
ispartof Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004, 2004, Vol.2, p.II-II
issn 1063-6919
language eng
recordid cdi_pascalfrancis_primary_17623410
source IEEE Electronic Library (IEL) Conference Proceedings
subjects Applied sciences
Artificial intelligence
Benchmark testing
Brightness
Computer science; control theory; systems
Exact sciences and technology
Filter bank
Gray-scale
Image databases
Image edge detection
Image segmentation
Information systems. Data bases
Large-scale systems
Learning systems
Memory organisation. Data processing
Pattern recognition. Digital image processing. Computational geometry
Software
Spatial databases
title Learning to segment images using region-based perceptual features
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-06T02%3A48%3A57IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-pascalfrancis_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Learning%20to%20segment%20images%20using%20region-based%20perceptual%20features&rft.btitle=Proceedings%20of%20the%202004%20IEEE%20Computer%20Society%20Conference%20on%20Computer%20Vision%20and%20Pattern%20Recognition,%202004.%20CVPR%202004&rft.au=Kaufhold,%20J.&rft.date=2004&rft.volume=2&rft.spage=II&rft.epage=II&rft.pages=II-II&rft.issn=1063-6919&rft.isbn=9780769521589&rft.isbn_list=0769521584&rft_id=info:doi/10.1109/CVPR.2004.1315268&rft_dat=%3Cpascalfrancis_6IE%3E17623410%3C/pascalfrancis_6IE%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-i500-315d7d8c6f15b1c6ee19df993915247181204861aac2376b81ed295dc2086983%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=1315268&rfr_iscdi=true