Loading…

Paddy field object detection for robotic combine based on real‐time semantic segmentation algorithm

The development of robotic combine for rice harvesting has garnered worldwide attention in recent years. The robotic combine is capable of running along a designated path; however, it still requires human operator supervision due to the lack of object detection sensors for safety purposes. To achiev...

Full description

Saved in:
Bibliographic Details
Published in:Journal of field robotics 2024-03, Vol.41 (2), p.273-287
Main Authors: Zhu, Jiajun, Iida, Michihisa, Chen, Sikai, Cheng, Shijing, Suguri, Masahiko, Masuda, Ryohei
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites cdi_FETCH-LOGICAL-c2570-3bc5aeed72df9cd597bb51d472c8d305d8aa84efc50c763b36bbcc7903dbc4443
container_end_page 287
container_issue 2
container_start_page 273
container_title Journal of field robotics
container_volume 41
creator Zhu, Jiajun
Iida, Michihisa
Chen, Sikai
Cheng, Shijing
Suguri, Masahiko
Masuda, Ryohei
description The development of robotic combine for rice harvesting has garnered worldwide attention in recent years. The robotic combine is capable of running along a designated path; however, it still requires human operator supervision due to the lack of object detection sensors for safety purposes. To achieve a fully unmanned robotic combine, a real‐time paddy field object detection method is necessary. Typically, all paddy field objects are detected individually using multiple algorithms and sensors, which significantly increases the complexity and cost of the detection process. In this study, the deep learning (DL) based semantic segmentation (SS) method was employed to detect all paddy field objects simultaneously using only an RGB camera. Considering the environment of the paddy field, a new SS model called “The Robotic Combine Network (TRCNet)” was specifically designed for the robotic combine. And four state‐of‐the‐art lightweight convolutional neural networks were applied as the backbones of the TRCNet. To achieve real‐time detection, TensorRT (NVIDIA) was utilized for speeding up the prediction process. All models were trained and evaluated using paddy field images captured during the robotic combine's harvesting process. The results showed that the TRCNet can successfully detect all paddy field objects. The mean intersection over union, and frames per second (FPS) of the best two SS models were 0.823, 47.48, and 0.834, 32.44, respectively. The FPS values were obtained after speed acceleration and tested with an image size of 640 × 480 pixels on an embedded processor (Jetson TX2), enabling real‐time object detection in paddy fields for the robotic combine.
doi_str_mv 10.1002/rob.22260
format article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2920920753</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2920920753</sourcerecordid><originalsourceid>FETCH-LOGICAL-c2570-3bc5aeed72df9cd597bb51d472c8d305d8aa84efc50c763b36bbcc7903dbc4443</originalsourceid><addsrcrecordid>eNp1kM1KAzEUhYMoWKsL3yDgysW0mWQy6Sy1-AeFiug65OdOTZmZ1GSKdOcj-Iw-iakj7oQL5y6-cy73IHSek0lOCJ0GryeU0pIcoFHOeZkVVSkO_3ZeHaOTGNeEFGxW8RGCR2XtDtcOGou9XoPpsYU-ifMdrn3AKdH3zmDjW-06wFpFSGiHA6jm6-Ozdy3gCK3q9lSEVQtdr37sqln54PrX9hQd1aqJcParY_Rye_M8v88Wy7uH-dUiM5QLkjFtuAKwgtq6MpZXQmue20JQM7OMcDtTalZAbTgxomSalVobIyrCrDZFUbAxuhhyN8G_bSH2cu23oUsnJa0oSSM4S9TlQJngYwxQy01wrQo7mRO5b1Gmn-VPi4mdDuy7a2D3PyiflteD4xtow3ah</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2920920753</pqid></control><display><type>article</type><title>Paddy field object detection for robotic combine based on real‐time semantic segmentation algorithm</title><source>Wiley</source><creator>Zhu, Jiajun ; Iida, Michihisa ; Chen, Sikai ; Cheng, Shijing ; Suguri, Masahiko ; Masuda, Ryohei</creator><creatorcontrib>Zhu, Jiajun ; Iida, Michihisa ; Chen, Sikai ; Cheng, Shijing ; Suguri, Masahiko ; Masuda, Ryohei</creatorcontrib><description>The development of robotic combine for rice harvesting has garnered worldwide attention in recent years. The robotic combine is capable of running along a designated path; however, it still requires human operator supervision due to the lack of object detection sensors for safety purposes. To achieve a fully unmanned robotic combine, a real‐time paddy field object detection method is necessary. Typically, all paddy field objects are detected individually using multiple algorithms and sensors, which significantly increases the complexity and cost of the detection process. In this study, the deep learning (DL) based semantic segmentation (SS) method was employed to detect all paddy field objects simultaneously using only an RGB camera. Considering the environment of the paddy field, a new SS model called “The Robotic Combine Network (TRCNet)” was specifically designed for the robotic combine. And four state‐of‐the‐art lightweight convolutional neural networks were applied as the backbones of the TRCNet. To achieve real‐time detection, TensorRT (NVIDIA) was utilized for speeding up the prediction process. All models were trained and evaluated using paddy field images captured during the robotic combine's harvesting process. The results showed that the TRCNet can successfully detect all paddy field objects. The mean intersection over union, and frames per second (FPS) of the best two SS models were 0.823, 47.48, and 0.834, 32.44, respectively. The FPS values were obtained after speed acceleration and tested with an image size of 640 × 480 pixels on an embedded processor (Jetson TX2), enabling real‐time object detection in paddy fields for the robotic combine.</description><identifier>ISSN: 1556-4959</identifier><identifier>EISSN: 1556-4967</identifier><identifier>DOI: 10.1002/rob.22260</identifier><language>eng</language><publisher>Hoboken: Wiley Subscription Services, Inc</publisher><subject>Acceleration ; Algorithms ; Artificial neural networks ; convolutional neural networks ; deep learning ; Frames per second ; Harvesting ; Machine learning ; Microprocessors ; object detection ; Object recognition ; paddy field ; real‐time ; robotic combine ; Robotics ; Semantic segmentation ; Semantics ; Sensors</subject><ispartof>Journal of field robotics, 2024-03, Vol.41 (2), p.273-287</ispartof><rights>2023 Wiley Periodicals LLC.</rights><rights>2024 Wiley Periodicals LLC.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c2570-3bc5aeed72df9cd597bb51d472c8d305d8aa84efc50c763b36bbcc7903dbc4443</cites><orcidid>0000-0002-2814-9080</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,27903,27904</link.rule.ids></links><search><creatorcontrib>Zhu, Jiajun</creatorcontrib><creatorcontrib>Iida, Michihisa</creatorcontrib><creatorcontrib>Chen, Sikai</creatorcontrib><creatorcontrib>Cheng, Shijing</creatorcontrib><creatorcontrib>Suguri, Masahiko</creatorcontrib><creatorcontrib>Masuda, Ryohei</creatorcontrib><title>Paddy field object detection for robotic combine based on real‐time semantic segmentation algorithm</title><title>Journal of field robotics</title><description>The development of robotic combine for rice harvesting has garnered worldwide attention in recent years. The robotic combine is capable of running along a designated path; however, it still requires human operator supervision due to the lack of object detection sensors for safety purposes. To achieve a fully unmanned robotic combine, a real‐time paddy field object detection method is necessary. Typically, all paddy field objects are detected individually using multiple algorithms and sensors, which significantly increases the complexity and cost of the detection process. In this study, the deep learning (DL) based semantic segmentation (SS) method was employed to detect all paddy field objects simultaneously using only an RGB camera. Considering the environment of the paddy field, a new SS model called “The Robotic Combine Network (TRCNet)” was specifically designed for the robotic combine. And four state‐of‐the‐art lightweight convolutional neural networks were applied as the backbones of the TRCNet. To achieve real‐time detection, TensorRT (NVIDIA) was utilized for speeding up the prediction process. All models were trained and evaluated using paddy field images captured during the robotic combine's harvesting process. The results showed that the TRCNet can successfully detect all paddy field objects. The mean intersection over union, and frames per second (FPS) of the best two SS models were 0.823, 47.48, and 0.834, 32.44, respectively. The FPS values were obtained after speed acceleration and tested with an image size of 640 × 480 pixels on an embedded processor (Jetson TX2), enabling real‐time object detection in paddy fields for the robotic combine.</description><subject>Acceleration</subject><subject>Algorithms</subject><subject>Artificial neural networks</subject><subject>convolutional neural networks</subject><subject>deep learning</subject><subject>Frames per second</subject><subject>Harvesting</subject><subject>Machine learning</subject><subject>Microprocessors</subject><subject>object detection</subject><subject>Object recognition</subject><subject>paddy field</subject><subject>real‐time</subject><subject>robotic combine</subject><subject>Robotics</subject><subject>Semantic segmentation</subject><subject>Semantics</subject><subject>Sensors</subject><issn>1556-4959</issn><issn>1556-4967</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNp1kM1KAzEUhYMoWKsL3yDgysW0mWQy6Sy1-AeFiug65OdOTZmZ1GSKdOcj-Iw-iakj7oQL5y6-cy73IHSek0lOCJ0GryeU0pIcoFHOeZkVVSkO_3ZeHaOTGNeEFGxW8RGCR2XtDtcOGou9XoPpsYU-ifMdrn3AKdH3zmDjW-06wFpFSGiHA6jm6-Ozdy3gCK3q9lSEVQtdr37sqln54PrX9hQd1aqJcParY_Rye_M8v88Wy7uH-dUiM5QLkjFtuAKwgtq6MpZXQmue20JQM7OMcDtTalZAbTgxomSalVobIyrCrDZFUbAxuhhyN8G_bSH2cu23oUsnJa0oSSM4S9TlQJngYwxQy01wrQo7mRO5b1Gmn-VPi4mdDuy7a2D3PyiflteD4xtow3ah</recordid><startdate>202403</startdate><enddate>202403</enddate><creator>Zhu, Jiajun</creator><creator>Iida, Michihisa</creator><creator>Chen, Sikai</creator><creator>Cheng, Shijing</creator><creator>Suguri, Masahiko</creator><creator>Masuda, Ryohei</creator><general>Wiley Subscription Services, Inc</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>7TB</scope><scope>8FD</scope><scope>F28</scope><scope>FR3</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><orcidid>https://orcid.org/0000-0002-2814-9080</orcidid></search><sort><creationdate>202403</creationdate><title>Paddy field object detection for robotic combine based on real‐time semantic segmentation algorithm</title><author>Zhu, Jiajun ; Iida, Michihisa ; Chen, Sikai ; Cheng, Shijing ; Suguri, Masahiko ; Masuda, Ryohei</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c2570-3bc5aeed72df9cd597bb51d472c8d305d8aa84efc50c763b36bbcc7903dbc4443</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Acceleration</topic><topic>Algorithms</topic><topic>Artificial neural networks</topic><topic>convolutional neural networks</topic><topic>deep learning</topic><topic>Frames per second</topic><topic>Harvesting</topic><topic>Machine learning</topic><topic>Microprocessors</topic><topic>object detection</topic><topic>Object recognition</topic><topic>paddy field</topic><topic>real‐time</topic><topic>robotic combine</topic><topic>Robotics</topic><topic>Semantic segmentation</topic><topic>Semantics</topic><topic>Sensors</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Zhu, Jiajun</creatorcontrib><creatorcontrib>Iida, Michihisa</creatorcontrib><creatorcontrib>Chen, Sikai</creatorcontrib><creatorcontrib>Cheng, Shijing</creatorcontrib><creatorcontrib>Suguri, Masahiko</creatorcontrib><creatorcontrib>Masuda, Ryohei</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>Mechanical &amp; Transportation Engineering Abstracts</collection><collection>Technology Research Database</collection><collection>ANTE: Abstracts in New Technology &amp; Engineering</collection><collection>Engineering Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Journal of field robotics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Zhu, Jiajun</au><au>Iida, Michihisa</au><au>Chen, Sikai</au><au>Cheng, Shijing</au><au>Suguri, Masahiko</au><au>Masuda, Ryohei</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Paddy field object detection for robotic combine based on real‐time semantic segmentation algorithm</atitle><jtitle>Journal of field robotics</jtitle><date>2024-03</date><risdate>2024</risdate><volume>41</volume><issue>2</issue><spage>273</spage><epage>287</epage><pages>273-287</pages><issn>1556-4959</issn><eissn>1556-4967</eissn><abstract>The development of robotic combine for rice harvesting has garnered worldwide attention in recent years. The robotic combine is capable of running along a designated path; however, it still requires human operator supervision due to the lack of object detection sensors for safety purposes. To achieve a fully unmanned robotic combine, a real‐time paddy field object detection method is necessary. Typically, all paddy field objects are detected individually using multiple algorithms and sensors, which significantly increases the complexity and cost of the detection process. In this study, the deep learning (DL) based semantic segmentation (SS) method was employed to detect all paddy field objects simultaneously using only an RGB camera. Considering the environment of the paddy field, a new SS model called “The Robotic Combine Network (TRCNet)” was specifically designed for the robotic combine. And four state‐of‐the‐art lightweight convolutional neural networks were applied as the backbones of the TRCNet. To achieve real‐time detection, TensorRT (NVIDIA) was utilized for speeding up the prediction process. All models were trained and evaluated using paddy field images captured during the robotic combine's harvesting process. The results showed that the TRCNet can successfully detect all paddy field objects. The mean intersection over union, and frames per second (FPS) of the best two SS models were 0.823, 47.48, and 0.834, 32.44, respectively. The FPS values were obtained after speed acceleration and tested with an image size of 640 × 480 pixels on an embedded processor (Jetson TX2), enabling real‐time object detection in paddy fields for the robotic combine.</abstract><cop>Hoboken</cop><pub>Wiley Subscription Services, Inc</pub><doi>10.1002/rob.22260</doi><tpages>15</tpages><orcidid>https://orcid.org/0000-0002-2814-9080</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 1556-4959
ispartof Journal of field robotics, 2024-03, Vol.41 (2), p.273-287
issn 1556-4959
1556-4967
language eng
recordid cdi_proquest_journals_2920920753
source Wiley
subjects Acceleration
Algorithms
Artificial neural networks
convolutional neural networks
deep learning
Frames per second
Harvesting
Machine learning
Microprocessors
object detection
Object recognition
paddy field
real‐time
robotic combine
Robotics
Semantic segmentation
Semantics
Sensors
title Paddy field object detection for robotic combine based on real‐time semantic segmentation algorithm
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-22T19%3A59%3A33IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Paddy%20field%20object%20detection%20for%20robotic%20combine%20based%20on%20real%E2%80%90time%20semantic%20segmentation%20algorithm&rft.jtitle=Journal%20of%20field%20robotics&rft.au=Zhu,%20Jiajun&rft.date=2024-03&rft.volume=41&rft.issue=2&rft.spage=273&rft.epage=287&rft.pages=273-287&rft.issn=1556-4959&rft.eissn=1556-4967&rft_id=info:doi/10.1002/rob.22260&rft_dat=%3Cproquest_cross%3E2920920753%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c2570-3bc5aeed72df9cd597bb51d472c8d305d8aa84efc50c763b36bbcc7903dbc4443%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2920920753&rft_id=info:pmid/&rfr_iscdi=true