Loading…

Shuffled-Xception-DarkNet-53: A content-based image retrieval model based on deep learning algorithm

•An advanced version of darknet-53, Shuffled-Xception-Darknet-53, is proposed.•A total of 5 Shuffled-Xception Module is incorporated with Darknet-53.•Three sets of 5 × 5, 3 × 3, and 1 × 1 filters are used in each Shuffled-Xception module.•Group Convolution is used in Xception module for informative...

Full description

Saved in:
Bibliographic Details
Published in:Computers & electrical engineering 2023-04, Vol.107, p.108647, Article 108647
Main Authors: Pathak, Debanjan, Raju, U.S.N.
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:•An advanced version of darknet-53, Shuffled-Xception-Darknet-53, is proposed.•A total of 5 Shuffled-Xception Module is incorporated with Darknet-53.•Three sets of 5 × 5, 3 × 3, and 1 × 1 filters are used in each Shuffled-Xception module.•Group Convolution is used in Xception module for informative feature extraction.•One Channel Shuffle layer is used between every two Group Convolution layers. This paper proposes Shuffled-Xception-DarkNet-53, an advanced version of DarkNet-53 for Content-Based Image Retrieval (CBIR). The proposed model introduced the notion of the Shuffled -Xception module, which uses three sets of 1 × 1, 3 × 3, and 5 × 5 size filters using a serial connection in place of a single 3 × 3 size filter to extract more refined features from the input images. Instead of the standard 2D Convolution operation, 'Group Convolution' is employed in the proposed Shuffled -Xception module to make the training process of the proposed CNN more efficient. Furthermore, 'Group Convolution' improves the co-relations among the filters of the corresponding Shuffled -Xception module, resulting in more informative features. Between every two serial Group Convolution layers of the same size, one Channel Shuffle layer is used to prevent the loss of information flow among the channels of different groups. The proposed method outperformed the twelve compared methods, including conventional and CNN-based CBIR methods, in ten standard image datasets. [Display omitted]
ISSN:0045-7906
1879-0755
DOI:10.1016/j.compeleceng.2023.108647