SIS: A new multi-scale convolutional operator期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

按检索

SIS: A new multi-scale convolutional operator

Authors:	Man Zhou Xueyang Fu Aiping Liu

Institution:	School of Information Science and Tecnology, University of Science and Technology of China, Hefei 230027 China

Abstract:	Visual features with high potential for generalization are critical for computer vision applications. In addition to the computational overhead associated with layer-by-layer feature stacking to produce multi-scale feature maps, existing approaches also incur high computational costs. To address this issue, we present a compact and efficient scale-in-scale convolution operator called SIS by incorporating an efficient progressive multi-scale architecture into a standard convolution operator. More precisely, the suggested operator uses the channel transform-divide-and-conquer technique to optimize conventional channel-wise computing, thereby lowering the computational cost while simultaneously expanding the receptive fields within a single convolution layer. Moreover, the proposed SIS operator incorporates weight-sharing with split-and-interact and recur-and-fuse mechanisms for enhanced variant design. The suggested SIS series is easily pluggable into any promising convolutional backbone, such as the well-known ResNet and Res2Net. Furthermore, we incorporated the proposed SIS operator series into 29-layer, 50-layer, and 101-layer ResNet as well as Res2Net variants and evaluated these modified models on the widely used CIFAR, PASCAL VOC, and COCO2017 benchmark datasets, where they consistently outperformed state-of-the-art models on a variety of major vision tasks, including image classification, key point estimation, semantic segmentation, and object detection.

Keywords:	multi-scale convolutional operator image classification key point estimation semantic segmentation object detection

	点击此处可从《中国科学技术大学学报》浏览原始摘要信息
	点击此处可从《中国科学技术大学学报》下载免费的PDF全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏