论文检索
期刊
全部知识仓储预印本开放期刊机构
高级检索

基于轻量级MobileNetV2-DeeplabV3+的棒材分割方法OA

Bar Segmentation Method Based on Lightweight MobileNetV2-DeeplabV3+

中文摘要英文摘要

针对当前语义分割模型为提升像素分割精度,不断增加算法复杂度,导致模型出现参数量大,耗时长,难以部署至工业现场等问题,提出一种基于轻量级MobileNetV2-DeeplabV3+模型的棒材分割算法.算法为平衡像素分割精度、模型参数量和算法检测速度,在原网络基础上做出一系列改进:将原有的Xception主干网络替换为轻量级MobileNetV2网络以降低模型参数量与计算复杂度;在空洞空间金字塔池化(ASPP)模块基础上密集连接各空洞卷积以获得更大的感受野,更加密集的像素采样,并扩大输出特征覆盖的语义信息;使用深度可分离卷积(DSConv)替代ASPP模块中的标准卷积进一步降低模型的计算复杂度;此外,引入有效通道注意力(ECA)模块聚焦目标边缘特征,增强特征图通道信息提取的效果.实验表明:改进后的模型在棒材数据集下平均交并比(MIOU)为89.37%,平均像素精度(MPA)为94.57%,帧率(FPS)为33.09帧/s,模型参数量为33.6 M.与U-net、M-PSPNet、M-DeeplabV3+等模型相比,改进后算法的MIOU值与MPA值略低于最佳值,但仍处于较高水准,模型参数量小,FPS值得到较大提升.实验表明:改进后的算法能较好地平衡分割精度和算法实时性,能满足部署至工业现场的需求.

In order to improve the pixel segmentation accuracy of the current semantic segmentation model,the algorithm complexity continues to increase,resulting in a large number of parameters,time-consuming,and difficulty in deploying to industrial sites.A bar segmentation algorithm based on the lightweight MobileNetV2-DeeplabV3+model was proposed.The algorithm made a series of improvements based on the original network in order to balance the pixel segmentation accuracy,the number of model parameters and the detection speed of the algorithm.The original Xception backbone network was replaced with a lightweight MobileNetV2 network to reduce the number of model parameters and computational complexity.On the basis of the Atrous Spatial Pyramid Pooling(ASPP)module,the atrous convolutions were densely connected to obtain a larger receptive field and denser pixel sampling,and to enlarge the semantic information covered by the output features.The computational complexity of the model was further reduced by using deep separable convolution(DSConv)instead of the standard convolution in the ASPP module.In addition,an effective channel attention(ECA)module was introduced to focus on the target edge features and enhance the effect of channel information extraction in the feature maps.The experiment showed that the improved model achieved a mean intersection over Union(MIOU)of 89.37%,a mean pixel accuracy(MPA)of 94.57%,a frame rate of 33.09 frames per second(FPS),and a model parameter size of 33.6 M on the bar dataset.Compared with the models of U-net,MPSPNet,and M-DeeplabV3+,the MIOU and MPA values of the improved algorithm were slightly lower than the best values,but still at a high level,with a small number of model parameters and a significant increase in FPS value.The example shows that the improved algorithm can better balance the segmentation accuracy and the real-time performance of the algorithm,and can meet the needs of deployment to industrial sites.

汤维杰;方挺;韩家明;袁东祥

安徽工业大学电气与信息工程学院,安徽马鞍山 243000

冶金工业

语义分割;DeepLabv3+模型;轻量级;棒材

semantic segmentation;DeepLabv3+model;lightweight;bar

《重庆工商大学学报(自然科学版)》 2024 (003)

66-71 / 6

安徽工业大学校青年基金(QZ202109);安徽工业大学校青年基金(QZ202109).

10.16055/j.issn.1672-058X.2024.0003.009

评论

下载量:0
点击量:0