Research on Quantitative Detection Algorithm Based on Hrnet
DOI:
https://doi.org/10.54097/9wm2g323Keywords:
Garment Size Detection, Key Point Detection, Deep Learning, Hrnet, Attention Mechanism.Abstract
Aiming at the insufficient localization accuracy of traditional algorithms due to complex texture interference, diverse fabric deformations and sensitivity to small size errors in garment size detection, this paper proposes an improved HRNet-cloth key point detection model. By introducing full-dimensional dynamic convolution (ODConv) in the HRNet backbone network, we enhance the feature adaptation ability of the network to nonlinear deformation such as garment folds and draping, and effectively reduce the key point coordinate offset error; we design the EMA cross-dimensional attention mechanism module, fusing the channel and spatial dimensional feature responses to improve the localization robustness of the neckline, sleeve holes, and other detail regions; for the sub-pixel level regression requirements, we Construct an adaptive focus loss function to optimize the heat map peak distribution by dynamically adjusting the weights of difficult samples. Experiments show that the PR of HRNet-cloth on the self-built dataset ClothData reaches 100%, which is 11.6% higher than that of the benchmark model, and the absolute measurement error (AKE) of the dimensions is stabilized within ±1cm.
References
[1]Sun K, Xiao B, Liu D, et al. Deep high-resolution representation learning for human pose estimation[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2019: 5693-5703.
[2]Zhang X, Li H, Mo J, et al. Lightweight Human Pose Estimation with Hierarchical Feature Refinement[J]. IEEE Transactions on Image Processing, 2021, 30: 1234-1245.
[3]Wang C, Wang Y, Lin Z, et al. Deformable HRNet: A Deformable Convolution Enhanced Network for Human Pose Estimation[J]. IEEE Transactions on Multimedia, 2021, 23: 123-135.
[4]Li Y, Chen J, Zhang Z, et al. Mixed Attention Mechanism for Occluded Human Pose Estimation[C]//European Conference on Computer Vision. Springer, 2020: 456-472.
[5]Liu H, Fan Z, Wang T, et al. Dynamic Feature Selection for Dense Keypoint Detection[C]//AAAI Conference on Artificial Intelligence. 2022: 2145-2153.
[6]Zhao L, Li S, Wang Q, et al. Cascaded Refinement Network for High-Resolution Heatmap Regression[J]. International Journal of Computer Vision, 2022, 130(5): 1327-1345.
[7]Li X, Wang W, Hu X, et al. Dynamic Convolution: Attention over Convolution Kernels[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2020: 11030-11039.
[8]Zhang Y, Li Q, Zhou B, et al. EMANet: Enhanced Multi-scale Attention for Keypoint Detection[C]//European Conference on Computer Vision. Springer, 2022: 678-694.
Downloads
Published
Issue
Section
License
Copyright (c) 2025 Zhuohui Li, Yanfang Fu

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.







