Research on Quantitative Detection Algorithm Based on Hrnet

Zhuohui Li; Yanfang Fu

doi:10.54097/9wm2g323

Authors

Zhuohui Li
Yanfang Fu

DOI:

https://doi.org/10.54097/9wm2g323

Keywords:

Garment Size Detection, Key Point Detection, Deep Learning, Hrnet, Attention Mechanism.

Abstract

Aiming at the insufficient localization accuracy of traditional algorithms due to complex texture interference, diverse fabric deformations and sensitivity to small size errors in garment size detection, this paper proposes an improved HRNet-cloth key point detection model. By introducing full-dimensional dynamic convolution (ODConv) in the HRNet backbone network, we enhance the feature adaptation ability of the network to nonlinear deformation such as garment folds and draping, and effectively reduce the key point coordinate offset error; we design the EMA cross-dimensional attention mechanism module, fusing the channel and spatial dimensional feature responses to improve the localization robustness of the neckline, sleeve holes, and other detail regions; for the sub-pixel level regression requirements, we Construct an adaptive focus loss function to optimize the heat map peak distribution by dynamically adjusting the weights of difficult samples. Experiments show that the PR of HRNet-cloth on the self-built dataset ClothData reaches 100%, which is 11.6% higher than that of the benchmark model, and the absolute measurement error (AKE) of the dimensions is stabilized within ±1cm.

References

[1]Sun K, Xiao B, Liu D, et al. Deep high-resolution representation learning for human pose estimation[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2019: 5693-5703.

[2]Zhang X, Li H, Mo J, et al. Lightweight Human Pose Estimation with Hierarchical Feature Refinement[J]. IEEE Transactions on Image Processing, 2021, 30: 1234-1245.

[3]Wang C, Wang Y, Lin Z, et al. Deformable HRNet: A Deformable Convolution Enhanced Network for Human Pose Estimation[J]. IEEE Transactions on Multimedia, 2021, 23: 123-135.

[4]Li Y, Chen J, Zhang Z, et al. Mixed Attention Mechanism for Occluded Human Pose Estimation[C]//European Conference on Computer Vision. Springer, 2020: 456-472.

[5]Liu H, Fan Z, Wang T, et al. Dynamic Feature Selection for Dense Keypoint Detection[C]//AAAI Conference on Artificial Intelligence. 2022: 2145-2153.

[6]Zhao L, Li S, Wang Q, et al. Cascaded Refinement Network for High-Resolution Heatmap Regression[J]. International Journal of Computer Vision, 2022, 130(5): 1327-1345.

[7]Li X, Wang W, Hu X, et al. Dynamic Convolution: Attention over Convolution Kernels[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2020: 11030-11039.

[8]Zhang Y, Li Q, Zhou B, et al. EMANet: Enhanced Multi-scale Attention for Keypoint Detection[C]//European Conference on Computer Vision. Springer, 2022: 678-694.

Research on Quantitative Detection Algorithm Based on Hrnet

Authors

DOI:

Keywords:

Abstract

References

Downloads

Published

Issue

Section

License

How to Cite

Cover

Indexing & Abstracting

Keywords

Latest publications