A Survey on Image Style Transfer Based on Deep Learning
DOI:
https://doi.org/10.54097/mxgtcj89Keywords:
Image Style Transfer, Deep Learning, Neural Networks, Model Algorithms, Computer VisionAbstract
With the rapid development of deep learning technology, style transfer has achieved significant breakthroughs at both algorithmic and application levels, greatly promoting innovative interactions between content and style. This paper comprehensively reviews the fundamental concepts, classifications, and widespread applications of style transfer within neural networks, with a particular focus on the principles, diverse variants, and synthesis techniques of neural network-based style transfer. Additionally, the article explores image style transfer as a research hotspot in the field of artificial intelligence and computer vision, highlighting that traditional methods primarily rely on physical and texture synthesis techniques, which often result in coarse outcomes and poor robustness. Through comparative analysis, this paper outlines the development trajectory and latest research advances in image style transfer and proposes future research directions.
References
[1] Tang Renwei, Liu Qi, and Tan Hao. "A Review of Neural Style Transfer Models." Journal of Computer Engineering & Applications, 57.19 (2021).
[2] Li Benjia. "A Review of the Development of Non-Photorealistic Rendering Techniques." Computer Knowledge and Technology, 2018, 14(35): 188-190.
DOI:10.14004/j.cnki.ckt.2018.4166.
[3] Efros, AA,Freeman, WT.Image quilting for texture synthesis and transfer[C].//COMPUTER GRAPHICS.2001:341-346.
[4] Hertzmann, Aaron, et al. "Image analogies." Seminal Graphics Papers: Pushing the Boundaries, Volume 2. 2023. 557-570.
[5] Gatys, Leon A.Ecker, Alexander S.Bethge, Matthias.Image Style Transfer Using Convolutional Neural Networks[C]. // IEEE Conference on Computer Vision and Pattern Recognition. 2016: 2414-2423.
[6] Liu Jiaxiong, Zhou Jun. "A Review of Style Transfer Methods Based on Deep Learning." Science and Engineering of Artificial Intelligence, 2024, (01): 1-19.
[7] Goodfellow, Ian, et al. "Generative adversarial nets." Advances in neural information processing systems 27 (2014).
[8] Zhu, Jun-Yan, et al. "Unpaired image-to-image translation using cycle-consistent adversarial networks." Proceedings of the IEEE international conference on computer vision. 2017.
[9] Ho, Jonathan, Ajay Jain, and Pieter Abbeel. "Denoising diffusion probabilistic models." Advances in neural information processing systems 33 (2020): 6840-6851.
[10] Dhariwal, Prafulla, and Alexander Nichol. "Diffusion models beat gans on image synthesis." Advances in neural information processing systems 34 (2021): 8780-8794.
[11] Lin, T. Y., Maire, M., Belongie, S., Hays, J., Perona, P., Ramanan, D., Dollár, P., & Zitnick, C. L. (2014). Microsoft COCO: Common objects in context. European Conference on Computer Vision (ECCV), 740-755.
[12] Deng, J., Dong, W., Socher, R., Li, L. J., Li, K., & Fei-Fei, L. (2009). ImageNet: A large-scale hierarchical image database. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 248-255.
[13] Saleh, Babak, and Ahmed Elgammal. "Large-scale classification of fine-art paintings: Learning the right metric on the right feature." arXiv preprint arXiv:1505.00855 (2015).
[14] Nilsback, Maria-Elena, and Andrew Zisserman. "Automated flower classification over a large number of classes." 2008 Sixth Indian conference on computer vision, graphics & image processing. IEEE, 2008.
[15] Cordts, Marius, et al. "The cityscapes dataset for semantic urban scene understanding." Proceedings of the IEEE conference on computer vision and pattern recognition. 2016.
[16] Isola, Phillip, et al. "Image-to-image translation with conditional adversarial networks." Proceedings of the IEEE conference on computer vision and pattern recognition. 2017.
[17] Zhu, Jun-Yan, et al. "Unpaired image-to-image translation using cycle-consistent adversarial networks." Proceedings of the IEEE international conference on computer vision. 2017.
[18] Park, Taesung, et al. "Contrastive learning for unpaired image-to-image translation." Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part IX 16. Springer International Publishing, 2020.
[19] Nichol, Alexander Quinn, and Prafulla Dhariwal. "Improved denoising diffusion probabilistic models." International conference on machine learning. PMLR, 2021.
[20] Saharia, Chitwan, et al. "Palette: Image-to-image diffusion models." ACM SIGGRAPH 2022 conference proceedings. 2022.
Downloads
Published
Issue
Section
License

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.