The Investigation of Deep Learning Models Utilized in Vector Graphics Manipulation

Chen Yang

doi:10.54097/t057w194

Authors

Chen Yang

DOI:

https://doi.org/10.54097/t057w194

Keywords:

Computer Vision, Artificial Intelligence, Deep Learning, Vector Graphics, 3D Models.

Abstract

Vector graphics, 2D or 3D, hold paramount significance across various professional domains, including graphic design, web design, architecture, and engineering. However, traditional methods of creating vector graphics are characterized by low efficiency. This review explores the integration of some deep learning models designed for 2D and 3D vector graphics generation and manipulation, summarizing their main tasks and methods. In terms of 2D vector graphics, this review examines advanced models, including Convolutional Neural Networks, Generative Adversarial Networks, and more, for diverse tasks such as font or icon generation and image manipulation. For 3D vector graphics, this paper assesses the progress achieved in models tailored for point cloud and image reconstruction, as well as 3D shape generation, using approaches such as Variational Autoencoders, Multi-Layer Perceptrons, and Transformers. This review also assesses their progress and limitations, acknowledging a comprehensive overview of deep learning models in vector graphic manipulation, and emphasizing their potential impact on the design industry while recognizing the challenges ahead.

Downloads

Download data is not yet available.

References

Chapman, N.P., J. Chapman, and I. NetLibrary, Digital multimedia. Worldwide series in computer science. 2000, Chichester; New York: Wiley.

W3C. Scalable Vector Graphics (SVG) 1.1 (Second Edition). 2011; Available from: https: //www.w3.org/TR/SVG11/.

SARCAR, M.M.M., K. M. RAO, and K.L. NARAYAN, Computer Aided Design and Manufacturing. 2008: PHI Learning.

Song, Y., et al. CLIPVG: text-guided image manipulation using differentiable vector graphics. in Proceedings of the AAAI Conference on Artificial Intelligence. 2023.

Lee, J.R., L. Wang, and A. Wong, Emotion net nano: An efficient deep convolutional neural network design for real-time facial expression recognition. Frontiers in Artificial Intelligence, 2021. 3: p. 609673.

Zhang, M., et al. An end-to-end deep learning architecture for graph classification. in Proceedings of the AAAI conference on artificial intelligence. 2018.

LeCun, Y., et al., Gradient-based learning applied to document recognition. Proceedings of the IEEE, 1998. 86 (11): p. 2278 - 2324.

Kim, G., T. Kwon, and J.C. Ye. Diffusion clip: Text-guided diffusion models for robust image manipulation. in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2022.

Campbell, N. D. F. and J. Kautz, Learning a manifold of fonts. ACM Trans. Graph., 2014. 33 (4): p. Article 91.

Cao, D., et al. SVGformer: Representation Learning for Continuous Vector Graphics using Transformers. in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2023.

Lopes, R.G., et al. A learned representation for scalable vector graphics. in Proceedings of the IEEE/CVF International Conference on Computer Vision. 2019.

Reddy, P., et al., Im2Vec: Synthesizing Vector Graphics without Vector Supervision, in 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 2021, IEEE Computer Society. p. 7338 - 7347.

Wang, Y. and Z. Lian, DeepVecFont: Synthesizing high-quality vector fonts via dual-modality learning. ACM Transactions on Graphics (TOG), 2021. 40 (6): p. 1 - 15.

Carlier, A., et al., Deepsvg: A hierarchical generative network for vector graphics animation. Advances in Neural Information Processing Systems, 2020. 33: p. 16351 - 16361.

Liao, Y., S. Donne, and A. Geiger. Deep marching cubes: Learning explicit surface representations. in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2018.

Jones, R.K., et al., Shape assembly: Learning to generate programs for 3d shape structure synthesis. ACM Transactions on Graphics (TOG), 2020. 39 (6): p. 1 - 20.

Achlioptas, P., et al. Learning representations and generative models for 3d point clouds. in international conference on machine learning. 2018. PMLR.

Groueix, T., et al., Atlasnet: A papier-mâché approach to learning 3d surface generation. arxiv 2018. arXiv preprint arXiv:1802.05384, 1802.

Chen, Z. and H. Zhang. Learning implicit fields for generative shape modeling. in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2019.

Wu, R., C. Xiao, and C. Zheng. Deepcad: A deep generative network for computer-aided design models. in Proceedings of the IEEE/CVF International Conference on Computer Vision. 2021.

Lambourne, J.G., et al. Brepnet: A topological message passing system for solid models. in Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 2021.

Girdhar, R., et al. Learning a predictable and generative vector representation for objects. in Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, October 11 - 14, 2016, Proceedings, Part VI 14. 2016. Springer.

Hochreiter, S. and J. Schmid Huber, long short-term memory. Neural Comput, 1997. 9 (8): p. 1735 - 80.

Li, T.-M., et al., Differentiable vector graphics rasterization for editing and learning. ACM Transactions on Graphics (TOG), 2020. 39 (6): p. 1 - 15.

Lorensen, W.E. and H.E. Cline, Marching cubes: A high resolution 3D surface construction algorithm, in Seminal graphics: pioneering efforts that shaped the field. 1998. p. 347 - 353.

Vaswani, A., et al., Attention is all you need. Advances in neural information processing systems, 2017. 30.

Mo, K., et al. Partnet: A large-scale benchmark for fine-grained and hierarchical part-level 3d object understanding. in Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 2019.