Research Advanced in Image Generation Based on Diffusion Probability Model

Authors

  • Yuhan Huang

DOI:

https://doi.org/10.54097/waybgz41

Keywords:

Image generation; Diffusion probability model; Deep learning; Application

Abstract

Image generation has been a popular research task in the computer vision community, which aims to learn a distribution from a specific dataset and generate realistic images obeying this distribution. Thanks to the rapid development of deep learning technology, image generation models based on convolutional neural networks, especially generative adversarial networks (GANs) and variational autoencoders (VAEs), have become mainstream frameworks for image generation. However, in recent years, with the gradual deepening of the research on the denoising diffusion probability model (DDPM), the image generation technology based on DDPM has made new breakthroughs in accuracy and speed. Around the Diffusion Model, this paper introduces its latest research progress in image generation and derivative tasks. Specifically, this paper reviews the key techniques and basic theories of the diffusion model in detail. Then, the main research work, improvement mechanism and characteristics of the DDPM-based image generation method are summarized and summarized. This paper focuses on the basic structure and related applications of diffusion models, and evaluates some basic functions. Finally, the current problems and future development directions of image generation technology based on diffusion model are analyzed and summarized.

Downloads

Download data is not yet available.

References

Yang K, Ding X, Yuan X. Bayesian empirical likelihood inference and order shrinkage for autoregressive models [J]. Statistical Papers, 2022: 1-25.

Hou Y, Zhai J, Chen J. Coupled adversarial variational autoencoder [J]. Signal Processing: Image Communication, 2021, 98: 116396.

Xia W, Zhang Y, Yang Y, et al. Gan inversion: A survey [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022, 45(3): 3121-3138.

Wanle C, Choo Y H, Goh O S. Review of Generative Adversarial Networks in Image Generation [J]. Journal of Advanced Computational Intelligence and Intelligent Informatics, 2022, 26(1): 3-7.

Xu C, Lang W, Xin R, et al. Generative detect for occlusion object based on occlusion generation and feature completing [J]. Journal of Visual Communication and Image Representation, 2021, 78: 103189.

Aggarwal A, Mittal M, Battineni G. Generative adversarial network: An overview of theory and applications [J]. International Journal of Information Management Data Insights, 2021, 1(1): 100004.

Dhariwal P, Nichol A. Diffusion models beat gans on image synthesis [J]. Advances in neural information processing systems, 2021, 34: 8780-8794.

Nichol A Q, Dhariwal P. Improved denoising diffusion probabilistic models [C]//International Conference on Machine Learning. PMLR, 2021: 8162-8171.

Du C, Zhu J, Zhang B. Learning deep generative models with doubly stochastic gradient MCMC [J]. IEEE transactions on neural networks and learning systems, 2017, 29(7): 3084-3096.

Vega-Márquez B, Rubio-Escudero C, Nepomuceno-Chamorro I. Generation of synthetic data with conditional generative adversarial networks [J]. Logic Journal of the IGPL, 2022, 30(2): 252-262.

Khan Z N, Ahmad J. Attention induced multi-head convolutional neural network for human activity recognition [J]. Applied soft computing, 2021, 110: 107671.

Rajee M V, Mythili C. Gender classification on digital dental X-ray images using deep convolutional neural network [J]. Biomedical Signal Processing and Control, 2021, 69: 102939.

Downloads

Published

13-03-2024

How to Cite

Huang, Y. (2024). Research Advanced in Image Generation Based on Diffusion Probability Model. Highlights in Science, Engineering and Technology, 85, 452-456. https://doi.org/10.54097/waybgz41