AI Color Organ: Piano Music Visualization using Onset Detection and HistoGAN

Authors

  • Shu Xu

DOI:

https://doi.org/10.54097/hset.v39i.6539

Keywords:

Onset Detection; ReHistoGAN; Music Visualization; Color Organ.

Abstract

The music visualization algorithm described in this study allows users to construct piano audio files using imported image files. This paper contributes to previous studies and designs of sonification by highlighting the effectiveness of utilizing onset detection in creating intuitive sonic changes. The audio-visual correspondences employed in this study could be expanded to many other syntheses and sample manipulation techniques. Translating visual information into sonic changes could yield many creative applications in music production, as it offers musicians a simultaneously optical and auditory production experience. This approach to audio manipulation also increases the unpredictability of the sound output, which could be appealing to experimental musicians seeking to control sounds with the visual structure of artworks that they enjoy, as opposed to precise parameters. It is looking forward to seeing creative implementations of the techniques in audio-visual artworks, music production tools, and interactive multimedia systems. These results shed light on guiding further exploration of AI composing.

Downloads

Download data is not yet available.

References

Rekveld, Joost. The Origin of The Term Visual Music – Light Matters. 16 Mar. 2013, Retrieved from: http://www.joostrekveld.net/?p=1105.

Reykjavik Center for Visual Music. About RCVM. Retrieved from: http://www.rcvm.is/. Accessed 7 Aug. 2022.

Marks Lawrence. The Unity of the Senses: Interrelations Among the Modalities. 1978.

McDonnell Maura. Visual Music. 2007.

Alafriz, Mikael, editor. “Introducing ‘Lucid Sonic Dreams’: Sync GAN Art To Music With A Few Lines Of Python Code!” Medium , 14 Mar. 2021, Retrieved from: https://towardsdatascience.com/introducing-lucid-sonic-dreams-sync-gan-art-to-music-with-a-few-lines-of-python-code-b04f88722de1.

Batty J, Horn K, Greuter S. Audiovisual granular synthesis: micro relationships between sound and image. Proceedings of the 9th Australasian Conference on Interactive Entertainment: Matters of Life and Death. 2013: 1-7.

McFee B, Raffel C, Liang D, et al. librosa: Audio and music signal analysis in python. Proceedings of the 14th python in science conference. 2015, 8: 18-25.

Afifi M, Brubaker M A, Brown M S. Histogan: Controlling colors of gan-generated and real images via color histograms. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 2021: 7941-7950.

Dannenberg R B. Interactive visual music: a personal perspective. Computer Music Journal, 2005, 29(4): 25-35.

Thaut M. Rhythm, music, and the brain: Scientific foundations and clinical applications. Routledge, 2013.

Weihs C, Jannach D, Vatolkin I, et al. Music data analysis: Foundations and applications. Chapman and Hall/CRC, 2016.

Downloads

Published

01-04-2023

How to Cite

Xu, S. (2023). AI Color Organ: Piano Music Visualization using Onset Detection and HistoGAN. Highlights in Science, Engineering and Technology, 39, 274-279. https://doi.org/10.54097/hset.v39i.6539