HarmonyRFG: A Rule-Guided Spiking Transformer Framework for Real-Time Chord Progression Generation

Zimo Dong

doi:10.54097/ktcjx318

Authors

Zimo Dong

DOI:

https://doi.org/10.54097/ktcjx318

Keywords:

Music generation; chord progression; Spiking Neural Network; Transformer; Markov chain; generative art; interactive music.

Abstract

This paper presents HarmonyRFG, a rule-guided chord generation framework that connects established harmonic practice with deep-learning-based sequence modeling. The framework couples the long-range representation capacity of the Transformer with the event-driven temporal modeling of Spiking Neural Networks, so that chord identity, harmonic function, and duration can be learned as interdependent musical variables rather than isolated symbols. Markov transition constraints and harmony-based scoring are further introduced to regulate local chord movement while retaining generative flexibility. The proposed approach therefore supports chord sequences that are musically coherent, responsive to real-time control, and suitable for creative contexts such as interactive installations, ambient composition, human-computer co-creation, and therapeutic sound environments.

Downloads

Download data is not yet available.

References

[1] Cideron,Geoffrey et al. “MusicRL: Aligning Music Generation to Human Preferences.” ArXiv abs/2402.04229 (2024).

[2] Gaetan Hadjeres, Francois Pachet, Frank Nielsen, “DeepBach: a Steerable Model for Bach Chorales Generation,” ICML'17: Proceedings of the 34th International Conference on Machine Learning, Volume 70, pp. 1362—1371 (2017).

[3] Mozer, M. C., “Neural network music composition by prediction: Exploring the benefits of psychophysical constraints and multiscale processing,” Connection Sci., No. 6, pp. 247--280 (1994).

[4] Eck, D., J. Schmidhuber. “Learning the long-term structure of the blues,” Proc.2002 Internat. Conf. Artificial Neural Networks ICANN, 284–289 (2002).

[5] N Boulanger-Lewandowski, Y Bengio, P Vincent,“Modeling temporal dependencies in high-dimensional sequences: Application to polyphonic music generation and transcription”, ICML'12: Proceedings of the 29th International Coference on International Conference on Machine Learning, pp. 1881–1888(2012)

[6] YS Huang, YH Yang, “Pop Music Transformer: Beat-based Modeling and Generation of Expressive Pop Piano Compositions,” MM '20: Proceedings of the 28th ACM International Conference on Multimedia, pp. 1180–1188(2020).

[7] Man Yao, Xuerui Qiu, Tianxiang Hu, Jiakui Hu, Yuhong Chou, Keyu Tian, “Scaling Spike-Driven Transformer With Efficient Spike Firing Approximation Training”, IEEE Transactions on Pattern Analysis and Machine Intelligence, Volume 47, Issue 4, pp.2973–2990(2025).

[8] John Ashley Burgoyne, Jonathan Wild, and Ichiro Fujinaga, “An Expert Ground Truth Set for Audio Chord Recognition and Music Analysis”, Proceedings of the 12th International Society for Music Information Retrieval Conference, ed., pp. 633–38(2011).

[9] Christopher A. Harte, Mark B. Sandler, Samer A. Abdallah, and Emilia Gómez, “Symbolic Representation of Musical Chords: A Proposed Syntax for Text Annotations”, Proceedings of the 6th International Conference on Music Information Retrieval, ed., pp. 66–71(2005).

[10] Peter Simon Sapaty. “Towards Wholeness and Integrity of Distributed Dynamic Systems.” Journal of Computer Science & Systems Biology, 9：3(2016).

[11] McCormack Jon, and Alan Dorin. "Artistic practice as research in generative art." Leonardo 38, No. 2. pp.101-109 (2005).

[12] Galanter Philip. "What is Generative Art? Complexity theory as a context for art theory." Generative art 1, No. 1 (2003).

[13] Miranda Eduardo Reck. "On computational models of musical creativity." Contemporary Music Review 22, No. 4. pp. 25-47 (2003).

[14] Stefan Koelsch, “Towards a neural basis of music-evoked emotions，” Trends in Cognitive Sciences Volume 14, Issue 3, pp. 131-137(2010).

[15] Bérigny C.D., et al. “EEG and Sonic Platforms to Enhance Mindfulness Meditation.” Journal of Arts and Humanities，No.5, pp.1-12(2016).

[16] Barbara Maria Stafford, “From Visual Culture to Sensory Culture.” Leonardo, Vol. 35, No. 4, pp. 401–04(2002).

[17] Bro et al.,Musical Breaks, “Live Music in a Hemodialysis Setting--A Qualitative Study on Patient.” Nurse, and Musician Perspectives. Healthcare, 10.(2022).

[18] Barry Blesser,Linda-Ruth Salter, Spaces Speak, Are You Listening? (Cambridge: MIT Press, 2009). p 127-163

[19] Wang, et al., “Real-time Emotion-based Music Arrangement with Soft Transition.” IEEE Transactions on Affective Computing. (2023)

HarmonyRFG: A Rule-Guided Spiking Transformer Framework for Real-Time Chord Progression Generation

Authors

DOI:

Keywords:

Abstract

Downloads

References

Downloads

Published

Issue

Section

License

How to Cite

Cover

CNKI Indexing

Latest publications