Skip to content

SOTA quality fast music transformer with symmetrical quad MIDI notes encoding

License

Notifications You must be signed in to change notification settings

asigalov61/Quad-Music-Transformer

Repository files navigation

Quad Music Transformer

SOTA quality fast music transformer with symmetrical quad MIDI notes encoding

Quad-Music-Transformer-Artwork (7)


Original Version

Open In Colab

Features demonstration


Composer Version

Open In Colab

MuseNet-style workflow for endless supervised continuation generation


Bulk Generator Version

Open In Colab

Bulk improvs and continuations generation


Technical notes

SOTA quality was achieved by using the following specific techniques:

1) Quality source MIDI dataset (quality over quantity)

2) MIDI dataset augmentation by time (x2) and pitches (x3)

3) Timings normalization, quantization and compression (128)

4) Larger model embed size (2048) with less layers (16) and heads (16)

5) Training longer since the MIDI dataset is small (2 full epochs)

6) Using MIDI instruments families (16) instead of full MIDI instruments range (128)

7) Using symmetrical quad MIDI notes encoding

8) 8k sequence length so that the model can learn long-term music scructure

9) Using fp16 precision so that the model is sufficiently fast with low memory footprint

10) Hex (16) MIDI velocity range to avoid velocity overfitting while preserving velocity details

11) Chords sorting by instruments families (L-H) and by pitch (H-L)


Project Los Angeles

Tegridy Code 2024