Controllable singing voice allows the prosody of an existing singing voice to be modified according to a given modified musical score.
We provide several samples for each singing type, and transform each singing type into the other types by changing the score condition.
Each item contains the Original Audio and the Original Score. "+x pitches" denotes that the Original Score is raised or lowered by x pitches to form the conditional score, which is fed into the system together with the Original Audio to produce the modified audio.
In this experiment, we use a shift of 6 pitches as the transition condition between adjacent types (the pitch can be modified freely). We also provide +0 pitches for each item as a reference for comparison with the other transformations.
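
As an illustration of how a conditional score could be derived from the Original Score, the following is a minimal sketch and not the system's actual preprocessing. It assumes the score is a list of MIDI note numbers with `None` marking rests, and that one "pitch" corresponds to one MIDI step; both are assumptions made for this example, and the name `shift_score` is hypothetical.

```python
from typing import List, Optional

# Assumption: notes are MIDI note numbers (0..127); None marks a rest.
MIDI_MIN, MIDI_MAX = 0, 127


def shift_score(score: List[Optional[int]], x: int) -> List[Optional[int]]:
    """Return a copy of `score` with every note shifted by x steps.

    Rests (None) are kept unchanged. A shift that leaves the MIDI
    range raises ValueError instead of silently clipping the note.
    """
    shifted: List[Optional[int]] = []
    for note in score:
        if note is None:
            shifted.append(None)
            continue
        new_note = note + x
        if not MIDI_MIN <= new_note <= MIDI_MAX:
            raise ValueError(f"note {note} shifted by {x} is out of range")
        shifted.append(new_note)
    return shifted


# Hypothetical Original Score: C4, D4, rest, E4.
original_score = [60, 62, None, 64]

# Build the conditional scores for the +0 reference and shifts of 6 up and down.
conditions = {
    f"{x:+d} pitches": shift_score(original_score, x) for x in (0, 6, -6)
}
```

In this sketch the "+0 pitches" entry is identical to the Original Score, which matches its role as the reference against which the shifted versions are compared.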