Introduction:

  • Controllable singing voice modification alters the prosody of an existing singing voice so that it follows a given modified musical score.
  • We provide several samples for each singing type (Alto, Soprano, Tenor, Bass). Each sample is transformed into the other types by changing the score condition.
  • Each item contains the Original Audio and the Original Score. "+x pitches" (or "-x pitches") means that the Original Score is shifted up (or down) by x pitches to form the conditional score. The conditional score is input into the system together with the Original Audio to produce the modified audio.
  • In this experiment, adjacent types are separated by a shift of 6 pitches (the pitch can otherwise be modified freely). We also provide a +0 pitches result for each item as a baseline for comparison with the other transformations.
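
The relationship between singing types and pitch shifts can be sketched as follows. This is a minimal illustration, not the system's actual code: the score representation (a list of integer pitch values, with `None` for rests) and the function names `pitch_offset` and `transpose_score` are assumptions made for the example. The ordering Bass, Tenor, Alto, Soprano and the step of 6 pitches are read off the sample tables on this page.

```python
# Singing types ordered from lowest to highest, as implied by the sample tables.
VOICE_TYPES = ["Bass", "Tenor", "Alto", "Soprano"]

# Pitch shift between adjacent types used in this experiment.
STEP = 6


def pitch_offset(source, target):
    """Return the '+x pitches' value for converting `source` into `target`."""
    return STEP * (VOICE_TYPES.index(target) - VOICE_TYPES.index(source))


def transpose_score(score, offset):
    """Shift every pitch in a score by `offset`; rests (None) are left unchanged.

    `score` is a hypothetical representation: a list of integer pitch values.
    """
    return [None if pitch is None else pitch + offset for pitch in score]


# Hypothetical original score for an Alto sample.
original_score = [60, 62, None, 64]

# Conditional score for the "To Bass (-12 pitches)" transformation.
conditional_score = transpose_score(original_score, pitch_offset("Alto", "Bass"))
print(conditional_score)
```

With an offset of 0 the conditional score equals the Original Score, which corresponds to the +0 pitches comparison provided for each item.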


Audio Samples:

    Alto

    1. nuan nuan jiou zai xiong tang uo xiang shuo qi shi ni hen hao

       Original Audio (voc.) | Original Score | To Alto (+0 pitches)
       wav
       To Soprano (+6 pitches) | To Tenor (-6 pitches) | To Bass (-12 pitches)
       wav

    2. zai bei jia er hu pan duo shao nian i hou

       Original Audio (voc.) | Original Score | To Alto (+0 pitches)
       wav
       To Soprano (+6 pitches) | To Tenor (-6 pitches) | To Bass (-12 pitches)
       wav

    Soprano

    1. i zuo zuo shan chuan

       Original Audio (voc.) | Original Score | To Soprano (+0 pitches)
       wav
       To Alto (-6 pitches) | To Tenor (-12 pitches) | To Bass (-18 pitches)
       wav

    2. hua luo zhi duo shao

       Original Audio (voc.) | Original Score | To Soprano (+0 pitches)
       wav
       To Alto (-6 pitches) | To Tenor (-12 pitches) | To Bass (-18 pitches)
       wav

    Tenor

    1. zhi shi u chu ting bai jiou ge chang ba

       Original Audio (voc.) | Original Score | To Tenor (+0 pitches)
       wav
       To Alto (+6 pitches) | To Bass (-6 pitches) | To Soprano (+12 pitches)
       wav

    2. zhi shi na zhong uen rou zai ie zhao bu dao iong bao de li iou

       Original Audio (voc.) | Original Score | To Tenor (+0 pitches)
       wav
       To Alto (+6 pitches) | To Bass (-6 pitches) | To Soprano (+12 pitches)
       wav

    Bass

    1. ting hai ku de sheng in

       Original Audio (voc.) | Original Score | To Bass (+0 pitches)
       wav
       To Tenor (+6 pitches) | To Alto (+12 pitches) | To Soprano (+18 pitches)
       wav

    2. jiou dang zuei hou ve ding

       Original Audio (voc.) | Original Score | To Bass (+0 pitches)
       wav
       To Tenor (+6 pitches) | To Alto (+12 pitches) | To Soprano (+18 pitches)
       wav