RAVE.MultiVox v0.5.0 Experimental
by Nao Tokui
This is a stereo timbre transfer model trained on the LibriSpeech dataset, which contains the voices of various English speakers. You can adjust the formant of the output voice sound with the second knob to make it sound more like a male or female voice. Input range: (80, 4000) Hz.