Learning the Beauty in Songs: Neural Singing Voice Beautifier

We introduce Neural Singing Voice Beautifier (NSVB), the first generative model for the singing voice beautifying (SVB) task. It adopts a conditional variational autoencoder as the backbone and learns latent representations of vocal tone. In NSVB, we propose a novel time-warping approach for pitch correction, Shape-Aware Dynamic Time Warping (SADTW), which improves the robustness of existing time-warping approaches and synchronizes the amateur recording with the template pitch curve. Furthermore, we propose a latent-mapping algorithm in the latent space to convert the amateur vocal tone to the professional one.

2022: Jinglin Liu, Chengxi Li, Yi Ren, Zhiying Zhu, Zhou Zhao

https://arxiv.org/pdf/2202.13277v2.pdf
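For listeners unfamiliar with time warping, here is a minimal sketch of classic dynamic time warping (DTW) between a template pitch curve and an amateur one. This is the baseline family of methods that SADTW builds on, not SADTW itself, whose shape-aware details are not given in the abstract. The function name `dtw_path` and the toy pitch values are assumptions made for illustration.

```python
import numpy as np

def dtw_path(template, amateur):
    """Classic DTW on two 1-D pitch curves (illustrative, not SADTW).

    Returns the total alignment cost and the list of (template_idx, amateur_idx) pairs.
    """
    n, m = len(template), len(amateur)
    cost = np.full((n + 1, m + 1), np.inf)
    cost[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(template[i - 1] - amateur[j - 1])  # point-wise pitch distance
            cost[i, j] = d + min(cost[i - 1, j - 1], cost[i - 1, j], cost[i, j - 1])

    # Backtrack from the last cell to recover the frame-to-frame alignment.
    path = []
    i, j = n, m
    while i > 0 and j > 0:
        path.append((i - 1, j - 1))
        step = int(np.argmin([cost[i - 1, j - 1], cost[i - 1, j], cost[i, j - 1]]))
        if step == 0:
            i, j = i - 1, j - 1
        elif step == 1:
            i -= 1
        else:
            j -= 1
    path.reverse()
    return float(cost[n, m]), path

# Toy example: the amateur holds the middle note one frame too long.
total, path = dtw_path([1.0, 2.0, 3.0], [1.0, 2.0, 2.0, 3.0])
print(total, path)
```

Plain DTW compares individual pitch values frame by frame. The abstract describes SADTW as a shape-aware approach that improves the robustness of such methods.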
