📝 Publications

Selected Papers

ICLR
sym

MegaTTS 2: Boosting Prompting Mechanisms for Zero-Shot Speech Synthesis
Ziyue Jiang, Jinglin Liu*, Yi Ren, Jinzheng He, Zhenhui Ye, Shengpeng Ji, Qian Yang, Chen Zhang, Pengfei Wei, Chunfeng Wang, Xiang Yin, Zejun MA, Zhou Zhao; & ZJU

Project

  • Brief Introduction: Large Text-to-Speech Model.
AAAI
sym

DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism
Jinglin Liu, Chengxi Li, Yi Ren, Feiyang Chen, Zhou Zhao

Project | | Hugging Face | Hugging Face

  • Brief Introduction: This paper contains the first acoustic models based on diffusion, including DiffSinger (SVS) and DiffSpeech (TTS). It realizes the high-quality speech/singing synthesis.
  • DiffSinger & DiffSpeech have recieved GitHub Stars , and the downloads number of pre-trained model is downloads.
  • Many video demos created by Bilibili creators are released. And Diffsinger is introduced by a very popular video bilibili!
  • This work is included by many famous speech/music synthesis open-source projects, such as ESPNet , PaddlePaddle/Parakeet , muzic .
NeurIPS
sym

PortaSpeech: Portable and High-Quality Generative Text-to-Speech
Yi Ren*, Jinglin Liu*, Zhou Zhao

Project | | Hugging Face

  • The source codes of this paper are released together with the codes of DiffSpeech. This repository has received Github Stars; It is shown on the Github Daily Trending List.
ACL
sym

Learning the Beauty in Songs: Neural Singing Voice Beautifier
Jinglin Liu, Chengxi Li, Yi Ren, Zhiying Zhu, Zhou Zhao

Project | downloads

Recent Papers

My full paper list is shown at my scholar homepage.