📝 Publications
Selected Papers
ICLR

MegaTTS 2: Boosting Prompting Mechanisms for Zero-Shot Speech Synthesis
Ziyue Jiang, Jinglin Liu*, Yi Ren, Jinzheng He, Zhenhui Ye, Shengpeng Ji, Qian Yang, Chen Zhang, Pengfei Wei, Chunfeng Wang, Xiang Yin, Zejun MA, Zhou Zhao; & ZJU
- Brief Introduction: Large Text-to-Speech Model.
AAAI

DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism
Jinglin Liu, Chengxi Li, Yi Ren, Feiyang Chen, Zhou Zhao
Project | |
|
- Brief Introduction: This paper contains the first acoustic models based on diffusion, including DiffSinger (SVS) and DiffSpeech (TTS). It realizes the high-quality speech/singing synthesis.
- DiffSinger & DiffSpeech have recieved
, and the downloads number of pre-trained model is
.
- Many video demos created by Bilibili creators are released. And Diffsinger is introduced by a very popular video
!
- This work is included by many famous speech/music synthesis open-source projects, such as ESPNet
, PaddlePaddle/Parakeet
, muzic
.
NeurIPS

PortaSpeech: Portable and High-Quality Generative Text-to-Speech
Yi Ren*, Jinglin Liu*, Zhou Zhao
Project | |
- The source codes of this paper are released together with the codes of DiffSpeech. This repository has received
; It is shown on the Github Daily Trending List.
ACL

Learning the Beauty in Songs: Neural Singing Voice Beautifier
Jinglin Liu, Chengxi Li, Yi Ren, Zhiying Zhu, Zhou Zhao
Project |
Recent Papers
- Ziyue Jiang*, Jinglin Liu*, Yi Ren*, Jinzheng He*, Chen Zhang, Zhenhui Ye, Pengfei Wei, Chunfeng Wang, Xiang Yin, Zejun Ma, Zhou Zhao, Boosting Prompting Mechanisms for Zero-Shot Speech Synthesis, ICLR 2024 (* equal contributions)
- Jinglin Liu*, Zhenhui Ye*, Qian Chen, Siqi Zheng, Wen Wang, Qinglin Zhang, Zhou Zhao, DopplerBAS: Binaural Audio Synthesis Addressing Doppler Effect, ACL 2023. (* equal contributions)
- Jinzheng He, Jinglin Liu, Zhenhui Ye, Rongjie Huang, Chenye Cui, Huadai Liu, Zhou Zhao, RMSSinger: Realistic-Music-Score based Singing Voice Synthesis, ACL 2023
- Rongjie Huang, Jiawei Huang, Dongchao Yang, Yi Ren, Luping Liu, Mingze Li, Zhenhui Ye, Jinglin Liu, Xiang Yin, Zhou Zhao, Make-An-Audio: Text-To-Audio Generation with Prompt-Enhanced Diffusion Models, ICML 2023
- Rongjie Huang*, Jinglin Liu*, Huadai Liu*, Yi Ren, Lichao Zhang, Jinzheng He, Zhou Zhao, TranSpeech: Speech-to-Speech Translation With Bilateral Perturbation, ICLR 2023. (* equal contributions)
- Zhenhui Ye, Ziyue Jiang, Yi Ren, Jinglin Liu, Jinzheng He, Zhou Zhao, GeneFace: Generalized and High-Fidelity Audio-Driven 3D Talking Face Synthesis, ICLR 2023
- Ziyue Jiang, Zhe Su, Zhou Zhao, Qian Yang, Yi Ren, Jinglin Liu, Zhenhui Ye, Dict-TTS: Learning to Pronounce with Prior Dictionary Knowledge for Text-to-Speech, NeurIPS 2022
- Rongjie Huang, Yi Ren, Jinglin Liu, Chenye Cui, Zhou Zhao, GenerSpeech: Towards Style Transfer for Generalizable Out-Of-Domain Text-to-Speech, NeurIPS 2022
- Jinglin Liu, Chengxi Li, Yi Ren, Zhiying Zhu, Zhou Zhao, Learning the Beauty in Songs: Neural Singing Voice Beautifier, ACL 2022
- Jinglin Liu*, Chengxi Li*, Yi Ren*, Feiyang Chen, Zhou Zhao, DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism, AAAI 2022. (* equal contributions)
- Yi Ren*, Jinglin Liu*, Zhou Zhao, PortaSpeech: Portable and High-Quality Generative Text-to-Speech, NeurIPS 2021. (* equal contributions)
- Jinglin Liu, Zhiying Zhu, Yi Ren, Wencan Huang, Baoxing Huai, Nicholas Jing Yuan, Zhou Zhao, Parallel and High-Fidelity Text-to-Lip Generation, AAAI 2022
- Jinglin Liu, Yi Ren, Zhou Zhao, Chen Zhang, Baoxing Huai, Jing Yuan, FastLR: Non-Autoregressive Lipreading Model with Integrate-and-Fire, ACM-MM 2020
- Jinglin Liu, Yi Ren, Xu Tan, Chen Zhang, Tao Qin, Zhou Zhao and Tie-Yan Liu, Task-Level Curriculum Learning for Non-Autoregressive Neural Machine Translation, IJCAI 2020
- Yi Ren*, Jinglin Liu*, Xu Tan, Chen Zhang, Qin Tao, Zhou Zhao, Tie-Yan Liu, SimulSpeech: End-to-End Simultaneous Speech to Text Translation, ACL 2020. (* equal contributions)
- Yi Ren*, Jinglin Liu*, Xu Tan, Zhou Zhao, Sheng Zhao, Tie-Yan Liu, A Study of Non-autoregressive Model for Sequence Generation, ACL 2020. (* equal contributions)
My full paper list is shown at my scholar homepage.