📝 Publications

🎙 Machine Hearing

Measurement 2025

Coal gangue recognition in the strong background noise using two-level auditory feature fusion with attention mechanism
Yang Z, Wang SB, Yang SG, Liu SY, Zhang ZP, Liu HG*

Project

  • An auditory model is introduced into the coal gangue recognition task.
  • Two-level auditory features are extracted, which better express the information.
  • A fusion recognition model based on an attention mechanism is constructed.
ICLR 2021

FastSpeech 2: Fast and High-Quality End-to-End Text to Speech
Yi Ren, Chenxu Hu, Xu Tan, Tao Qin, Sheng Zhao, Zhou Zhao, Tie-Yan Liu

Project

ICLR 2024

Mega-TTS 2: Boosting Prompting Mechanisms for Zero-Shot Speech Synthesis
Ziyue Jiang, Jinglin Liu, Yi Ren, et al.

Project

  • This work has been deployed in many TikTok products.
  • An advanced zero-shot voice cloning model.
AAAI 2022

DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism
Jinglin Liu, Chengxi Li, Yi Ren, Feiyang Chen, Zhou Zhao

NeurIPS 2021

🎼 Active Middle-ear Implant

ICLR 2024

Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis
Zhenhui Ye, Tianyun Zhong, Yi Ren, et al. (Spotlight)

Project | Code

📚 NVH of Vehicle

Others