Deep Voice 3: Scaling text-to-speech with convolutional sequence learning W Ping, K Peng, A Gibiansky, SO Arik, A Kannan, S Narang, J Raiman, ... ICLR 2018, 2017 | 675* | 2017 |
Deep Voice 2: Multi-speaker neural text-to-speech S Arik, G Diamos, A Gibiansky, J Miller, K Peng, W Ping, J Raiman, ... NeurIPS 2017, 2017 | 500* | 2017 |
ClariNet: Parallel wave generation in end-to-end text-to-speech W Ping, K Peng, J Chen ICLR 2019, 2018 | 315 | 2018 |
Neural voice cloning with a few samples SO Arik, J Chen, K Peng, W Ping, Y Zhou NeurIPS 2018, 2018 | 305 | 2018 |
DiffWave: A versatile diffusion model for audio synthesis Z Kong, W Ping, J Huang, K Zhao, B Catanzaro ICLR 2021, 2020 | 271 | 2020 |
Non-autoregressive neural text-to-speech K Peng, W Ping, Z Song, K Zhao ICML 2020, 2019 | 112* | 2019 |
Cancer metastasis detection with neural conditional random field Y Li, W Ping Medical Imaging with Deep Learning, 2018 | 96 | 2018 |
WaveFlow: A compact flow-based model for raw audio W Ping, K Peng, K Zhao, Z Song ICML 2020, 2019 | 82 | 2019 |
Topic compositional neural language model W Wang, Z Gan, W Wang, D Shen, J Huang, W Ping, S Satheesh, L Carin AISTATS 2018, 2017 | 79 | 2017 |
On fast sampling of diffusion probabilistic models Z Kong, W Ping arXiv preprint arXiv:2106.00132, 2021 | 56 | 2021 |
End-to-end training of neural retrievers for open-domain question answering DS Sachan, M Patwary, M Shoeybi, N Kant, W Ping, WL Hamilton, ... ACL 2021, 2021 | 48 | 2021 |
Long-short transformer: Efficient transformers for language and vision C Zhu, W Ping, C Xiao, M Shoeybi, T Goldstein, A Anandkumar, ... NeurIPS 2021, 2021 | 45 | 2021 |
Million-scale near-duplicate video retrieval system Y Cai, L Yang, W Ping, F Wang, T Mei, XS Hua, S Li ACM Multimedia 2011, 2011 | 39 | 2011 |
Marginal structured SVM with hidden variables W Ping, Q Liu, A Ihler ICML 2014, 2014 | 33 | 2014 |
One TTS alignment to rule them all R Badlani, A Łańcucki, KJ Shih, R Valle, W Ping, B Catanzaro ICASSP 2022, 2021 | 29 | 2021 |
Multi-instance metric learning Y Xu, W Ping, AT Campbell ICDM 2011, 2011 | 29 | 2011 |
Large margin neural language model J Huang, Y Li, W Ping, L Huang EMNLP 2018, 2018 | 25 | 2018 |
Decomposition bounds for marginal MAP W Ping, Q Liu, A Ihler NIPS 2015, 2015 | 25 | 2015 |
Systems and methods for neural voice cloning with a few samples J Chen, K Peng, W Ping, Y Zhou US Patent 11,238,843, 2022 | 24 | 2022 |
Non-IID multi-instance dimensionality reduction by learning a maximum bag margin subspace W Ping, Y Xu, K Ren, CH Chi, S Furao AAAI 2010, 2010 | 23 | 2010 |