133 |
M. Kim, M. Jeong, B. J. Choi, D. Lee, and N. S. Kim, "Transduce and Speak: Neural Transducer for Text-to-Speech with Semantic Token Prediction" IEEE Automatic Speech Recognition and Understanding(ASRU), Dec. 2023. |
DOWNLOAD
|
132 |
S. H. Mun*, H. Shim*, H. Tak*, X. Wang, X. Liu, M. Sahidullah, M. Jeong, M. H. Han, M. Todisco, K. A. Lee, J. Yamagishi, N. Evans, T. Kinnunen, N. S. Kim, and J. Jung, "Towards single integrated spoofing-aware speaker verification embeddings," in Proc. Interspeech, Aug. 2023. |
DOWNLOAD
|
131 |
J. W. Yoon, S. M. Kim, and N. S. Kim, "MCR-Data2vec 2.0: Improving Self-supervised Speech Pre-training via Model-level Consistency Regularization," in Proc. Interspeech, Aug. 2023. |
DOWNLOAD
|
130 |
J. W. Yoon, S. Ahn, H. Lee, M. Kim, S. M. Kim, and N. S. Kim, "EM-Network: Oracle Guided Self-distillation for Sequence Learning," in Proc. ICML, 2023. |
DOWNLOAD
|
129 |
M. H. Han, S. H. Mun, M. Kim, M. Jeong, S. H. Ahn, and N. S. Kim, "Improving Learning Objectives for Speaker Verification from the Perspective of Score Comparison," IEEE International Conference Acoustic Speech Signal Processing(ICASSP), June. 2023. |
DOWNLOAD
|
128 |
J. Lyu, J. Yang, J. Kim, W. Lim, W. Ahn, D. Kang, M. Kim, and N. S. Kim, "Multi-resolution sequence Aggregation and Model-Agnostic framework for time-series forecasting" IEEE International Conference Acoustic Speech Signal Processing(ICASSP), June. 2023. |
DOWNLOAD
|
127 |
S. H. Mun, J. Jung, M. H. Han, and N. S. Kim, "Frequency and Multi-Scale Selective Kernel Attention for Speaker Verification," in Proc. IEEE SLT, 2022. |
DOWNLOAD
|
126 |
D. Lee*, M. Kim*, S. H. Mun, M. H, Han, and N. S. Kim, "FULLY UNSUPERVISED TRAINING OF FEW-SHOT KEYWORD SPOTTING," in Proc. IEEE SLT, 2022. |
DOWNLOAD
|
125 |
J. W. Yoon, B. J. Woo, S. Ahn, H. Lee, and N. S. Kim, "Inter-KD: Intermediate Knowledge Distillation for CTC-Based Automatic Speech Recognition," in Proc. IEEE SLT, 2022. |
DOWNLOAD
|
124 |
B.J. Choi, M. Jeong, M. Kim, S.H. Mun, and N.S. Kim, "Adversarial speaker-consistency learning using untranscribed speech data for zero-shot multi-speaker text-to-speech," in Proc. APSIPA ASC, 2022. |
DOWNLOAD
|