International Conference

MakeSinger: A Semi-Supervised Training Method for Data-Efficient Singing Voice Synthesis via Classifier-free Diffusion Guidance

S. Kim, M. Jeong, H. Lee, M. Kim, B. J. Choi, and N. S. Kim

Proc. Interspeech

Sep. 2024

High Fidelity Text-to-Speech Via Discrete Tokens Using Token Transducer and Group Masked Language Model

J. Y. lee, M. Jeong, M. Kim, J. Lee, H. Cho, and N. S. Kim

Proc. Interspeech

Sep. 2024

Transduce and Speak: Neural Transducer for Text-to-Speech with Semantic Token Prediction

M. Kim, M. Jeong, B. J. Choi, D. Lee, and N. S. Kim

IEEE Automatic Speech Recognition and Understanding(ASRU)

Dec. 2023

Towards single integrated spoofing-aware speaker verification embeddings

S. H. Mun*, H. Shim*, H. Tak*, X. Wang, X. Liu, M. Sahidullah, M. Jeong, M. H. Han, M. Todisco, K. A. Lee, J. Yamagishi, N. Evans, T. Kinnunen, N. S. Kim, and J. Jung

Proc. Interspeech

Aug. 2023

MCR-Data2vec 2.0: Improving Self-supervised Speech Pre-training via Model-level Consistency Regularization

J. W. Yoon, S. M. Kim, and N. S. Kim

Proc. Interspeech

Aug. 2023

EM-Network: Oracle Guided Self-distillation for Sequence Learning

J. W. Yoon, S. Ahn, H. Lee, M. Kim, S. M. Kim, and N. S. Kim

Proc. ICML

2023

Improving Learning Objectives for Speaker Verification from the Perspective of Score Comparison

M. H. Han, S. H. Mun, M. Kim, M. Jeong, S. H. Ahn, and N. S. Kim

IEEE International Conference Acoustic Speech Signal Processing(ICASSP)

June. 2023

Multi-resolution sequence Aggregation and Model-Agnostic framework for time-series forecasting

J. Lyu, J. Yang, J. Kim, W. Lim, W. Ahn, D. Kang, M. Kim, and N. S. Kim

IEEE International Conference Acoustic Speech Signal Processing(ICASSP)

June. 2023

Frequency and Multi-Scale Selective Kernel Attention for Speaker Verification

S. H. Mun, J. Jung, M. H. Han, and N. S. Kim

Proc. IEEE SLT

2022

FULLY UNSUPERVISED TRAINING OF FEW-SHOT KEYWORD SPOTTING

D. Lee*, M. Kim*, S. H. Mun, M. H, Han, and N. S. Kim

Proc. IEEE SLT

2022