International Journals

ASVspoof 5: Design, collection and validation of resources for spoofing, deepfake, and adversarial attack detection using crowdsourced speech

Xin Wang, Héctor Delgado, Hemlata Tak, Jee-weon Jung, Hye-jin Shim, Massimiliano Todisco, Ivan Kukanov, Xuechen Liu, Md Sahidullah, Tomi Kinnunen, Nicholas Evans, Kong Aik Lee, Junichi Yamagishi, Myeonghun Jeong, Ge Zhu, Yongyi Zang, You Zhang, Soumi Maiti, Florian Lux, Nicolas Müller, Wangyou Zhang, Chengzhe Sun, Shuwei Hou, Siwei Lyu, Sébastien Le Maguer, Cheng Gong, Hanjie Guo, Liping Chen, Vishwanath Singh

Computer Speech & Language, Volume 95

2026

Generalized Score Comparison-Based Learning Objective for Deep Speaker Embedding

M. H. Han, S. H. Mun, and N. S. Kim

IEEE Access, vol. 13, pp. 51194-51207

2025

Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction

M. Kim, M. Jeong, B. J. Choi, S. Kim, J. Y. Lee, and N. S. Kim

IEEE Transactions on Audio, Speech and Language Processing, vol. 33, pp. 1922-1932

2025

SegINR: Segment-Wise Implicit Neural Representation for Sequence Alignment in Neural Text-to-Speech

M. Kim, M. Jeong, J. Y. Lee, and N. S. Kim

IEEE Signal Processing Letters, vol. 32, pp. 646-650

2025

Sampling-Based Pruned Knowledge Distillation for Training Lightweight RNN-T

S. Kim, D. Lee, J. Y. Kang, M. Jeong, and N. S. Kim

IEEE Signal Processing Letters, vol. 32, pp. 631-635

2025

Towards Maximum Likelihood Training for Transducer-Based Streaming Speech Recognition

H. Lee, J. W. Yoon, S. Kim, and N. S. Kim

IEEE Signal Processing Letters, vol. 32, pp. 26-30

2025

HILCodec: High-Fidelity and Lightweight Neural Audio Codec

S. Ahn, B. J. Woo, M. H. Han, C. Moon, and N. S. Kim

IEEE Journal of Selected Topics in Signal Processing, vol. 18, no. 8, pp. 1517-1530

2024

Novel Deep Learning-Based Vocal Biomarkers for Stress Detection in Koreans

J. Namkung, S. M. Kim, W. I. Cho, S. Y. Yoo, B. Min, S. Y. Lee, J.-H. Lee, H. Park, S. Baik, J.-Y. Yun, N. S. Kim, and J.-H. Kim

Psychiatry Investigation

2024

Cons-KD: Dropout-Robust Knowledge Distillation for CTC-Based Automatic Speech Recognition

J. W. Yoon, H. Lee, J. Y. Kang, and N. S. Kim

IEEE Access, vol. 12, pp. 131136-131146

2024

Efficient Parallel Audio Generation Using Group Masked Language Modeling

M. Jeong, M. Kim, J. Y. Lee, and N. S. Kim

IEEE Signal Processing Letters, vol. 31, pp. 979-983

2024