Human Interface Lab - 휴먼인터페이스 연구실

Welcome to
Human Interface Laboratory

Established in 1998 and directed by Prof. N.S.Kim, the Human Interface Laboratory conducts research on speech and audio signal processing. Current ongoing research topics include speech recognition, speech synthesis, speech enhancement, realistic acoustics, acoustic event detection, audio source seperation, and audio source localization with applications from machine learning. Not only focusing on speech and audio, this lab also covers the research in the field of Large Language Models.

About Us

서울대학교 휴먼인터페이스 연구실은 1998년 설립되어 2007년 국가 지정 연구실로 선정되었습니다. 음성 신호 처리 및 그 응용에 대한 연구를 수행하고 있으며 대표적으로 음성 인식, 합성, 화자 인식, 향상 등의 분야에서 활발하게 연구하고 있습니다.

교수 소개

연구 분야

최근 논문

ASVspoof 5: Design, collection and validation of resources for spoofing, deepfake, and adversarial attack detection using crowdsourced speech

Xin Wang, Héctor Delgado, Hemlata Tak, Jee-weon Jung, Hye-jin Shim, Massimiliano Todisco, Ivan Kukanov, Xuechen Liu, Md Sahidullah, Tomi Kinnunen, Nicholas Evans, Kong Aik Lee, Junichi Yamagishi, Myeonghun Jeong, Ge Zhu, Yongyi Zang, You Zhang, Soumi Maiti, Florian Lux, Nicolas Müller, Wangyou Zhang, Chengzhe Sun, Shuwei Hou, Siwei Lyu, Sébastien Le Maguer, Cheng Gong, Hanjie Guo, Liping Chen, Vishwanath Singh

Computer Speech & Language, Volume 95

2026

Generalized Score Comparison-Based Learning Objective for Deep Speaker Embedding

M. H. Han, S. H. Mun and N. S. Kim

IEEE Access, vol. 13, pp. 51194-51207

2025

SNR-Aligned Consistent Diffusion for Adaptive Speech Enhancement

Y. Jun, B. Woo, M. Jeong, N. Kim

Proc. Interspeech

Aug. 2025

Evidential-TTS: High Fidelity Zero-Shot Text-to-Speech Using Evidential Deep Learning

M. Jeong, M. Kim, S. Kim and N. S. Kim

IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

2025

공지사항

어디서 들어본 목소린데...보이스피싱범 잡는 AI / 연합뉴스TV

2024-07-12
홈페이지 리뉴얼

2024-07-11

ASVspoof 5: Design, collection and validation of resources for spoofing, deepfake, and adversarial attack detection using crowdsourced speech

Generalized Score Comparison-Based Learning Objective for Deep Speaker Embedding

SNR-Aligned Consistent Diffusion for Adaptive Speech Enhancement

Evidential-TTS: High Fidelity Zero-Shot Text-to-Speech Using Evidential Deep Learning

Human Interface Laboratory / 휴먼인터페이스 연구실