Jaesung Huh
jaesunghuh.bsky.social
Jaesung Huh
@jaesunghuh.bsky.social
PhD student @VGG_Oxford; ex-intern @Meta Reality Labs; Audio-visual learning
📖 Advancing Active Speaker Detection for Egocentric Videos

This paper explores how to train robust Active Speaker Detection (ASD) model for egocentric videos.

Paper : ieeexplore.ieee.org/stamp/stamp....

Machine learning for multimodal data I (Poster)
Apr 11: 11:30 am - 1:00 pm
IEEE Xplore Full-Text PDF:
ieeexplore.ieee.org
April 1, 2025 at 6:19 AM
📖 The VoxCeleb Speaker Recognition Challenge: A Retrospective

This paper presents a review of the VoxCeleb Speaker Recognition Challenges (VoxSRC).

Paper : arxiv.org/abs/2408.14886

Speaker Recognition II (Poster)
Apr 9: 5:00 pm - 6:30 pm
The VoxCeleb Speaker Recognition Challenge: A Retrospective
The VoxCeleb Speaker Recognition Challenges (VoxSRC) were a series of challenges and workshops that ran annually from 2019 to 2023. The challenges primarily evaluated the tasks of speaker recognition ...
arxiv.org
April 1, 2025 at 6:19 AM
VoxConverse has become one of the most widely-used speaker diarization evaluation datasets since 2020. Please also check out the paper and dataset below.

Code : github.com/JaesungHuh/a...
Paper : arxiv.org/abs/2007.01216
Dataset : mm.kaist.ac.kr/datasets/vox...
February 20, 2025 at 4:26 AM
I'm releasing the audio-visual diarization pipeline that was used to create the VoxConverse dataset. Along with the original code, an enhanced version featuring new VAD and speaker verification models is now available.
February 20, 2025 at 4:25 AM
VoxConverse has become one of the most widely-used speaker diarization evaluation datasets since 2020. Please also check out the paper and dataset below.

Code : github.com/JaesungHuh/a...
Paper : arxiv.org/abs/2007.01216
GitHub - JaesungHuh/av-diarization: Audio-visual diarization pipeline used for creating VoxConverse dataset
Audio-visual diarization pipeline used for creating VoxConverse dataset - JaesungHuh/av-diarization
github.com
February 20, 2025 at 4:17 AM