Lightnews — Scholar-powered news

Jaesung Huh

@jaesunghuh.bsky.social

📖 Advancing Active Speaker Detection for Egocentric Videos

This paper explores how to train robust Active Speaker Detection (ASD) model for egocentric videos.

Paper : ieeexplore.ieee.org/stamp/stamp....

Machine learning for multimodal data I (Poster)
Apr 11: 11:30 am - 1:00 pm

IEEE Xplore Full-Text PDF:

ieeexplore.ieee.org

April 1, 2025 at 6:19 AM

Jaesung Huh

@jaesunghuh.bsky.social

📖 The VoxCeleb Speaker Recognition Challenge: A Retrospective

This paper presents a review of the VoxCeleb Speaker Recognition Challenges (VoxSRC).

Paper : arxiv.org/abs/2408.14886

Speaker Recognition II (Poster)
Apr 9: 5:00 pm - 6:30 pm

The VoxCeleb Speaker Recognition Challenge: A Retrospective

The VoxCeleb Speaker Recognition Challenges (VoxSRC) were a series of challenges and workshops that ran annually from 2019 to 2023. The challenges primarily evaluated the tasks of speaker recognition ...

arxiv.org

April 1, 2025 at 6:19 AM

Jaesung Huh

@jaesunghuh.bsky.social

VoxConverse has become one of the most widely-used speaker diarization evaluation datasets since 2020. Please also check out the paper and dataset below.

Code : github.com/JaesungHuh/a...
Paper : arxiv.org/abs/2007.01216
Dataset : mm.kaist.ac.kr/datasets/vox...

February 20, 2025 at 4:26 AM

Jaesung Huh

@jaesunghuh.bsky.social

I'm releasing the audio-visual diarization pipeline that was used to create the VoxConverse dataset. Along with the original code, an enhanced version featuring new VAD and speaker verification models is now available.

February 20, 2025 at 4:25 AM

Jaesung Huh

@jaesunghuh.bsky.social

VoxConverse has become one of the most widely-used speaker diarization evaluation datasets since 2020. Please also check out the paper and dataset below.

Code : github.com/JaesungHuh/a...
Paper : arxiv.org/abs/2007.01216

GitHub - JaesungHuh/av-diarization: Audio-visual diarization pipeline used for creating VoxConverse dataset

Audio-visual diarization pipeline used for creating VoxConverse dataset - JaesungHuh/av-diarization

github.com

February 20, 2025 at 4:17 AM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news