Lightnews — Scholar-powered news

Tiantian Feng

@tiantiaf.bsky.social

Ph.D. Postdoc@USC | Best USC Viterbi RA | Ex-intern@ Amazon, Meta | Interests: Human understanding, trustworthy computing, speech, multimodal, and wearable sensing | Love sports and music.

Posts Replies Media Videos

Reposted by Tiantian Feng

arXiv cs.SD Sound

@cssd-bot.bsky.social

Tiantian Feng, et al.: Vox-Profile: A Speech Foundation Model Benchmark for Characterizing Diverse Speaker and Speech Traits https://arxiv.org/abs/2505.14648 https://arxiv.org/pdf/2505.14648 https://arxiv.org/html/2505.14648

May 21, 2025 at 6:02 AM

Tiantian Feng

@tiantiaf.bsky.social

Looking to design a better TTS and wondering how to enrich your dataset with more speaking styles (e.g. accent, emotion)? Check out my latest work: Vox-Profile (huggingface.co/papers/2505....).
Many models are available: github.com/tiantiaf0627...
Happy to connect! (My very first post here! 😜)

Paper page - Vox-Profile: A Speech Foundation Model Benchmark for Characterizing Diverse Speaker and Speech Traits

Join the discussion on this paper page

huggingface.co

May 23, 2025 at 4:45 AM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news