Lightnews — Scholar-powered news

@vi-lewis.bsky.social

7 followers 42 following 2 posts

Audio ML @ Fazertone.com & Resonyx.co

Posts Replies Media Videos

@vi-lewis.bsky.social

I’m looking into improving VAEs for diffusion models and it’s surprising how few diverge from the original (no pun intended) of the KLD loss. This paper arxiv.org/pdf/2309.13160 by Mariano Rivera seems much more promising by calculating the mean and variance per batch instead of per sample

arxiv.org

November 28, 2024 at 12:11 AM

@vi-lewis.bsky.social

Why isn’t the constant Q transform more used when it comes to audio ML? Its properties seem much more advantageous vs the Mel spectrogram when it comes to music or audio generation in general where there’s both low frequency and high frequency signals

November 27, 2024 at 11:50 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news