Zachary Novack
zacknovack.bsky.social
Zachary Novack
@zacknovack.bsky.social
Efficient+Controllable Audio Generation @ UCSD | Interning Stability AI, Adobe | Teaching drums @ POW Percussion
Reposted by Zachary Novack
We've heard you! Time after ICASSP is feeling tight for many, and thanks to a very strong reviewer pool, we can reduce the review load and shorten the review period.
We are thus happy to announce a 1 week extension🤗
New #WASPAA2025 deadlines:
April 30: First submission
May 7: Final submission
April 19, 2025 at 8:12 PM
Hyped that 3/3 papers w/the folks
@ucsd-musaic.bsky.social
are accepted at #ICASSP2025!

PDMX: Public Domain Symbolic Music arxiv.org/abs/2409.10831
CoLLAP: Long-Context CLAP (~5 min) arxiv.org/abs/2410.02271
FUTGA-MIR: long music understanding for MIR tasks (arxiv soon)

Next stop, India!🇮🇳
PDMX: A Large-Scale Public Domain MusicXML Dataset for Symbolic Music Processing
The recent explosion of generative AI-Music systems has raised numerous concerns over data copyright, licensing music from musicians, and the conflict between open-source AI and large prestige compani...
arxiv.org
December 20, 2024 at 11:04 PM
Reposted by Zachary Novack
new paper! 🗣️Sketch2Sound💥

Sketch2Sound can create sounds from sonic imitations (i.e., a vocal imitation or a reference sound) via interpretable, time-varying control signals.

paper: arxiv.org/abs/2412.08550
web: hugofloresgarcia.art/sketch2sound
December 12, 2024 at 2:43 PM
Reposted by Zachary Novack
Blog post link: diffusionflow.github.io/

Despite seeming similar, there is some confusion in the community about the exact connection between the two frameworks. We aim to clear up the confusion by showing how to convert one framework to another, for both training and sampling.
Diffusion Meets Flow Matching
Flow matching and diffusion models are two popular frameworks in generative modeling. Despite seeming similar, there is some confusion in the community about their exact connection. In this post, we a...
diffusionflow.github.io
December 2, 2024 at 6:45 PM
Reposted by Zachary Novack
We just created a Bluesky starter pack featuring people and groups working at the intersection of AI and music, covering both symbolic and audio approaches. Let us know if you'd like to be added or removed!

go.bsky.app/PBvFCxa
November 28, 2024 at 3:20 AM