artintech.substack.com
www.jordipons.me
We submitted an interactive & generative AI music piece.
We used AI for:
- Sound design
- Interactive playback
More details about:
- Human-AI co-creation process
- Name of the song
- Music video
- Live performance
👉 artintech.substack.com/p/our-ai-son...
- Singles
- Albums
- Performances
- Installations
- AI voices
- Operas
- Soundtracks
- AI composition
- Co-composition
- Sound design
- Lyrics generation
- Translation
📄 Paper (arXiv)
📂 Database (GitHub)
🎥 Video
👉 artintech.substack.com/p/report-art...
Based on adversarial post-training, it does not rely on distillation or CFG.
Runtime is reduced to milliseconds on GPUs, or seconds on CPUs (usage sketch below).
Weights huggingface.co/stabilityai/...
Blog stability.ai/news/stabili...
Paper arxiv.org/abs/2505.08175
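For anyone who wants to try it, here is a minimal generation sketch with the stable-audio-tools package, following the loading pattern from the Stable Audio Open model cards. The repo id, step count, and CFG scale below are assumptions, not confirmed settings; the Hugging Face model card linked above has the values tuned for this fast model.

```python
import torch
import torchaudio
from einops import rearrange
from stable_audio_tools import get_pretrained_model
from stable_audio_tools.inference.generation import generate_diffusion_cond

device = "cuda" if torch.cuda.is_available() else "cpu"

# Download weights + config from Hugging Face.
# (Repo id is an assumption; use the exact name from the weights link above.)
model, model_config = get_pretrained_model("stabilityai/stable-audio-open-small")
model = model.to(device)

# Text + timing conditioning for the generation.
conditioning = [{
    "prompt": "128 BPM tech house drum loop",
    "seconds_start": 0,
    "seconds_total": 11,
}]

# Few-step, no-CFG generation (steps/cfg_scale here are placeholders;
# the model card lists the recommended sampler and values).
output = generate_diffusion_cond(
    model,
    steps=8,
    cfg_scale=1.0,
    conditioning=conditioning,
    sample_size=model_config["sample_size"],
    device=device,
)

# Collapse the batch dimension, peak-normalize, and save as 16-bit WAV.
output = rearrange(output, "b d n -> d (b n)").to(torch.float32)
output = (output / output.abs().max()).clamp(-1, 1)
torchaudio.save("output.wav", (output * 32767).to(torch.int16).cpu(),
                model_config["sample_rate"])
```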
In the language of their time, 'indeterminacy' is akin to what we now call 'generative'—where some aspects of the composition are left to chance (not determined).
rohandrape.net/ut/rttcc-tex...
Thinking of art more as 'seeds' than 'artifacts' — and AI not as a 'tool', but as a 'mirror'.
And also to be aware that the line between audience and author blurs.
More ideas in the full article here 👇
artintech.substack.com/p/the-ai-art...
Beyond Copyright Battles—AI Art 👉 artintech.substack.com/p/ai-art-bey...
and found 5 trends that are shaping current AI music
🐝 Embrace the uncanny
🐝 Multi-genre AI music
🐝 AI music that reflects your culture
🐝 Lazy artwork
🐝 Not much Chinese or African AI music
👉 https://artintech.substack.com/p/my-top-5-ai-music-picks
platform.stability.ai/docs/api-ref...
CPUs go brrrrrrr
- Stage 1 (semantic tokens): dual-token language modeling from audio and text conditioning
- Stage 2 (acoustic tokens): vocal and instrumental language modeling from semantic tokens (using residual vector quantizers?)
- Stage 3 (audio): detokenization and upsampling (pipeline sketched below)
- Semantic audio tokens, to reduce training cost
- Dual tokens (vocal + instrumental) for track-synced vocal-instrumental modeling
- Lyrics chain-of-thought, to progressively generate the whole song in a single context following the lyrics conditioning (I'm not sure exactly what this is)
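To make the three-stage split concrete, here is a purely illustrative PyTorch sketch of the data flow. Every class, vocabulary size, and dimension below is a made-up placeholder for explanation only, not the actual model's code or API.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

# Purely illustrative stubs for the three stages described above.
# All names, vocab sizes, and shapes are invented placeholders.

class SemanticLM(nn.Module):
    """Stage 1: predict semantic tokens from text (lyrics/prompt) conditioning."""
    def __init__(self, vocab: int = 1024):
        super().__init__()
        self.vocab = vocab

    def generate(self, text_ids: torch.Tensor, n_frames: int) -> torch.Tensor:
        # Real model: autoregressive decoding conditioned on text (and audio).
        return torch.randint(0, self.vocab, (1, n_frames))

class AcousticLM(nn.Module):
    """Stage 2: expand semantic tokens into dual (vocal + instrumental) acoustic tokens."""
    def __init__(self, vocab: int = 4096, n_quantizers: int = 8):
        super().__init__()
        self.vocab, self.nq = vocab, n_quantizers

    def generate(self, semantic: torch.Tensor):
        frames = semantic.shape[1]
        vocal = torch.randint(0, self.vocab, (1, self.nq, frames))
        inst = torch.randint(0, self.vocab, (1, self.nq, frames))
        return vocal, inst  # track-synced: both streams share the same frame grid

class CodecDecoder(nn.Module):
    """Stage 3: detokenize acoustic tokens to a waveform, then upsample."""
    def __init__(self, hop: int = 320, upsample: int = 2):
        super().__init__()
        self.hop, self.upsample = hop, upsample

    def decode(self, vocal: torch.Tensor, inst: torch.Tensor) -> torch.Tensor:
        frames = vocal.shape[-1]
        mix = torch.randn(1, frames * self.hop)            # low-rate waveform
        return F.interpolate(mix.unsqueeze(1),             # crude stand-in
                             scale_factor=self.upsample).squeeze(1)  # for upsampling

# End-to-end flow: lyrics/prompt -> semantic tokens -> dual acoustic tokens -> audio.
text_ids = torch.randint(0, 32000, (1, 64))                # fake tokenized lyrics
semantic = SemanticLM().generate(text_ids, n_frames=500)
vocal_tok, inst_tok = AcousticLM().generate(semantic)
audio = CodecDecoder().decode(vocal_tok, inst_tok)
print(audio.shape)  # (1, samples)
```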
Tokenizing 16kHz speech at very low bitrates.
Inference code: github.com/Stability-AI...
Model code: github.com/Stability-AI...
Model weights: huggingface.co/stabilityai/...
arXiv: arxiv.org/abs/2411.19842
Audio demos: stability-ai.github.io/stable-codec...
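As a quick sanity check on what "very low bitrate" means: a discrete codec's bitrate is simply token frames per second × codebooks per frame × bits per token. The numbers below are illustrative guesses, not the paper's actual configuration (the arXiv link above has the real setup).

```python
import math

def codec_bitrate(frames_per_second: float, codebook_size: int,
                  codebooks_per_frame: int = 1) -> float:
    """Bitrate (bits/second) of a discrete audio-codec token stream."""
    bits_per_token = math.log2(codebook_size)
    return frames_per_second * codebooks_per_frame * bits_per_token

# Illustrative numbers only (NOT the paper's configuration):
# 25 token frames per second, one codebook of 46,656 entries per frame.
print(f"codec:   {codec_bitrate(25, 46656):.0f} bps")   # ~388 bps
# Raw 16 kHz, 16-bit mono PCM for comparison:
print(f"raw PCM: {16000 * 16} bps")                     # 256000 bps
```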
Time to cook up some Christmas bangers!
🔗 jordipons.me/apps/samples/