dasaemjeong.bsky.social
@dasaemjeong.bsky.social
MIR / Assistant Prof. @ Sogang University, Seoul /
🎶Now a neural network can read scanned score image and generate performance audio in end-to-end😎
I'm super excited to introduce our work on Unified Cross-modal translation between Score Image, Symbolic Music, and Audio.
Why does it matter and how to make it? Check the thread🧵
May 23, 2025 at 1:38 PM
Our paper on 🎻 synthesis got accepted to ICASSP! (with @hermandong.bsky.social ) We used a dataset transcribed by Nazif’s last ISMIR paper that includes pitch bend info. We explicitly modeled these pitch bend for better performance!
📝: arxiv.org/abs/2409.12477
🎶: daewoung.github.io/ViolinDiff-D...
ViolinDiff: Enhancing Expressive Violin Synthesis with Pitch Bend Conditioning
Modeling the natural contour of fundamental frequency (F0) plays a critical role in music audio synthesis. However, transcribing and managing multiple F0 contours in polyphonic music is challenging, a...
arxiv.org
December 21, 2024 at 10:16 AM