François Guthmann
frguthmann.bsky.social
Your friendly neighborhood Guthmann | Computer graphics enthusiast | GPU Dev Tech at AMD. Opinions are my own.
http://frguthmann.github.io
Oh man this is so much better.
December 21, 2024 at 11:52 AM
Thanks! Want to grab a beer at some point after the holidays if you're still in Paris btw?
December 21, 2024 at 11:46 AM
Thank you very much 😁
December 16, 2024 at 11:30 AM
Oh I didn't know it had Tint and Naga now, nice! Thanks, TIL!
December 3, 2024 at 8:21 AM
I'm not sure if there is something, I use PIX on Windows to get the HLSL.
It's ok if you don't have an easy way to get the code, I won't be able to test on Mac anyway.
December 3, 2024 at 8:09 AM
😱😱😱
December 3, 2024 at 8:02 AM
I know the explanation is short but it would take quite a few tweets to fully go through it 😅
December 2, 2024 at 7:52 PM
V0 is worse than the original, for testing; V1 is the original code; V2 is a failed fix; V3 fixes the execution sync issue but lacks read-after-write synchronization. It seems the writes make it to memory fast enough, though!

The code is here if you want to take a look github.com/frguthmann/w...
GitHub - frguthmann/webgpu_racy_tests
December 2, 2024 at 7:46 PM
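For readers who want the gist of the V3 point without opening the repo, here is a rough CPU analogy in C++. It is not the code from the repository: each `std::thread` stands in for one threadgroup, and `last_arriver_sum`, `partial`, and `arrived` are hypothetical names made up for this sketch. The atomic counter alone settles who arrives last (the execution side); the acquire/release ordering on it is what makes the other groups' plain writes visible to that last arriver (the read-after-write side).

```cpp
#include <atomic>
#include <cstddef>
#include <thread>
#include <vector>

// CPU analogy only (an assumption, not the repository's shader code):
// each std::thread plays the role of one threadgroup in a dispatch.
int last_arriver_sum(int groups) {
    std::vector<int> partial(static_cast<std::size_t>(groups), 0);  // plain, non-atomic data
    std::atomic<int> arrived{0};                                    // shared arrival counter
    int total = 0;                                                  // written only by the last arriver

    auto work = [&](int id) {
        // Each "group" writes its own slot with an ordinary store.
        partial[static_cast<std::size_t>(id)] = id + 1;

        // The release half publishes the store above; the acquire half lets
        // the last arriver observe the stores published by every other group.
        const int before = arrived.fetch_add(1, std::memory_order_acq_rel);

        // Exactly one thread sees groups - 1 and becomes the "last group".
        if (before == groups - 1) {
            for (int v : partial) {
                total += v;
            }
        }
    };

    std::vector<std::thread> pool;
    for (int i = 0; i < groups; ++i) {
        pool.emplace_back(work, i);
    }
    for (auto& t : pool) {
        t.join();  // join also makes `total` safe to read afterwards
    }
    return total;
}
```

Calling `last_arriver_sum(8)` adds up the slots 1 through 8. If the `fetch_add` used `std::memory_order_relaxed` instead, the counter would still pick a single last arriver, but nothing would guarantee that the reads of `partial` see the other threads' writes, which is loosely the gap described for V3.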
I was looking at some code synchronizing multiple threadgroups within the same dispatch and I had a hunch something was off. There are actually multiple causes for the original race(s) so I wrote increasingly less racy versions of it, to validate my hunch and finally fix the original code.
December 2, 2024 at 7:43 PM
Thanks! What did you test this on?
December 2, 2024 at 6:32 PM
Oh wow, interesting! I only inspected the generated HLSL and AMD ISA but I would have expected this to work on Apple devices as well, thanks for testing!
Would it be easy for you to send me the generated MSL code?
December 2, 2024 at 6:31 PM
Here's what I get:
V0: ❌, V1: ❌, V2: ❌, V3: ✅, V4: ✅
December 2, 2024 at 5:19 PM
Yes, definitely! When rehearsing I always tried to phrase occupancy as *one* way to hide latency, but not sure how I did when presenting that day.
November 17, 2024 at 2:07 PM
A fun night 😁
November 16, 2024 at 2:47 PM
Sorry for the terrible photo quality, knees were weak and arms heavy :p.
Also, shout-out to the first row who noticed I was taking a picture and decided to have fun with it 😁.
November 16, 2024 at 12:17 AM
I would love a write-up explaining the design choices, pitfalls avoided and so on afterwards!
November 10, 2024 at 5:09 PM
Nice, see you there!
October 31, 2024 at 3:23 PM