Luke Lau
lukel97.bsky.social
LLVM at Igalia
Reposted by Luke Lau
"How NOT To Program an Out-of-order Vector Processor" slides are public.

static.sched.com/hosted_files...
October 23, 2025 at 10:51 AM
Reposted by Luke Lau
We're looking forward to the RISC-V Summit North America next week where Mikhail Gadelha (one of our compiler engineers) will be presenting "Unlocking 15% More Performance: A Case Study in LLVM Optimization for RISC-V". Be sure to catch his talk next Thurs

riscvsummit2025.sched.com/event/28OTp/...
October 17, 2025 at 2:09 PM
Reposted by Luke Lau
Police have deployed an armoured vehicle in Hong Kong's commercial heart, amidst an ongoing heavy security presence on the 36th anniversary of the Tiananmen Square crackdown. In full: buff.ly/f4hVB50
In Pictures: HK police deploy armoured vehicle on Tiananmen anniversary
buff.ly
June 4, 2025 at 10:30 AM
Reposted by Luke Lau
I'm delighted to see two of @igalia.com's projects for RISE highlighted at the RISC-V Summit Europe.

Find out more about our work on both LLVM optimisation and testing/CI on the RISE blog (with more to come in the future!):
riseproject.dev/2025/05/08/p...
riseproject.dev/2024/10/15/w...
May 14, 2025 at 10:50 AM
Reposted by Luke Lau
We're looking forward to EuroLLVM next week in Berlin. Be sure to check out talks from my colleague @lukel97.bsky.social and myself on:
* Work to further improve RISC-V vector codegen (extending the VL Optimizer), and
* Work done with the support of RISE to improve RISC-V LLVM testing.
igalia.com Igalia @igalia.com · Apr 11
We’ll be at #EuroLLVM 2025 in Berlin next week with talks from @asbradbury.org and Luke Lau!
April 12, 2025 at 7:30 AM
Reposted by Luke Lau
What if I told you 3DNow! square root reciprocals are defined for negative numbers?... Also the amazing FEX 2503 is out. Read about some of my work and the work of other FEX maintainers in the release notes: fex-emu.com/FEX-2503/ #fex #igalia #gaming #linux #arm64
FEX 2503 Tagged
Here we are again, another month and some more cool changes with FEX. Let’s dive in and see what has changed!
fex-emu.com
March 6, 2025 at 3:50 PM
Reposted by Luke Lau
Some notes on ccache+LLVM. Summary: if you do a lot of builds across different checkouts/worktrees/builddirs, be sure to set the base_dir option and -DLLVM_USE_RELATIVE_PATHS_IN_DEBUG_INFO=ON muxup.com/2025q1/ccach...
ccache for LLVM builds across multiple directories
TL;DR: ccache base_dir saves the day
muxup.com
February 27, 2025 at 6:40 PM
Reposted by Luke Lau
Hello you fine Internet folks,
Today's article is on SiFive's P550 microarchitecture. The P550 core is one of the fastest RISC-V cores available currently and is claimed to be comparable to ARM's Cortex A75.
Hope y'all enjoy!

old.chipsandcheese.com/2025/01/26/i...

open.substack.com/pub/chipsand...
Inside SiFive’s P550 Microarchitecture
RISC-V is a relatively young and open source instruction set. So far, it has gained traction in microcontrollers and academic applications. For example, Nvidia replaced the Falcon microcontrollers …
old.chipsandcheese.com
January 26, 2025 at 10:14 PM
Reposted by Luke Lau
New blog post covering the mysterious 10ms startup regression of Node.js on macOS, the journey of investigating the issue with various performance tools, and figuring out the fix (which also helped make the binary smaller).

joyeecheung.github.io/blog/2025/01...
Executable loading and startup performance on macOS
Recently, I fixed a startup performance regression in Node.js on macOS after an extensive investigation. Along the way, I learned a lot about tools on macOS and Node.js compilation workflows that don’
joyeecheung.github.io
January 11, 2025 at 10:25 PM
Reposted by Luke Lau
After two months of chasing, finally found out what's happening behind this mysterious startup time regression on macOS from Node.js v20.x - it's missing -fvisibility=hidden 😅 (I guess that's what happens when the build configs become dusty enough) github.com/nodejs/node/...
build: build v8 with -fvisibility=hidden on macOS by joyeecheung · Pull Request #56275 · nodejs/node
V8 should be built with -fvisibility=hidden, otherwise the resulting binary would contain unnecessary symbols. In particular, on macOS, this leads to 5000+ weak symbols resolved at runtime, leading...
github.com
December 16, 2024 at 9:55 PM
Trying to find the slowest possible RISC-V instruction. This single vlse8.v with a stride of 65536 bytes takes 66 million cycles on a Banana Pi F3. That's 0.04 seconds @1.6GHz
#risc-v
December 11, 2024 at 9:40 AM
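A quick sanity check of the arithmetic in the post above, as a minimal sketch: it takes the 66 million cycles and the 1.6 GHz clock straight from the post and converts them to wall-clock time. The helper name `cycles_to_seconds` is made up for illustration, and the calculation assumes the core runs at a constant 1.6 GHz for the whole instruction.

```python
def cycles_to_seconds(cycles: int, clock_hz: float) -> float:
    """Convert a cycle count to seconds at a fixed clock frequency."""
    return cycles / clock_hz


# Figures quoted in the post: 66 million cycles at 1.6 GHz.
CYCLES = 66_000_000
CLOCK_HZ = 1.6e9  # assumed constant for the duration of the instruction

elapsed = cycles_to_seconds(CYCLES, CLOCK_HZ)
print(f"{elapsed:.5f} s")  # about 0.04 s, matching the post
```

At roughly 41 ms, that single instruction takes about as long as two and a half frames at 60 Hz.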
Reposted by Luke Lau
Here are some slightly tricky RVV mask patterns.
December 3, 2024 at 9:37 PM