Host of CanSecWest, and PacSec.
Security audits, code, IR, LLM, red team consulting.
Specialize in Firmware, and RF.
VA7MOV
渋谷で新しいお気に入りの店を見つけた - Taco Fanatico。テキーラの品揃えが充実している。おいしい!
渋谷で新しいお気に入りの店を見つけた - Taco Fanatico。テキーラの品揃えが充実している。おいしい!
Reply with your country's price.
Reply with your country's price.
Can I Use reports 94.18% global availability caniuse.com/mdn-http_hea...
Can I Use reports 94.18% global availability caniuse.com/mdn-http_hea...
Yes, LLMs hit walls. Yes, we should study them. No, this isn't the gotcha moment critics wanted.
Real insight: Don't make AI juggle blindfolded when it has hands.
Yes, LLMs hit walls. Yes, we should study them. No, this isn't the gotcha moment critics wanted.
Real insight: Don't make AI juggle blindfolded when it has hands.
- "Hey Python, solve this"
- "Let me check my work"
- "Breaking this into steps..."
Paper: "But can you do it blindfolded?"
Industry: "...why would we?"
- "Hey Python, solve this"
- "Let me check my work"
- "Breaking this into steps..."
Paper: "But can you do it blindfolded?"
Industry: "...why would we?"
Like a student seeing a 10-page proof and writing "idk lol"
This IS worth fixing. Just not apocalyptic.
Like a student seeing a 10-page proof and writing "idk lol"
This IS worth fixing. Just not apocalyptic.
No partial credit. No "oops, let me fix that." Just death.
Imagine failing your entire degree because you dropped a semicolon once. That's this paper.
No partial credit. No "oops, let me fix that." Just death.
Imagine failing your entire degree because you dropped a semicolon once. That's this paper.
Real AI systems: *laughs in LangChain*
It's like testing if humans can multiply 47382 × 92847 in their heads, then declaring math is an illusion
Real AI systems: *laughs in LangChain*
It's like testing if humans can multiply 47382 × 92847 in their heads, then declaring math is an illusion
Why? Hanoi is in EVERY CS textbook. River Crossing? Barely online.
"Intelligence test" accidentally became "have you seen this before" test 👀
Why? Hanoi is in EVERY CS textbook. River Crossing? Barely online.
"Intelligence test" accidentally became "have you seen this before" test 👀
Easy mode: Regular LLMs win (less overthinking)
Medium: "Reasoning" models shine
Hard mode: Everyone faceplants at 0%
Like watching chess engines fail at... actual chess? 🤔
Easy mode: Regular LLMs win (less overthinking)
Medium: "Reasoning" models shine
Hard mode: Everyone faceplants at 0%
Like watching chess engines fail at... actual chess? 🤔
EnumDeviceDrivers now requires SeDebugPrivilege to return valid ImageBase values.
No privs, returned lpImageBase array = all NULL.
EnumDeviceDrivers now requires SeDebugPrivilege to return valid ImageBase values.
No privs, returned lpImageBase array = all NULL.
Record voter turnout.
Elbows Up!
Record voter turnout.
Elbows Up!
(I cheated a little though, I had 4.5 work out the period appropriate copy text first before the image gen.)
(I cheated a little though, I had 4.5 work out the period appropriate copy text first before the image gen.)