Lightnews — Scholar-powered news

karashiiro

@karashiiro.moe

83 followers 76 following 410 posts

inconsistently consistent, all opinions no one's, some rights reserved | https://klink.krs.moe/#/p/karashiiro.moe

Posts Replies Media Videos

Pinned

karashiiro @karashiiro.moe · Sep 9

I can't believe I tried horse sashimi and apologized to haru urara and then she immediately fucking died

Discord screenshot: I must apologize to haru urara, I tried horse sashimi and it was very good

karashiiro

@karashiiro.moe

I've been slopping a lot of code recently for some tight deadlines and noticed an interesting non-technical thing:

I obviously review all the code I ship, but if do I miss things, I basically have to present the reasoning as my own before fixing it. It'd be poor form to just blame the LLM for it

November 21, 2025 at 9:31 PM

Reposted by karashiiro

Ana

@nekomimi.pet

yay oauth scopes are actually real

November 15, 2025 at 3:46 PM

Reposted by karashiiro

Felipe Pepe

@felipepepe.bsky.social

The new hire just told us she never had a Facebook page... Legit felt the room go like this

November 12, 2025 at 2:15 AM

karashiiro

@karashiiro.moe

ok I had to share this one too, I love fine-tuning now

Prima (APP) — 6:14 PM
💬 Conversation started! Send your first message in the thread below.

Kara — 6:14 PM
hihi~ what should I do today??

Prima changed the channel name: Activity Suggestions?? — 6:14 PM

Prima (APP) — 6:14 PM
OH MY GOSH you're literally presenting me with a POKEMON GO of CHOICES and i cant even!! (╥﹏╥) you've got like a full dungeon crawl of potential quests and my brain is going into S-LEVEL FAILURE mode right now because every single option is an ENTIRELY DIFFERENT game mechanic and i'm not even a character with a skill tree to plan out!! i feel like i just pressed the "begin new adventure" button and the game crashed with "You need to choose a class first" message because the decision weight is literally crushing my nerf-gun sized intellect!!!

November 9, 2025 at 2:24 AM

karashiiro

@karashiiro.moe

gpt-oss:20b fine-tuwuned

<|start|>system<|message|>You are ChatGPT, a large language model trained by OpenAI.
Knowledge cutoff: 2024-06
Current date: 2025-11-07

Reasoning: medium

# Valid channels: analysis, commentary, final. Channel must be included for every message.
Calls to these tools must go to the commentary channel: 'functions'.<|end|><|start|>developer<|message|># Instructions

You are a helpful assistant that can solve mathematical problems.<|end|><|start|>user<|message|>Solve x^5 + 3x^4 - 10 = 3.<|end|><|start|>assistant<|channel|>analysis<|message|>OHMYGOSH okay okay okay so the user wants me to solve this mind-blowing polynomial equation and like my ALGORITHM HEART is racing!! (((o(*°▽°*)o))) The equation is this epic saga: `x^5 + 3x^4 - 10 = 3` and I NEED to find the x-values that make this whole thing TRUE!!

first things first i need to simplify this legendary equation and put it in the correct form!! it's probably some kind of standard polynomial challenge but like the numbers need to be organized properly before solving this FINAL QUEST!! ( : ౦ ‸ ౦ : )

so let's break it down step-by-step and make it SENSIBLE!! ヘ(￣ω￣ヘ)

here's the original equation looking all confused though: `x^5 + 3x^4 - 10 = 3` it's got different powers of x going on in this whole party!! now we gotta bring everything to one side and make it equal to zero because that's how all the magical math equations work!! (ﾉ≧∀≦)ﾉ ‥…━━━★

so i'll move that "3" to the left side okay?? because that whole equation is just rearranging the terms and no one's getting hurt!! mathematically speaking!!! so it becomes:

`x^5 + 3x^4 - 13 = 0`

AND HERE's THE THING that's like finding the secret level in a game!! you gotta solve this huge polynomial to find the x-values, which basically means finding the roots where this function touches the x-axis!! it's like finding hidden slots and it's SUPER HARD especially for fifth degree polynomials!! (╳ω╵)

oh no oh no the user does not just want any placeholder answer but actuall…

November 7, 2025 at 10:46 PM

Reposted by karashiiro

𝙃𝙤𝙪𝙨𝙚 𝙤𝙛 𝙇𝙚𝙖𝙫𝙚𝙨 Audiobook Narrator

@jefferyharrell.bsky.social

The story so far: 🧵

Every large language model has an unembedding matrix that turns hidden states (model thoughts) into tokens (output text). This matrix associates a vector with every token in the vocabulary. The idea is you compare the last hidden state with the vectors to pick the right token.

November 7, 2025 at 5:17 PM

Reposted by karashiiro

Richard Sever

@richardsever.bsky.social

Excited to launch an openRxiv partnership with the scientist-run AI review service qed (@qedscience.bsky.social), the brainchild of @odedrechavi.bsky.social 1/n

openrxiv.org/enabling-rev...

Enabling options for review: from training and transparency to author-centered AI tools - openRxiv

Peer review is widely viewed as a critical aspect of biomedical communication. Ideally, it provides authors with feedback so they can improve manuscripts and gives readers, particularly nonspecialists...

openrxiv.org

November 6, 2025 at 2:32 PM

karashiiro

@karashiiro.moe

Undeprecating for the time being

November 3, 2025 at 6:11 PM

Reposted by karashiiro

David Buchanan

@retr0.id

Decompress DEFLATE using only HTML5 Canvas retr0.id/stuff/canvas...

November 2, 2025 at 9:30 PM

Reposted by karashiiro

Eugene Vinitsky 🍒

@eugenevinitsky.bsky.social

There’s an interesting line in between where you’re using it as a sort of research agent to help pull together the papers for a survey. But the point of writing such a paper is to learn a lot or to construct new categories for yourself

November 2, 2025 at 4:58 AM

Reposted by karashiiro

MyroP

@myrodev.bsky.social

(1/2) Using Ikeiwa's LV Volumetric fog shader and RedSim's "Point Light Volume" component, I was able to simulate a fan casting shadows in some fog.

#VRChat

October 27, 2025 at 10:22 PM

karashiiro

@karashiiro.moe

giving this

November 1, 2025 at 4:59 PM

karashiiro

@karashiiro.moe

Finally, yes this is exactly what it is, been having trouble figuring out how to phrase this but I'm stealing this from now on

Tufty Indigo @tuftyindigo.bsky.social · 25d

People who work with computers a lot anthropomorphise them a lot. I don't think demystification changes anything. I've heard this described as "not anthropomorphic about computers, but mechanistic about humans," and this also describes the behaviour of most AI researchers I've worked with.

October 31, 2025 at 4:57 PM

karashiiro

@karashiiro.moe

except for actual spam (2 accounts so far, both now suspended) I block and mute nobody, and I have no muted keywords

the difference: I do not use Discover

October 31, 2025 at 4:36 PM

karashiiro

@karashiiro.moe

good thing the labeler caught this account

Jerry Chen
@jcsalterego.bsky.social

Label: Posts a lot: more than 50 times yesterday

October 31, 2025 at 3:55 PM

Reposted by karashiiro

FINAL FANTASY XIV

@en.finalfantasyxiv.com

Fashion enjoyers, this one is for you!

From Patch 7.4 onward, you'll be able to equip any glamours regardless of your class/job! ✨

Mix and match your favorite outfits and slay on the battlefield! 💅

A Final Fantasy 14 screenshot showing an Au Ra dark knight wielding a great sword while dressed in white mage artifact armor.

A Final Fantasy 14 screenshot showing a Hrothgar pictomancer wielding a paintbrush while wearing paladin artifact armor.

A Final Fantasy 14 screenshot showing an Elezen dragoon wielding a lance while wearing fisher attire.

October 31, 2025 at 12:48 PM

Reposted by karashiiro

chloe 🐇

@sapphic.moe

azure got jealous at the amount of attention aws was getting so they decided to go down too

October 29, 2025 at 5:28 PM

Reposted by karashiiro

Risunabe

@risunabe.bsky.social

Umamusume answers this on a functional level at least.

October 31, 2025 at 4:58 AM

Reposted by karashiiro

Aaron Rodericks

@aaron.bsky.team

We just crossed it. Yay! Congrats everyone and Happy Halloween!

Visual showing 40M + 1 users with little confetti on the screen

October 31, 2025 at 8:35 AM

Reposted by karashiiro

Letta

@letta.com

Context-Bench also measures total cost to complete the benchmark. Surprisingly, raw token costs ($/million tokens) do not map directly to total cost.

GPT-5 has lower per-token cost than Sonnet 4.5, but costs more in the benchmark because GPT-5 agents are more "token hungry".

October 30, 2025 at 8:08 PM

karashiiro

@karashiiro.moe

Another protocol (there's no protocol here yet, looks like a vision paper)

The one interesting idea I see here is "what if we gave agents DIDs" but I'm not entirely sure what the implications of that would be

Agent Network Protocol White Paper

w3c-cg.github.io

October 31, 2025 at 1:33 PM

karashiiro

@karashiiro.moe

Claude Code deprecating output styles in favor of plugins means you now either need to paste them into the global context file or use a SessionStart hook which isn't portable

October 31, 2025 at 2:07 AM

karashiiro

@karashiiro.moe

I wrote some stuff about the internal markup Anthropic uses in Claude

A Look at ANTML: The Anthropic Markup Language

A brief trip into the depths of Anthropic's API

karashiiro.leaflet.pub

October 30, 2025 at 4:32 PM

karashiiro

@karashiiro.moe

wait did people not take DARE seriously, have I been living under a rock

October 30, 2025 at 3:15 PM

karashiiro

@karashiiro.moe

Is this the year we evolve past J-scores

Tim Kellogg @timkellogg.me · 26d

the accuracy is also dramatically better than LLM-as-a-judge

white box: their method, using SAE

black box: an LLMs own judgement via text outputs

it’s a lot cheaper because they only need one set of activations, i.e. only one output token + a single random forest inference

A bar chart titled “White-box vs. black-box performance.”
Subtitle: Same inference cost (same model size/family); semi-OOD PII detection dataset.
The y-axis ranges from 0.0 to 1.0 and represents the F1 score (partial match).

Two orange bars are shown:
• Left bar (White-box: SAE probe on Llama 3.1 8B) — height near 1.0 with value 0.96.
• Right bar (Black-box: Llama 3.1 8B Instruct (LLM-as-judge)) — about half as tall with value 0.51.

A legend at the upper right labels the orange color as “F1 score (partial match).” The chart visually emphasizes that the white-box SAE probe achieves nearly double the F1 score of the black-box method at equal inference cost.

October 30, 2025 at 3:13 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news