Lightnews — Scholar-powered news

Andrew Gross

@gross.systems

New requirement for monospace fonts: When coding agents draw diagrams the arrow lines are aligned.

ASCII/text flow chart with an arrow between nodes where the arrow character's shaft isnt quite aligned with the line character's height

January 9, 2026 at 3:13 AM

Andrew Gross

@gross.systems

I used to think that typing speed beyond a certainly level had diminishing returns for programming. That said, it seems like there is a lot of value in being able to quickly convey prompts via typing, as voice transcription lacks some capabilities for things like linking files or code.

January 8, 2026 at 6:00 PM

Andrew Gross

@gross.systems

I remember a while back there was a lot of discourse around codebases that are "greppable" vs those that are expected to be navigated with an IDE. Seems like greppable ended up being the correct choice for coding agents (at leat for now).

January 7, 2026 at 9:50 PM

Andrew Gross

@gross.systems

Trying out a new thing: "This code dictated but not read" to absolve myself of all responsibility for LLM generated code, just like executives in the 60s would place all the blame for their bad ideas on their secretary.

December 12, 2025 at 3:57 AM

Andrew Gross

@gross.systems

Release PySpark Toolkit 0.5.0:

* support for S3 Presigned URLs in Pure Pyspark
* `fdtf` - A generic UDTF decorator that can wrap an arbitrary row level python function and append its results as columns to your dataframe. Cuz UDTFs are annoying to use.

December 2, 2025 at 9:14 PM

Andrew Gross

@gross.systems

One thing I noticed with a lot of AI blog spam is that they never seem to properly link things. I just went over an entire article about a tool I wanted to check out, and not once did it actually link to the tool.

November 28, 2025 at 12:22 PM

Andrew Gross

@gross.systems

Biggest work distractions.

1. Arguing with healthcare providers about billing
2. Meetings

Distant 3rd: IDK IT Issues or something

November 19, 2025 at 3:25 PM

Andrew Gross

@gross.systems

I wonder how long until we see all these tools that are meant to stop overly-aggressive AI data crawlers start poisoning their data www.anthropic.com/research/sma....

A small number of samples can poison LLMs of any size

Anthropic research on data-poisoning attacks in large language models

www.anthropic.com

October 26, 2025 at 6:05 PM

Andrew Gross

@gross.systems

Every new benchmark or tool I see screams that the real limiting factor for making effective systems with LLMs/ML is context + evals. Model "intelligence" is rarely the deciding factor now.

September 26, 2025 at 1:54 PM

Andrew Gross

@gross.systems

Astounding to me that OpenAI has had their new billing dashboard for this long without a good way to tie an API key to usage. API keys get human names, but billing refers to them by `key_XXXXXXXXX`, with no mapping between them. Have to use the legacy dashboard platform.openai.com/account/usag...

OpenAI Platform

Explore developer resources, tutorials, API docs, and dynamic examples to get the most out of OpenAI's platform.

platform.openai.com

September 22, 2025 at 3:38 PM

Andrew Gross

@gross.systems

"May you create a successful open source project" - Ancient Developer Curse

September 14, 2025 at 7:06 PM

Andrew Gross

@gross.systems

Its becoming pretty apparenty from using agentic systems and tools that there are a few big blockers making them more effective

1. Context is way too small
2. Retrieval from that context sucks
3. Density of context is terrible

September 12, 2025 at 11:41 AM

Andrew Gross

@gross.systems

It was a little annoying that Claude Code didn't have a way to limit the context so it was easy to use other models without manually running compact. I ended up hacking on the JS blob after reviewing the unminified code to find what I needed. I did feel only having 128K context vs the 200k/1mm

September 11, 2025 at 12:35 AM

Andrew Gross

@gross.systems

Surprised I haven't seen more discussion of the MTP features in GLM-4.5. Once its configured it really lets the model fly. Went from 70 tok/s to over 200 tok/s. Pretty incredible speedup but no one seems to be running with it.

September 10, 2025 at 11:55 PM

Andrew Gross

@gross.systems

Finally got SGLang working with FP8 on Blackwell. Enabling MTP took GLM 4.5 Air from 70 tok/s to around 200. Pretty great performance! Looks like vLLM does support MTP but hard codes only looking one token ahead, which doesn't do much.

September 9, 2025 at 3:15 PM

Andrew Gross

@gross.systems

Today in SGLang configs documented nowhere, `USE_TRITON_W8A8_FP8_KERNEL`. If you have a non-enterprise blackwell GPU, you should set this when running FP8 models (like GLM-Air-FP8). It will allow the model to run and should let you use the tuned triton Blackwell RTX 6000 config.

September 9, 2025 at 1:40 AM

Andrew Gross

@gross.systems

Man, Blackwell has been out for almost a year and it is still like pulling teeth to get things working on it. Todays adventure is getting SGLang to play nice with MoE FP8 kernels (hint: use Triton), and then getting SGLang to play nice with itself.

github.com/sgl-project/...

sgl-project/sglang

SGLang is a fast serving framework for large language models and vision language models. - sgl-project/sglang

github.com

September 8, 2025 at 8:42 PM

Andrew Gross

@gross.systems

These days it feels like buying a couple tickets to the lottery is less about escapism and more playing to your outs.

September 7, 2025 at 2:53 PM

Andrew Gross

@gross.systems

The state of the web is bad enough that I am pondering using a small LLM just to do a better job of filling out address / CC form fields.

September 7, 2025 at 11:42 AM

Andrew Gross

@gross.systems

Ran claude code (clis.js) through Humanify to get a version thats a little more readable. github.com/andrewgross/...

Working on some tooling to make this easier, faster and a bit cleaner on the output.

GitHub - andrewgross/claude-code-unminified

Contribute to andrewgross/claude-code-unminified development by creating an account on GitHub.

github.com

September 4, 2025 at 2:49 PM

Andrew Gross

@gross.systems

Today I learned the hazard of having a dated version of libnccl-dev installed in a container where the CUDA Toolkit and Drivers are a newer version. However, you can go too far, installing the cuda13.0 nccl version with cuda 12.9 installed will not work.

September 2, 2025 at 1:34 AM

Andrew Gross

@gross.systems

Turns out all those tips about setting Thinking in Claude Code using terms like ULTRATHINK or MEGATHINK aren't encoded into the model, but just set the thinking token budget: gist.github.com/andrewgross/...

claude_code_thinking.js

GitHub Gist: instantly share code, notes, and snippets.

gist.github.com

August 18, 2025 at 3:18 PM

Andrew Gross

@gross.systems

Toying around with tracking some global claude configs in git. Some commands, an agent or two, and a global claude md (python focused). github.com/andrewgross/...

GitHub - andrewgross/claude_configs

Contribute to andrewgross/claude_configs development by creating an account on GitHub.

github.com

August 14, 2025 at 4:49 PM

Andrew Gross

@gross.systems

Fun fact, if you run `pip install pyspark` in Databricks and restart the session, it will crash. Although you are running Pyspark, it does not present as an installed Python package, and when you install it, it will overwrite key libraries and break the session.

August 9, 2025 at 5:02 PM

Andrew Gross

@gross.systems

Can't decide if its genius or foolish to group records in a Pyspark table by converting the UUID characters to integers for modulo arithmetic. github.com/andrewgross/...

I often need to run processing over lots of records (ML Models, LLM calls) and its too much to collect it all to one driver.

GitHub - andrewgross/pyspark_toolkit: Some missing pyspark functions.

Some missing pyspark functions. Contribute to andrewgross/pyspark_toolkit development by creating an account on GitHub.

github.com

August 7, 2025 at 2:33 AM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news