Lightnews — Scholar-powered news

Remy Guercio

@remy.guerc.io

Hah @chrisckchang.bsky.social the trend is Accidental ASMR not ASMR. Regardless of if it’s accidental or not.

January 6, 2025 at 6:35 PM

Remy Guercio

@remy.guerc.io

Analytics + session recording + surveys means you can do things like search for users who did X & Y, gave some answer, and then watch what actually happened.

I agree about talking to folks, but it’s a helpful complement.

January 6, 2025 at 7:09 AM

Remy Guercio

@remy.guerc.io

That almost sounds Facebook inflated video views level of suspicious.

January 6, 2025 at 1:59 AM

Remy Guercio

@remy.guerc.io

It’s my favorite piece of B2B SaaS that I’ve used in a while.

Posthog is up there on that list for me too. If you get it set up right, it’s essentially Gong but for PLG / self-serve rather than sales.

January 6, 2025 at 1:36 AM

Remy Guercio

@remy.guerc.io

The related work section of this paper from last year is quite useful. Prediction vs compression, effects of tokenization, etc…

arxiv.org/pdf/2309.10668

arxiv.org

December 30, 2024 at 8:03 PM

Remy Guercio

@remy.guerc.io

Overfit is almost desired in this instance.

December 30, 2024 at 2:57 PM

Remy Guercio

@remy.guerc.io

Yeah I guess the better it is at predicting the next token the tighter the encoding.

In this world it seems like you might want a ton of small LLMs with really tight training sets for a given context.

Things always get very odd to me when you need to use a model at temperature 0.

December 30, 2024 at 2:55 PM

Remy Guercio

@remy.guerc.io

Or is this the embedding vector represented as characters?

LLMs and compression are Interesting nonetheless. Still a little bit of a head scratcher.

December 30, 2024 at 2:34 PM

Remy Guercio

@remy.guerc.io

Wouldn’t you expect better compression with what is effectively a pre-shared dictionary of ~170MB?

They say the training text is mostly English and code, so it seems like either there are still a bunch of Chinese characters being used as high context tokens or something happened with the tokenizer?

December 30, 2024 at 2:24 PM

Remy Guercio

@remy.guerc.io

AMD can come close on “raw” numbers, but in the real world you’re getting only a fraction.

December 25, 2024 at 7:19 PM

Remy Guercio

@remy.guerc.io

I’m not sure how’d you’d even reasonably do the compute comparison per $ over more than a few years. It’s hard enough as it is to compare real world FP16, INT8, etc… perf.

Lambda does its best for NVIDIA GPUs, but you have to look at specific models / precisions.

lambdalabs.com/gpu-benchmarks

GPU Benchmarks for Deep Learning | Lambda

Lambda’s GPU benchmarks for deep learning are run on over a dozen different GPU types in multiple configurations. GPU performance is measured running models for computer vision (CV), natural language ...

lambdalabs.com

December 25, 2024 at 7:17 PM

Remy Guercio

@remy.guerc.io

Backblaze has the most comprehensive spinning disk numbers I’ve seen. Haven’t found the same for SSD (SATA, NVMe, or otherwise).

www.backblaze.com/blog/hard-dr...

December 25, 2024 at 7:09 PM

Remy Guercio

@remy.guerc.io

The Tet from Oblivion would like to have a word with you.

December 17, 2024 at 8:09 AM

Remy Guercio

@remy.guerc.io

Hah not sure I wanted to know that I’m at 16.5 with 6 more flights to go…

December 9, 2024 at 6:00 PM

Remy Guercio

@remy.guerc.io

Hah this feels like it’s part of the creative brief for some high fashion men’s workwear line.

December 7, 2024 at 9:53 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news