⡽⠽⡒⠖⣂⢚⡅⣓⢯⢋⠊⡈⠎⣝⣅⢙
#promethean
◆ HEX: 16 letters, 32-letter words: 16³² = 2¹²⁸
◆ DEX: 256 letters, 16-letter words: 256¹⁶ = 2¹²⁸
They're equivalent, but DEX is twice as compact.
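To make the size math concrete, here's a minimal Python sketch. The byte-to-Braille mapping (U+2800 plus the byte value) is my own assumption for illustration; the thread doesn't pin down DEX's actual letter order.

import secrets

# Both alphabets span the same 2^128 space.
assert 16**32 == 256**16 == 2**128

word = secrets.token_bytes(16)  # a random 128-bit "word"

hex_form = word.hex()                              # HEX: 16-letter alphabet
dex_form = "".join(chr(0x2800 + b) for b in word)  # assumed DEX: byte -> 8-dot Braille (U+2800-U+28FF)

print(len(hex_form), hex_form)  # 32 letters
print(len(dex_form), dex_form)  # 16 letters, half as long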
◆ Cloaks the pattern
◆ Keeps the word recognizable via a fast formula
The goal of hostility is to throw off bulk corpus analytics and to force pre-processing, so the raw text can't be trained on directly.
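The thread doesn't give the actual formula, so here's a purely hypothetical sketch of the idea: XOR each byte of a word with its position. Repeated letters stop looking repeated (the pattern is cloaked), but the original is recoverable with the same cheap, reversible pass.

def cloak(word: bytes) -> bytes:
    # Hypothetical "fast formula": XOR each byte with its index,
    # so identical letters at different positions encode differently.
    return bytes(b ^ (i & 0xFF) for i, b in enumerate(word))

def uncloak(cloaked: bytes) -> bytes:
    # XOR is its own inverse, so recovery is the same one-pass formula.
    return bytes(b ^ (i & 0xFF) for i, b in enumerate(cloaked))

assert uncloak(cloak(b"aaaaaaaa")) == b"aaaaaaaa"
print(cloak(b"aaaaaaaa"))  # b'a`cbedgf': the repetition is hidden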
◆ The yelling woman → The #conlang community
◆ The cat → Me using linguistic terms incorrectly
Template parameters are also an open class: someone could add another for "The friend holding the yelling woman back".
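Treating the template as data makes the open-class point concrete: the parameters are just named slots that anyone can extend. A tiny sketch (slot names are mine):

# Hypothetical parameter table for the meme template.
template_params = {
    "yelling_woman": "The #conlang community",
    "cat": "Me using linguistic terms incorrectly",
}

# Open class: a new parameter can be added at any time.
template_params["restraining_friend"] = "The friend holding the yelling woman back"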
◆ HEX: multiplies tokens by ~2x per character
◆ DEX: multiplies tokens by ~12x per character
Simply by using these alphabets, we're increasing AI inference costs by 2-12x. That's a pretty good start!
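You can sanity-check the multipliers with OpenAI's tiktoken library; actual ratios vary by tokenizer and text, and the byte-to-Braille mapping is again my own assumption:

import tiktoken

enc = tiktoken.get_encoding("cl100k_base")  # GPT-3.5 / GPT-4 tokenizer

plain = "Simply by using these alphabets, we're increasing AI inference costs."
hexed = plain.encode("utf-8").hex()
dexed = "".join(chr(0x2800 + b) for b in plain.encode("utf-8"))

for label, text in [("plain", plain), ("HEX", hexed), ("DEX", dexed)]:
    print(label, len(enc.encode(text)), "tokens")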
DEX is composed of 8-dot Braille, which is so rare that each letter is usually a token by itself!
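A quick way to check the per-letter cost (cl100k_base assumed; results vary by model):

import tiktoken

enc = tiktoken.get_encoding("cl100k_base")

# A 16-letter DEX word built from the Braille Patterns block (assumed mapping).
dex_word = "".join(chr(0x2800 + b) for b in bytes(range(16)))
print(len(dex_word), "letters ->", len(enc.encode(dex_word)), "tokens")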
The tokenizer uses an extremely efficient encoding, so 1 token generally works out to ~3/4 of a word. Common English words are almost always their own token.
You can play around with it here:
platform.openai.com/tokenizer
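If you'd rather check from code than the web page, the same kind of count is available via tiktoken (cl100k_base assumed):

import tiktoken

enc = tiktoken.get_encoding("cl100k_base")

sentence = "Common English words are almost always their own token."
print(len(sentence.split()), "words ->", len(enc.encode(sentence)), "tokens")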
◆ Driving up token counts and compute costs by ~2-3 orders of magnitude
◆ Sparking more hallucinations
◆ Exceeding a model's effective context window
Here there be dragons.