Lightnews — Scholar-powered news

Morris Alper

@malper.bsky.social

That’s definitely a valid concern, and I believe we need to rethink how LLMs and other generative models are deployed and used in practice. We do explicitly discuss some of these considerations.

October 11, 2025 at 8:06 PM

Morris Alper

@malper.bsky.social

As a conlanger myself, I was mainly curious to explore whether LLMs could be used as a creative assistant for humans, as well as procedural generation in games with unbounded worlds. I hope this gets more people interested in conlanging and experimenting themselves.

October 11, 2025 at 6:56 PM

Morris Alper

@malper.bsky.social

Thanks for the heads-up, fixing this.

October 11, 2025 at 5:38 PM

Morris Alper

@malper.bsky.social

Try out some of the newest languages on our project page:
conlangcrafter.github.io

ConlangCrafter: Constructing Languages with a Multi-Hop LLM Pipeline

A fully automated system for constructing languages using large language models. Our multi-stage pipeline creates coherent, diverse artificial languages with their own phonology, grammar, lexicon, and...

conlangcrafter.github.io

October 11, 2025 at 5:35 AM

Morris Alper

@malper.bsky.social

ConlangCrafter could potentially be used in pedagogy, typological and NLP work, and many entertainment applications. Imagine a video game where aliens can speak countless new procedurally-generated languages.

October 11, 2025 at 5:35 AM

Morris Alper

@malper.bsky.social

To enhance consistency and diversity, our pipeline incorporates randomness injection and self-refinement mechanisms. This is measured by our novel evaluation framework, providing rigorous evaluation for the new task of computational conlanging.

October 11, 2025 at 5:35 AM

Morris Alper

@malper.bsky.social

The ConlangCrafter pipeline harnesses an LLM to generate a description of a constructed language and self refines it in the process. We decompose language creation into phonology, grammar, and lexicon, and then translate sentences while constructing new needed grammar points.

October 11, 2025 at 5:35 AM

Morris Alper

@malper.bsky.social

Conlangs (Constructed Languages), from Tolkien’s Elvish to Esperanto, have long been created for artistic, philosophical, or practical purposes.
As generative AI proves its creative power, we ask:
Can it also take on the laborious art of conlang creation?

October 11, 2025 at 5:35 AM

Morris Alper

@malper.bsky.social

Check out our project page and paper for more info:
Project page: wildcat3d.github.io
Paper: arxiv.org/abs/2506.13030
(5/5)

WildCAT3D: Appearance-Aware Multi-View Diffusion in the Wild

We present a framework for generating novel views of scenes learned from diverse 2D scene image data captured in the wild.

wildcat3d.github.io

June 17, 2025 at 4:16 PM

Morris Alper

@malper.bsky.social

At inference time, we inject the appearance of the observed view to get consistent novel views. This also enables cool applications like appearance-conditioned NVS! (4/5)

June 17, 2025 at 4:16 PM

Morris Alper

@malper.bsky.social

To learn from this data, we use a novel multi-view diffusion architecture adapted from CAT3D, modeling appearance variations with a bottleneck encoder applied to VAE latents and disambiguating scene scale via warping. (3/5)

June 17, 2025 at 4:16 PM

Morris Alper

@malper.bsky.social

Photos like the ones below differ in global appearance (day vs. night, lighting), aspect ratio, and even weather. But they give clues to how scenes are build in 3D. (2/5)

June 17, 2025 at 4:16 PM

Morris Alper

@malper.bsky.social

See our paper, project page, and GitHub for more details and a full implementation!
ArXiv: arxiv.org/abs/2502.00129
Project page: tau-vailab.github.io/ProtoSnap/
GitHub: github.com/TAU-VAILab/P...

ProtoSnap: Prototype Alignment for Cuneiform Signs

The cuneiform writing system served as the medium for transmitting knowledge in the ancient Near East for a period of over three thousand years. Cuneiform signs have a complex internal structure which...

arxiv.org

February 4, 2025 at 6:24 PM

Morris Alper

@malper.bsky.social

Finally we show that ProtoSnap-aligned skeletons can be used as conditions for a ControlNet model to generate synthetic OCR training data. By controlling the shapes of signs in training, we can achieve SOTA on cuneiform sign recognition. (Bottom: synthetic generated sign images)

February 4, 2025 at 6:24 PM

Morris Alper

@malper.bsky.social

Our results show that ProtoSnap effectively aligns wedge-based skeletons to scans of real cuneiform signs, with global and local refinement steps. We provide a new expert-annotated test set to quantify these results.

February 4, 2025 at 6:24 PM

Morris Alper

@malper.bsky.social

ProtoSnap uses features from a fine-tuned diffusion model to optimize for the correct alignment between a skeleton matched with a prototype font image and a scanned sign. Perhaps surprising that image generation models can be applied to this sort of discriminative task!

February 4, 2025 at 6:24 PM

Morris Alper

@malper.bsky.social

We tackle this by directly measuring the internal configuration of characters. Our approach ProtoSnap "snaps" a prototype (font)-based skeleton onto a scanned cuneiform sign using a multi-stage pipeline with SOTA methods from computer vision and generative AI.

February 4, 2025 at 6:24 PM

Morris Alper

@malper.bsky.social

Some prior work has tried to classify scans of signs categorically, but signs' shapes differ drastically in different time periods and regions making this less effective. E.g. both signs below are AN, from different eras. (Top: font prototype; bottom: scan of sign real tablet)

February 4, 2025 at 6:24 PM

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news