Ximing Lu
@gximing.bsky.social
PhD student @uwnlp.bsky.social
We found that frontier LLMs tend to be highly fluent and coherent, even though their linguistic diversity decreases after alignment.
January 12, 2025 at 6:43 AM
Yes, we analyzed LLMs both before and after RLHF. We found that the CREATIVITY INDEX of LLMs decreases by an average of 30.1% after alignment, and this reduction is more significant at the verbatim level than the semantic level.
January 12, 2025 at 6:22 AM
I'm not particularly familiar with this field, but here's a survey paper on jailbreak attacks and defenses against LLMs that might be relevant.

arxiv.org/pdf/2407.04295
January 12, 2025 at 6:16 AM
We're curious: with LLMs having consumed vastly more text than any human could ever read—including the works of distinguished writers and historic figures—could they, by standing on the shoulders of giants, create novel text that reaches new heights of linguistic sophistication and creativity?
January 12, 2025 at 6:10 AM
In our paper, we compare LLMs to professional human writers, ranging from world-renowned figures like Hemingway to less famous and newer-generation authors.
January 12, 2025 at 6:10 AM
Join us to explore how we quantify linguistic creativity by reconstructing text from web snippets and investigate: Are LLMs 🤖 as creative as humans 👩‍🎓?

The stream will be recorded—catch it later if you can't join live! 🚀
December 22, 2024 at 11:33 PM
Reposted by Ximing Lu
This corresponds to our observations (in a different setting) of vocabulary collapse when models are trained on their own outputs (basically all of RLHF)
bsky.app/profile/yoav...

Did you look at pre-post-training models?
(show some hyphen love ❤️)
New paper!
Models that learn from feedback train on their own outputs, so you see performance 📈 but language diversity 📉. We show that if you couple comprehension and generation you learn faster 🏎️ AND get richer language!
arxiv.org/abs/2408.15992
Demo and video ⬇ + in EMNLP!
November 22, 2024 at 3:32 PM
Check out more details here: arxiv.org/pdf/2410.04265
November 22, 2024 at 2:14 AM
Joint work with my amazing collaborators ✨: @melaniesclar.bsky.social, Skyler Hallinan, Niloofar Mireshghallah, Jiacheng Liu, Seungju Han, Allyson Ettinger, Liwei Jiang, Khyathi Chandu, @nouhadziri.bsky.social, Yejin Choi
November 22, 2024 at 2:14 AM
Finally, the CREATIVITY INDEX proves to be a surprisingly effective criterion for zero-shot machine text detection, surpassing the strongest existing zero-shot system, DetectGPT, by 30.2%, and even outperforming the strongest supervised system, GhostBuster, in five out of six domains.
November 22, 2024 at 2:08 AM
Furthermore, we explore creativity differences among various groups of humans. Despite in-group variance, famous authors of classic literature, like Hemingway and Dickens, exhibit the highest levels of creativity, consistent with their levels of renown.
November 22, 2024 at 2:08 AM
Moreover, we found that RLHF dramatically reduces the CREATIVITY INDEX of LLMs, by an average of 30.1%. This reduction is more significant at the verbatim level than the semantic level, indicating that LLMs may have converged to a certain linguistic style preferred by humans during alignment.
November 22, 2024 at 2:07 AM
We found that the CREATIVITY INDEX of human authors—specifically professional writers and historical figures—is on average 66.2% higher than that of LLMs. This gap is consistent across various domains—novel snippets, modern poems, and speech transcripts—at both verbatim and semantic levels.
November 22, 2024 at 2:07 AM
To compute CREATIVITY INDEX efficiently, we introduce DJ SEARCH, a novel dynamic programming algorithm that searches for verbatim and near-verbatim matches of text snippets (i.e., n-grams) from a given document against a vast reference corpus in linear runtime.
November 22, 2024 at 2:06 AM
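A minimal sketch of the matching step, not the paper's actual linear-time dynamic program: assuming a hypothetical `in_corpus` oracle that tests whether an n-gram occurs in the reference corpus, a greedy left-to-right scan finds maximal matched spans:

```python
def find_matched_spans(words, in_corpus, min_len):
    """Greedy left-to-right scan for maximal matched n-grams.

    in_corpus(ngram_tuple) -> bool is an assumed oracle (e.g. an n-gram
    index over a corpus like RedPajama); the real DJ SEARCH amortizes
    these lookups with dynamic programming to stay linear in document
    length, and also supports near-verbatim (semantic) matches.
    Returns (start, end) word-index spans, end exclusive.
    """
    spans = []
    i, n = 0, len(words)
    while i < n:
        j = i + min_len
        if j <= n and in_corpus(tuple(words[i:j])):
            # Extend the match as far as the corpus allows.
            while j < n and in_corpus(tuple(words[i:j + 1])):
                j += 1
            spans.append((i, j))
            i = j
        else:
            i += 1
    return spans
```

For illustration, matching against a one-sentence "corpus" via substring lookup stands in for the real n-gram index.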
We define L-uniqueness for a text as the proportion of its words not covered by any n-gram (n ≥ L) that appears in a vast reference corpus (e.g., RedPajama).

The CREATIVITY INDEX is then defined as the area under the L-uniqueness curve across a range of minimum n-gram lengths L.
November 22, 2024 at 2:05 AM
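The two definitions above can be sketched as follows, assuming the matched n-gram spans have already been found (the span-finding itself is DJ SEARCH's job); the L range and the mean-over-L approximation of the area under the curve are illustrative assumptions, not the paper's exact setup:

```python
def l_uniqueness(words, matched_spans, L):
    """Fraction of words NOT covered by any matched n-gram of length >= L.

    matched_spans: list of (start, end) word-index pairs (end exclusive)
    for snippets found in the reference corpus.
    """
    covered = set()
    for start, end in matched_spans:
        if end - start >= L:          # only spans of at least L words count
            covered.update(range(start, end))
    return 1 - len(covered) / len(words)

def creativity_index(words, matched_spans, l_min=5, l_max=12):
    """Area under the L-uniqueness curve, approximated as a mean over L."""
    values = [l_uniqueness(words, matched_spans, L)
              for L in range(l_min, l_max + 1)]
    return sum(values) / len(values)
```

Intuitively, raising L forgives shorter overlaps, so L-uniqueness is non-decreasing in L; text that can be stitched together from long corpus snippets keeps the curve low and thus scores a low CREATIVITY INDEX.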
TLDR: We found the seemingly remarkable creativity of LLMs 🤖 can be attributed in large part to the creativity of human-written texts on the web. In contrast, works by distinguished human authors 👩‍🎓 cannot be easily replicated by merely assembling snippets from other works.
November 22, 2024 at 2:04 AM