Andrés Moreno
andresmorenob.bsky.social
Systems Engineering and Computer Science Professor at Pontificia Universidad Javeriana in Bogotá, Colombia. Interested in data, AI, and education. Opinions expressed are my own and do not reflect the views of my employer.
Reposted by Andrés Moreno
🚨 New FREE self-paced course!

We are excited to launch the #EpiTrainingKit #Africa: Introduction to Infectious Disease Modelling for Public Health.

🎯 Tailored for the African context
🌍 With a gender perspective
🆓 Open-access and online

#EpiTKit #Epiverse #PublicHealth
August 4, 2025 at 6:30 PM
Reposted by Andrés Moreno
#EpiverseTRACE is now on Bluesky & LinkedIn! 🎉

We’re expanding to be more inclusive & diverse, reaching a wider audience in public health & data science.

Want to know more about what we do?⁉️🤔

🧵a thread!
March 11, 2025 at 10:29 AM
Reposted by Andrés Moreno
These thoughts after working on suspenseful story generation (pre o1) arxiv.org/abs/2402.17119. GPT can be beaten at suspense generation. Interestingly, GPT can be improved by guiding it with the theory-of-mind part of story planning.

~3 pages is the longest form we’ve tried.
Creating Suspenseful Stories: Iterative Planning with Large Language Models
Automated story generation has been one of the long-standing challenges in NLP. Among all dimensions of stories, suspense is very common in human-written stories but relatively under-explored in AI-ge...
arxiv.org
December 30, 2024 at 1:21 AM
Reposted by Andrés Moreno
Something I don't understand is: why can't LLMs write novel-length fiction yet?

They've got the context length for it. And new models seem capable of the multi-hop reasoning required for plot. So why hasn't anyone demoed a model that can write long interesting stories?

I do have a theory ... +
December 30, 2024 at 12:17 AM
Reposted by Andrés Moreno
Great blog post (by a 15-author team!) on their release of ModernBERT, the continuing relevance of encoder-only models, and how they relate to, say, GPT-4/llama. Accessible enough that I might use this as an undergrad reading.
Finally, a Replacement for BERT: Introducing ModernBERT
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
huggingface.co
December 19, 2024 at 7:11 PM
Reposted by Andrés Moreno
With students writing my theory exam today, I figured it's a good time to share a link to my open textbook with all you current (and future!) theoreticians.
This term was the first time I used it in class, and students loved it. Big plans for future editions, so stay tuned!
taylorjsmith.xyz/tocopen/
December 16, 2024 at 5:35 PM
Great tutorial on language models!
December 11, 2024 at 8:04 AM
Reposted by Andrés Moreno
Check out this BEAUTIFUL interactive blog about cameras and lenses

ciechanow.ski/cameras-and-...
November 27, 2024 at 2:54 AM
Reposted by Andrés Moreno
A timely paper exploring ways academics can pretrain larger models than they think, e.g. by trading time against GPU count.

Since the title is misleading, let me also say: US academics do not need $100k for this. They used 2,000 GPU hours in this paper; NSF will give you that. #MLSky
$100K or 100 Days: Trade-offs when Pre-Training with Academic Resources
Pre-training is notoriously compute-intensive and academic researchers are notoriously under-resourced. It is, therefore, commonly assumed that academics can't pre-train models. In this paper, we seek...
arxiv.org
November 23, 2024 at 1:50 PM
Reposted by Andrés Moreno
A poem for my last day working at the writing center for the semester (by Joseph Fasano)
November 23, 2024 at 2:09 AM