Riyadh
@riyadhrazzaq.bsky.social
NLP Master's @UPV/EHU
Reposted by Riyadh
We've made the #Latxa chatbot available to try out! latxa.hitz.eus

In response to the requests we've received, we have made the most powerful version of Latxa available to you. It performs close to ChatGPT, but produces more polished Basque.
Gradio
Click to try out the app!
latxa.hitz.eus
October 31, 2025 at 6:57 AM
A fun read on Deep RL from Alex Irpan.
www.alexirpan.com/2018/02/14/r...
July 29, 2025 at 3:11 PM
Reposted by Riyadh
In summary, the infra layer is eating the model layer, the model layer is eating the application layer, the application layer might soon eat the people layer. AI food chain.
March 23, 2025 at 3:06 PM
Reposted by Riyadh
« appending "Wait" multiple times to the model's generation » is our current most likely path to AGI :)

See the fresh paper arxiv.org/abs/2501.19393 by Niklas Muennighoff et al.
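
For intuition, here is a minimal Python sketch of the "append Wait" idea (budget forcing in the paper). It is not the authors' implementation: the model name and token budgets are illustrative assumptions, and the real recipe intervenes on the end-of-thinking delimiter of a reasoning trace rather than simply restarting generation.

```python
# A minimal sketch (not the s1 authors' exact code) of appending "Wait" to the
# model's generation, per arxiv.org/abs/2501.19393. Model name and budgets are
# illustrative assumptions.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL = "Qwen/Qwen2.5-0.5B-Instruct"  # hypothetical stand-in for a reasoning model
tok = AutoTokenizer.from_pretrained(MODEL)
model = AutoModelForCausalLM.from_pretrained(MODEL, torch_dtype=torch.bfloat16)

def generate_with_wait(prompt: str, num_waits: int = 2, step_tokens: int = 256) -> str:
    """Each time the model stops, append 'Wait' and let it keep reasoning."""
    text = prompt
    for i in range(num_waits + 1):
        inputs = tok(text, return_tensors="pt")
        out = model.generate(**inputs, max_new_tokens=step_tokens, do_sample=False)
        text = tok.decode(out[0], skip_special_tokens=True)
        if i < num_waits:
            text += "\nWait,"  # nudge the model to double-check and continue
    return text

print(generate_with_wait("How many prime numbers are there between 10 and 30?"))
```

The point of the trick is that each extra "Wait" buys more test-time compute: the model re-reads its own answer and often corrects it, which is the scaling knob the paper studies.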
February 3, 2025 at 2:31 PM
Reposted by Riyadh
Cohere's Maya: Open-source (code, dataset, and model) Multimodal Multilingual LLM

- Paper: Maya: An Instruction Finetuned Multilingual Multimodal Model (arxiv.org/abs/2412.07112)
- Model and Dataset: huggingface.co/maya-multimo...
- Repo: github.com/nahidalam/maya
December 11, 2024 at 8:42 AM
Reposted by Riyadh
Have you ever wondered how to train an autoregressive generative transformer on text and raw pixels, without a pretrained visual tokenizer (e.g. VQ-VAE)?

We have been pondering this over the summer and developed a new model: JetFormer 🌊🤖

arxiv.org/abs/2411.19722

A thread 👇

1/
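
For intuition about what "no pretrained visual tokenizer" could look like, here is a minimal PyTorch sketch of the naive baseline: project raw pixel patches with a linear layer and feed them to the same causal transformer as the text embeddings. This is not JetFormer's actual design (the paper instead trains a normalizing flow jointly with the transformer to produce soft image tokens); every size and name below is an illustrative assumption.

```python
# A minimal sketch of the naive baseline: embed raw pixel patches with a linear
# projection (no VQ-VAE) and run them through one causal transformer together
# with text tokens. This is NOT JetFormer's method; all sizes are illustrative.
import torch
import torch.nn as nn

class TextPixelDecoder(nn.Module):
    def __init__(self, vocab_size=32000, d_model=512, patch_dim=3 * 16 * 16, n_layers=4):
        super().__init__()
        self.text_embed = nn.Embedding(vocab_size, d_model)
        self.patch_embed = nn.Linear(patch_dim, d_model)      # raw pixels -> model dim
        layer = nn.TransformerEncoderLayer(d_model, nhead=8, batch_first=True)
        self.blocks = nn.TransformerEncoder(layer, n_layers)  # made causal via the mask below
        self.lm_head = nn.Linear(d_model, vocab_size)         # text prediction head only

    def forward(self, patches, text_ids):
        # patches: (B, T_img, patch_dim) float, text_ids: (B, T_text) int64
        x = torch.cat([self.patch_embed(patches), self.text_embed(text_ids)], dim=1)
        mask = nn.Transformer.generate_square_subsequent_mask(x.size(1))
        return self.lm_head(self.blocks(x, mask=mask))

model = TextPixelDecoder()
logits = model(torch.randn(2, 4, 3 * 16 * 16), torch.randint(0, 32000, (2, 8)))
print(logits.shape)  # torch.Size([2, 12, 32000])
```

Generating the image side would additionally need a continuous output head rather than a softmax over a codebook, which is roughly the gap the JetFormer paper addresses; see the thread and paper for the actual recipe.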
December 2, 2024 at 4:41 PM
I wrote a long note on LLaMA 3-based speech models.
Paper Notes / Multimodal LLMs on top of LLaMA 3 | Md Abdur Razzaq Riyadh
Notes on a paper read in Nov '24.
riyadhrazzaq.github.io
December 1, 2024 at 8:36 PM
Reposted by Riyadh
get it while it’s hot #ai

go.bsky.app/Pxcnfu6
November 18, 2024 at 1:36 PM