Lightnews — Scholar-powered news

@gm8xx8.bsky.social

NVILA, a VLM, enhances VILA by scaling spatial and temporal resolutions before compressing visual tokens, enabling efficient high-resolution image & long video processing. Cuts training costs by 4.5×, improves memory & latency, and outperforms top VLMs on benchmarks. Code & models will be released 🔜

December 6, 2024 at 6:47 AM

𝚐𝔪𝟾𝚡𝚡𝟾

@gm8xx8.bsky.social

PaliGemma 2: A Family of Versatile VLMs for Transfer

paper: arxiv.org/abs/2412.03555

December 5, 2024 at 3:24 AM

𝚐𝔪𝟾𝚡𝚡𝟾

@gm8xx8.bsky.social

Liquid AI introduces Synthesis of Tailored Architectures (STAR), a new approach to automating neural network design for various tasks and hardware setups.

🔗: www.liquid.ai/research/aut...

December 2, 2024 at 11:45 PM

𝚐𝔪𝟾𝚡𝚡𝟾

@gm8xx8.bsky.social

Marconi: Prefix Caching for the Era of Hybrid LLMs

paper: arxiv.org/abs/2411.19379

Marconi improves prefix caching for hybrid LLMs with admission and eviction policies that weigh reuse likelihood against compute savings, achieving up to 34.4× higher token hit rates and significantly reducing latency.

December 2, 2024 at 9:35 AM

𝚐𝔪𝟾𝚡𝚡𝟾

@gm8xx8.bsky.social

DeMo: Decoupled Momentum Optimization

code: github.com/bloc97/DeMo
paper: arxiv.org/abs/2411.19870

December 2, 2024 at 9:29 AM

𝚐𝔪𝟾𝚡𝚡𝟾

@gm8xx8.bsky.social

Training Agents with Weakly Supervised Feedback from Large Language Models

paper: arxiv.org/abs/2411.19547

December 2, 2024 at 7:36 AM

𝚐𝔪𝟾𝚡𝚡𝟾

@gm8xx8.bsky.social

Critical Tokens Matter: Token-Level Contrastive Estimation Enhances LLM's Reasoning Capability

paper: arxiv.org/abs/2411.19943

December 2, 2024 at 6:11 AM

𝚐𝔪𝟾𝚡𝚡𝟾

@gm8xx8.bsky.social

Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision

paper: arxiv.org/abs/2411.16579
project page: mathcritique.github.io

November 26, 2024 at 4:32 AM

𝚐𝔪𝟾𝚡𝚡𝟾

@gm8xx8.bsky.social

XGrammar: Flexible and Efficient Structured Generation Engine for Large Language Models

paper: arxiv.org/abs/2411.15100

November 25, 2024 at 5:19 AM

𝚐𝔪𝟾𝚡𝚡𝟾

@gm8xx8.bsky.social

February 19, 2024 at 6:03 PM

𝚐𝔪𝟾𝚡𝚡𝟾

@gm8xx8.bsky.social

docs.datasette.io/en/1.0a5/cha...

August 29, 2023 at 6:38 PM

𝚐𝔪𝟾𝚡𝚡𝟾

@gm8xx8.bsky.social

Crazy how many “web or web3 people” don’t know anything about IPFS.

August 10, 2023 at 3:12 PM

𝚐𝔪𝟾𝚡𝚡𝟾

@gm8xx8.bsky.social

April 30, 2023 at 4:38 AM

𝚐𝔪𝟾𝚡𝚡𝟾

@gm8xx8.bsky.social

Me scrolling the what’s hot feed

April 25, 2023 at 7:08 PM

𝚐𝔪𝟾𝚡𝚡𝟾

@gm8xx8.bsky.social

“Thanks for the bluesky invite, now what do I do?”

April 24, 2023 at 12:27 AM

𝚐𝔪𝟾𝚡𝚡𝟾

@gm8xx8.bsky.social

gm bluesky

April 19, 2023 at 11:43 AM

𝚐𝔪𝟾𝚡𝚡𝟾

@gm8xx8.bsky.social

April 19, 2023 at 3:28 AM

𝚐𝔪𝟾𝚡𝚡𝟾

@gm8xx8.bsky.social

April 15, 2023 at 1:56 AM

𝚐𝔪𝟾𝚡𝚡𝟾

@gm8xx8.bsky.social

This piece took a month to finish & 150+ layers to get the lighting just right.
You’ll notice a lot of my work has a 🟦 background; that’s bc I create on blue and use yellow, white, & orange to draw out my design/details. 🫠