𝚐𝔪𝟾𝚡𝚡𝟾
gm8xx8.bsky.social
☺︎
NVILA, a VLM, enhances VILA by scaling spatial and temporal resolutions before compressing visual tokens, enabling efficient high-resolution image & long video processing. Cuts training costs by 4.5X, improves memory & latency, and outperforms top VLMs on benchmarks. Code & models will be released 🔜
December 6, 2024 at 6:47 AM
PaliGemma 2: A Family of Versatile VLMs for Transfer

paper: arxiv.org/abs/2412.03555
December 5, 2024 at 3:24 AM
Liquid AI introduces synthesis of tailored architectures (STAR), a new approach that automates neural network design for various tasks and hardware setups.

🔗: www.liquid.ai/research/aut...
December 2, 2024 at 11:45 PM
Marconi: Prefix Caching for the Era of Hybrid LLMs

paper: arxiv.org/abs/2411.19379

Marconi improves caching for hybrid LLMs with policies optimizing reuse likelihood and compute savings, achieving 34.4× higher token hit rates and significantly reducing latency.
December 2, 2024 at 9:35 AM
DeMo: Decoupled Momentum Optimization

code: github.com/bloc97/DeMo
paper: arxiv.org/abs/2411.19870
December 2, 2024 at 9:29 AM
Training Agents with Weakly Supervised Feedback from Large Language Models

paper: arxiv.org/abs/2411.19547
December 2, 2024 at 7:36 AM
Critical Tokens Matter: Token-Level Contrastive Estimation Enhances LLM's Reasoning Capability

paper: arxiv.org/abs/2411.19943
December 2, 2024 at 6:11 AM
Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision

paper: arxiv.org/abs/2411.16579
project page: mathcritique.github.io
November 26, 2024 at 4:32 AM
XGrammar: Flexible and Efficient Structured Generation Engine for Large Language Models

paper: arxiv.org/abs/2411.15100
November 25, 2024 at 5:19 AM
February 19, 2024 at 6:03 PM
August 29, 2023 at 6:38 PM
Crazy how many “web or web3 people” don’t know anything about IPFS.
August 10, 2023 at 3:12 PM
April 30, 2023 at 4:38 AM
Me scrolling the what’s hot feed
April 25, 2023 at 7:08 PM
“Thanks for the bluesky invite, now what do I do?”
April 24, 2023 at 12:27 AM
gm bluesky
April 19, 2023 at 11:43 AM
April 19, 2023 at 3:28 AM
April 15, 2023 at 1:56 AM
this piece took a month to finish & 150+ layers to get the lighting just right.
You’ll notice a lot of my work has a 🟦 background; that’s bc I create on blue and use yellow, white, & orange to draw out my design/details. 🫠
March 9, 2023 at 1:22 AM
gm
March 8, 2023 at 12:42 PM
March 7, 2023 at 2:38 AM