Huy Tran
banner
huytransformer.com
Huy Tran
@huytransformer.com
sampling reality
Reposted by Huy Tran
Slicing the Gaussian Mixture Wasserstein Distance

Moritz Piening, Robert Beinert

Action editor: Makoto Yamada

https://openreview.net/forum?id=yPBtJ4JPwi

#wasserstein #generative #minimization
December 27, 2025 at 5:19 AM
Reposted by Huy Tran
Yucen Lily Li, Daohan Lu, Polina Kirichenko, Shikai Qiu, Tim G. J. Rudner, C. Bayan Bruss, Andrew Gordon Wilson: Out-of-Distribution Detection Methods Answer the Wrong Questions https://arxiv.org/abs/2507.01831 https://arxiv.org/pdf/2507.01831 https://arxiv.org/html/2507.01831
July 3, 2025 at 6:34 AM
Reposted by Huy Tran
Sloan Nietert, Ziv Goldfeld
Estimation of Stochastic Optimal Transport Maps
https://arxiv.org/abs/2512.09499
December 11, 2025 at 6:27 AM
Reposted by Huy Tran
A Comprehensive Survey on Knowledge Distillation

Amir M. Mansourian, Rozhan Ahmadi, Masoud Ghafouri et al.

Action editor: Changyou Chen

https://openreview.net/forum?id=3cbJzdR78B

#distillation #dnns #knowledge
December 8, 2025 at 1:19 PM
Reposted by Huy Tran
Survey of Video Diffusion Models: Foundations, Implementations, and Applications

Yimu Wang, Xuye Liu, Wei Pang, Li Ma, Shuai Yuan, Paul Debevec, Ning Yu

Action editor: Anurag Arnab

https://openreview.net/forum?id=2ODDBObKjH

#video #generative #visual
December 6, 2025 at 1:19 AM
Reposted by Huy Tran
Open Problems in Mechanistic Interpretability

Lee Sharkey, Bilal Chughtai, Joshua Batson et al.

Action editor: Sarath Chandar

https://openreview.net/forum?id=91H76m9Z94

#interpretability #ai #mechanistic
December 4, 2025 at 5:18 PM
Reposted by Huy Tran
Read last night. Very nice. arxiv.org/abs/2512.01868
December 3, 2025 at 5:46 PM
Reposted by Huy Tran
Two Is Better Than One: Aligned Representation Pairs for Anomaly Detection

Alain Ryser, Thomas M. Sutter, Alexander Marx, Julia E Vogt

Action editor: Shinichi Nakajima

https://openreview.net/forum?id=Bt0zdsnWYc

#outliers #anomaly #anomalies
December 2, 2025 at 9:18 PM
Reposted by Huy Tran
Wondering how DeepSeek v3.2 rivals SOTA models (e.g., GPT5/Gemini 3 pro) while being ~30x cheaper? 🤔

Let's learn how the base model works!

We'll focus on attention, the need for KV caching, and key ideas for improving attention (MQA/GQA/MLA/DSA).

youtu.be/Y-o545eYjXM
December 1, 2025 at 6:23 PM
Reposted by Huy Tran
Label Embedding via Low-Coherence Matrices

Jianxin Zhang, Clayton Scott

Action editor: Jake C. Snell

https://openreview.net/forum?id=vrcWXcr4On

#embedding #classification #label
November 29, 2025 at 1:18 AM
Reposted by Huy Tran
🚨 OpenReview might have leaked names, but it won't leak the best hyperparameters, unfortunately! 😅

Tired of the drama? Solve your HPO problems before the ICML deadline with this new monograph by our own Luca Franceschi & Massimiliano Pontil (& colleagues).

arxiv.org/abs/2410.22854
Hyperparameter Optimization in Machine Learning
Hyperparameters are configuration variables controlling the behavior of machine learning algorithms. They are ubiquitous in machine learning and artificial intelligence and the choice of their values ...
arxiv.org
November 28, 2025 at 5:34 PM
Reposted by Huy Tran
I'm quite intrigued by possibility theory, so I must say this looks quite exciting!

arxiv.org/abs/2511.21223
Maxitive Donsker-Varadhan Formulation for Possibilistic Variational Inference
Variational inference (VI) is a cornerstone of modern Bayesian learning, enabling approximate inference in complex models that would otherwise be intractable. However, its formulation depends on expec...
arxiv.org
November 27, 2025 at 3:25 AM
Reposted by Huy Tran
🔄 Updated Arxiv Paper

Title: Modelling Global Trade with Optimal Transport
Authors: Thomas Gaskin, Guven Demirel, Marie-Therese Wolfram, Andrew Duncan

Read more: https://arxiv.org/abs/2409.06554
November 21, 2025 at 8:05 AM
Reposted by Huy Tran
A Mixture of Exemplars Approach for Efficient Out-of-Distribution Detection with Foundation Models

Evelyn Mannix, Howard Bondell

Action editor: Gabriel Loaiza-Ganem

https://openreview.net/forum?id=xpKqnSJtE4

#classifier #detection #classification
November 20, 2025 at 5:19 PM
Reposted by Huy Tran
A Unified Approach Towards Active Learning and Out-of-Distribution Detection

Sebastian Schmidt, Leonard Schenk, Leo Schwinn, Stephan Günnemann

Action editor: Chicheng Zhang

https://openreview.net/forum?id=HL75La10FN

#detection #deep #feature
November 19, 2025 at 5:18 AM
Reposted by Huy Tran
Reading group tomorrow: "How to build a consistency model: Learning flow maps via self-distillation" with Nicholas Boffi! arxiv.org/abs/2505.18825

Join us on zoom at 9am PT, 12pm ET, 6pm CET: portal.valencelabs.com/starklyspeak...
November 16, 2025 at 5:06 PM
Reposted by Huy Tran
Unifying Self-Supervised Clustering and Energy-Based Models

Emanuele Sansone, Robin Manhaeve

Action editor: Ole Winther

https://openreview.net/forum?id=NW0uKe6IZa

#generative #supervised #models
November 13, 2025 at 1:18 PM
Reposted by Huy Tran
Entangled Schrödinger Bridge Matching][new]
Models interacting particle dynamics by entangling velocities via coupled bias forces, improving trajectory simulation for systems with evolving interactions.
November 11, 2025 at 4:00 AM
Reposted by Huy Tran
Does equivariance matter at scale?

Johann Brehmer, Sönke Behrends, Pim De Haan, Taco Cohen

Action editor: Marcus Brubaker

https://openreview.net/forum?id=wilNute8Tn

#models #equivariance #equivariant
November 10, 2025 at 9:18 PM
Reposted by Huy Tran
"The Principles of Diffusion Models" by Chieh-Hsin Lai, Yang Song, Dongjun Kim, Yuki Mitsufuji, Stefano Ermon. arxiv.org/abs/2510.21890
It might not be the easiest intro to diffusion models, but this monograph is an amazing deep dive into the math behind them and all the nuances
The Principles of Diffusion Models
This monograph presents the core principles that have guided the development of diffusion models, tracing their origins and showing how diverse formulations arise from shared mathematical ideas. Diffu...
arxiv.org
October 28, 2025 at 8:35 AM
Reposted by Huy Tran
New paper on arXiv! And I think it's a good'un 😄

Meet the new Lattice Random Walk (LRW) discretisation for SDEs. It’s radically different from traditional methods like Euler-Maruyama (EM) in that each iteration can only move in discrete steps {-δₓ, 0, δₓ}.
August 29, 2025 at 3:07 PM
Reposted by Huy Tran
Samuel Duffield, Maxwell Aifer, Denis Melanson, Zach Belateche, Patrick J. Coles
Lattice Random Walk Discretisations of Stochastic Differential Equations
https://arxiv.org/abs/2508.20883
August 29, 2025 at 4:03 AM
Reposted by Huy Tran
Luca Ambrogioni
The Information Dynamics of Generative Diffusion
https://arxiv.org/abs/2508.19897
August 28, 2025 at 4:19 AM
Reposted by Huy Tran
A random old one:

"Kernels and Decision Trees"
hackmd.io/@sp-monte-ca...
My memory is that I have a few mostly-written drafts waiting in the wings on HackMD, and I'll try to upload them soon.

I'm also thinking about writing exercises which might be fun for me to explore, e.g. picking some topic from a list and taking <30 mins to write a personal impression / overview.
August 17, 2025 at 8:07 PM