Krishna Balasubramanian
@krizna.bsky.social
https://sites.google.com/view/kriznakumar/ Associate professor at @ucdavis
#machinelearning #deeplearning #probability #statistics #optimization #sampling
We implement these oracles using heat-kernel truncation and Varadhan's asymptotics; in the latter case, this links our method to the entropy-regularized proximal point method on Wasserstein spaces.

Joint work with Yunrui Guan and @shiqianma.bsky.social
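
As a toy numerical illustration of the Varadhan ingredient only (not the oracle implementation from the paper): for the heat kernel p_t of the Laplacian in Euclidean space, Varadhan's formula gives -4t log p_t(x, y) -> |x - y|^2 as t -> 0, which is what lets small-time heat-kernel quantities stand in for squared distances. A minimal check in one dimension, assuming the convention d/dt u = Laplacian u:

```python
# Toy numerical check of Varadhan's small-time asymptotics in R^1
# (illustrative only; not the oracle implementation from the paper).
# For d/dt u = Laplacian u, the heat kernel is
#   p_t(x, y) = (4*pi*t)^(-1/2) * exp(-|x - y|^2 / (4t)),
# and Varadhan's formula gives -4t * log p_t(x, y) -> |x - y|^2 as t -> 0.

import numpy as np

def heat_kernel_1d(x, y, t):
    return (4 * np.pi * t) ** -0.5 * np.exp(-(x - y) ** 2 / (4 * t))

x, y = 0.0, 1.5  # two points; true squared distance is 2.25
for t in [1.0, 0.1, 0.01, 0.001]:
    approx = -4 * t * np.log(heat_kernel_1d(x, y, t))
    print(f"t = {t:7.3f}:  -4t log p_t = {approx:.4f}   (|x - y|^2 = {(x - y) ** 2})")
```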
February 12, 2025 at 9:59 PM
Our bounds show how key factors—like the number of matches and treatment balance—impact Gaussian approximation accuracy.

We also introduce multiplier bootstrap bounds for obtaining finite-sample valid, data-driven confidence intervals.
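
For intuition, here is a minimal sketch of the multiplier-bootstrap idea for a generic mean-like statistic; the per-unit contributions psi_i, the Gaussian multipliers, and the nominal level below are placeholders, not the exact construction analyzed in the paper.

```python
# Minimal sketch of a Gaussian multiplier bootstrap CI for a mean-like
# statistic theta_hat = mean(psi); the psi_i are placeholder per-unit
# contributions (e.g., influence-function values), not the paper's exact ones.

import numpy as np

rng = np.random.default_rng(0)

def multiplier_bootstrap_ci(psi, alpha=0.05, n_boot=2000, rng=rng):
    n = len(psi)
    theta_hat = psi.mean()
    centered = psi - theta_hat
    # Perturb each contribution by an i.i.d. N(0, 1) multiplier and record
    # the resulting fluctuation of the statistic.
    mult = rng.standard_normal((n_boot, n))
    boot = (mult * centered).mean(axis=1)
    lo, hi = np.quantile(boot, [alpha / 2, 1 - alpha / 2])
    # Reflect the bootstrap quantiles around the point estimate.
    return theta_hat - hi, theta_hat - lo

psi = rng.normal(loc=1.0, scale=2.0, size=500)  # synthetic contributions
print(multiplier_bootstrap_ci(psi))
```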
January 2, 2025 at 7:01 PM
Matching-based ATE estimators align treated and control units to estimate causal effects without strong parametric assumptions.

Using the Malliavin-Stein method, we establish Gaussian approximation bounds for these estimators.
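
For concreteness, a minimal sketch of a one-nearest-neighbor matching estimator of the ATE on synthetic data; this is the textbook construction with a single match per unit, not necessarily the exact estimator analyzed in the paper.

```python
# Minimal sketch of a 1-nearest-neighbor matching ATE estimator on synthetic
# data; illustrative only, not the exact estimator from the paper.

import numpy as np
from sklearn.neighbors import NearestNeighbors

rng = np.random.default_rng(1)
n, d = 1000, 5
X = rng.normal(size=(n, d))                 # covariates
T = rng.binomial(1, 0.5, size=n)            # random treatment assignment
Y = X[:, 0] + 2.0 * T + rng.normal(size=n)  # true ATE = 2.0

treated, control = X[T == 1], X[T == 0]
y_t, y_c = Y[T == 1], Y[T == 0]

# Impute each unit's missing counterfactual with its nearest match
# from the opposite arm (one match per unit).
idx_c = NearestNeighbors(n_neighbors=1).fit(control).kneighbors(treated)[1][:, 0]
idx_t = NearestNeighbors(n_neighbors=1).fit(treated).kneighbors(control)[1][:, 0]

effects_treated = y_t - y_c[idx_c]   # observed Y(1) minus matched Y(0)
effects_control = y_t[idx_t] - y_c   # matched Y(1) minus observed Y(0)
ate_hat = np.concatenate([effects_treated, effects_control]).mean()
print(f"matching ATE estimate: {ate_hat:.3f}")
```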
January 2, 2025 at 7:01 PM
thanks, resent the email now!
December 3, 2024 at 1:11 AM
How well does RF perform in these settings? That's still an open question.

Bottom line: it's time to compare SGD-trained NNs with RFs, not with kernel methods!
November 27, 2024 at 3:07 PM
Going beyond the mean-field regime for SGD-trained NNs certainly helps. Recent works connect the learnability of SGD-trained NNs with the leap complexity and information exponent of function classes (like single- and multi-index models), with the goal of explaining feature learning.
November 27, 2024 at 3:07 PM
It also creates an intriguing parallel with NNs: greedy-trained partitioning models and SGD-trained NNs (in the mean-field regime) both thrive under specific structural assumptions (e.g., MSP) but struggle otherwise.

However, under MSP, greedy RFs are provably better than SGD-trained 2-layer NNs!
November 27, 2024 at 3:07 PM
In our work:

arxiv.org/abs/2411.04394

we show that if the true regression function satisfies MSP, greedy training works well with 𝑂(log 𝑑) samples.

Otherwise, it struggles.

This settles the question of learnability for greedy recursive partitioning algorithms like CART.
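
As a concrete reminder of what "greedy training" means here, a minimal sketch of CART-style greedy recursive partitioning for regression on binary features; the depth, impurity criterion, and the staircase-like toy target below are simplifications for illustration, not the exact algorithm or setting analyzed in the paper.

```python
# Minimal sketch of greedy recursive partitioning (CART-style) for regression
# on binary features; simplified for illustration, not the paper's exact variant.

import numpy as np

def greedy_tree(X, y, depth):
    """Grow a tree by greedily picking, at each node, the coordinate split
    that most reduces the squared-error impurity."""
    if depth == 0 or len(y) < 2 or np.ptp(y) == 0:
        return y.mean()                      # leaf: predict the node average
    best = None
    for j in range(X.shape[1]):
        left = X[:, j] == 0
        if left.all() or (~left).all():
            continue                         # split does not separate the node
        sse = ((y[left] - y[left].mean()) ** 2).sum() + \
              ((y[~left] - y[~left].mean()) ** 2).sum()
        if best is None or sse < best[0]:
            best = (sse, j, left)
    if best is None:
        return y.mean()
    _, j, left = best
    return (j,
            greedy_tree(X[left], y[left], depth - 1),
            greedy_tree(X[~left], y[~left], depth - 1))

def predict(tree, x):
    while isinstance(tree, tuple):
        j, left_child, right_child = tree
        tree = left_child if x[j] == 0 else right_child
    return tree

# Toy target with a staircase-like structure: y = x_0 + x_0 * x_1 + noise.
rng = np.random.default_rng(0)
X = rng.integers(0, 2, size=(400, 10))
y = X[:, 0] + X[:, 0] * X[:, 1] + 0.1 * rng.normal(size=400)
tree = greedy_tree(X, y, depth=3)
print(predict(tree, np.array([1, 1, 0, 0, 0, 0, 0, 0, 0, 0])))  # roughly 2
```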
November 27, 2024 at 3:07 PM
MSP is used to argue that SGD-trained 2-layer NNs are better than vanilla kernel methods.

But how do neural nets compare with random forest (RF) trained using greedy algorithms like CART?
November 27, 2024 at 3:07 PM
add me please
🙋
November 26, 2024 at 1:18 AM
Yes, but is the cover indicative of RL notations by any chance :P
November 24, 2024 at 5:31 PM