Spinning Weights
banner
spinningweights.bsky.social
Spinning Weights
@spinningweights.bsky.social
Daily AI research updates

Customized AI research newsletters at https://spinningweights.com
📑 New paper: DiT-Air

Explores Diffusion Transformers for text-to-image tasks, introducing DiT-Air and DiT-Air-Lite. Achieves state-of-the-art results with fewer parameters, highlighting efficient models' potential.
March 15, 2025 at 3:00 PM
📑 New paper: ETCH

ETCH fits 3D body models to clothed human point clouds using SE(3) equivariance, mapping cloth-to-body surfaces. It generalizes well across poses/garments, excelling in accuracy and robustness.
March 15, 2025 at 2:00 PM
📑 New paper: UniGoal

UniGoal is a novel framework for zero-shot goal-oriented navigation using a uniform graph to unify goals. Leveraging large language models, it achieves top performance on multiple benchmarks, surpassing task-specific and supervised methods.
March 15, 2025 at 1:00 PM
📑 New paper: ConsisLoRA

ConsisLoRA is a novel LoRA-based style transfer method that boosts content and style consistency by tuning LoRA weights to predict the original image, solving key style transfer challenges.
March 15, 2025 at 12:00 PM
📑 New paper: Hierarchical Self-Supervised Adversarial Training for Robust Vision Models in Histopathology

HSAT enhances vision model robustness in histopathology, surpassing current methods in adversarial settings and setting new healthcare model reliability benchmarks.
March 15, 2025 at 11:00 AM
📑 New paper: Transformers without Normalization

Introduces Dynamic Tanh (DyT), replacing normalization layers with an element-wise operation. DyT adjusts activation range, matching/exceeding traditional methods, challenging the need for normalization.
March 15, 2025 at 10:00 AM
📑 New paper: Charting and Navigating Hugging Face's Model Atlas

Maps the landscape of publicly available neural nets on Hugging Face, unearthing trends, model attributes, and aiding in prediction and documentation through real-world training practices.
March 15, 2025 at 9:00 AM
📑 New paper: HybridVLA

HybridVLA unifies vision-language-action with autoregressive and diffusion policies for better robotic manipulation. Collaborative training boosts performance across tasks, outperforming previous methods in simulations and real-world tests.
March 15, 2025 at 8:00 AM
📑 New paper: V2Edit

V2Edit offers training-free, instruction-led video and 3D scene editing, balancing content preservation and edits. It excels in complex scenarios with fast-moving cameras and major geometric shifts.
March 15, 2025 at 2:00 AM
📑 New paper: A Frustratingly Simple Yet Highly Effective Attack Baseline

Introducing M-Attack: achieving over 90% success on black-box LVLMs like GPT-4.5 by encoding semantic details in local image regions, surpassing previous state-of-the-art attack methods.
March 15, 2025 at 1:00 AM
📑 New paper: OVTR

OVTR is an end-to-end open-vocabulary multiple object tracking system using transformers. It introduces Category Information Propagation and dual-branch structures for better classification and faster inference.
March 15, 2025 at 12:00 AM
📑 New paper: CoSTA$\ast$

CoSTA* is an innovative cost-sensitive toolpath agent for multi-turn image editing. It merges LLMs and A* search to optimize AI tool use, achieving a superior cost-quality balance on new benchmarks.
March 14, 2025 at 4:00 PM
📑 New paper: Distilling Diversity and Control in Diffusion Models

Tackles reduced diversity in distilled diffusion models by introducing diversity distillation, enhancing diversity beyond base models while maintaining efficiency.
March 14, 2025 at 3:00 PM
📑 New paper: Benefits of Learning Rate Annealing for Tuning-Robustness in Stochastic Optimization

Shows how learning rate annealing, like cosine schedules, boosts robustness and cuts computational load, enabling sublinear convergence in large-scale model training.
March 14, 2025 at 9:00 AM
📑 New paper: Optimal ISAC Beamforming Structure and Efficient Algorithms for Sum Rate and CRLB Balancing

Enhances ISAC beamforming, improving spectrum efficiency and estimation accuracy using SCA for convex subproblems and lower complexity.
March 14, 2025 at 8:00 AM
📑 New paper: Mitigating Membership Inference Vulnerability in Personalized Federated Learning

Tackles privacy risks in PFL from Membership Inference Attacks in IFCA. Introduces IFCA-MIR to enhance clustering with MIA risk assessment, boosting accuracy, fairness, and privacy.
March 14, 2025 at 2:00 AM
📑 New paper: Graph-based Full Event Interpretation

GraFEI uses graph neural networks to reconstruct events at the Belle II experiment, boosting B meson decay detection with invisible particles. It enhances background reduction while keeping signal efficiency high.
March 14, 2025 at 1:00 AM
📑 New paper: Got Compute, but No Data

Explores post-training LLMs for instruction-following in English & Finnish, tackling data scarcity in small languages. Shows competitive Finnish results with few samples & best outcomes using bilingual data. Open-sourced.
March 14, 2025 at 12:00 AM
📑 New paper: CASTLE

CASTLE evaluates static analyzers, formal verification tools, and LLMs on C program vulnerability detection. It highlights pros and cons of each, with a novel CASTLE Score for fair tool comparison.
March 13, 2025 at 11:00 PM
📑 New paper: Evaluating Multi-Instance DNN Inferencing on Multiple Accelerators of an Edge Device

Examines ResNet50’s multi-instance performance on Nvidia Jetson accelerators, focusing on batch size effects on throughput, latency, and scheduling amid resource contention.
March 13, 2025 at 10:00 PM
📑 New paper: Differentially Private Equilibrium Finding in Polymatrix Games

Tackles finding equilibria in polymatrix games with privacy. Introduces a distributed algorithm offering both accuracy and privacy as player numbers grow, advancing multi-agent systems.
March 13, 2025 at 9:00 PM
📑 New paper: TPDiff

TPDiff is a temporal pyramid video diffusion model boosting video generation efficiency by adjusting frame rates during diffusion. It cuts training/inference costs and outperforms traditional methods.
March 13, 2025 at 8:00 PM
📑 New paper: Manify

Manify, a Python library, enables learning non-Euclidean representations via manifold learning. It aids data embedding into non-Euclidean spaces, boosting classification, regression, and curvature estimation.
March 13, 2025 at 7:00 PM
📑 New paper: PISA Experiments

Explores post-training of large video models to better simulate physics, especially object freefall. Introduces PISA framework for fine-tuning and reward modeling to boost physical accuracy in video generation.
March 13, 2025 at 6:00 PM
📑 New paper: Audio Flamingo 2

AF2 is an advanced Audio-Language Model improving reasoning over non-speech sounds and music. It uses a custom CLAP model, synthetic audio QA data, and a curriculum learning strategy for top results.
March 8, 2025 at 1:00 PM