Flora Salim
florasalim.bsky.social
Flora Salim
@florasalim.bsky.social

Professor, CSE, UNSW Sydney. #AI #ML #UbiComp #LLM #MFM #timeseries #ST #multimodal #sensors #continuallearning #trustworthyAI ❤️ #coffee
Why am I here? Scouting for a new platform to discover and learn new papers (let’s see if it’s the one) .. more

Computer science 53%
Engineering 39%

Finally, we compared Bisecle to frontier models like GPT-4o and Gemini 2.5. Note that we can only use APIs to test them. We show that these frontier LLMs still struggle with temporal reasoning and dealing with non-stationary, evolving video tasks. In some tasks, Bisecle show superior performance
Bisecle: Binding and Separation in Continual Learning for Video Language Understanding
Frontier vision-language models (VLMs) have made remarkable improvements in video understanding tasks. However, real-world videos typically exist as continuously evolving data streams (e.g., dynamic s...
arxiv.org

- Bisecle exhibits remarkable resistance to forgetting
- Bisecle is compatible with LLMs from 1B to 13B, introducing only a small number of additional parameters and computational cost.
- Bisecle can achieve superior performance even in low-resource settings.

The results show that:
- Bisecle establishes a new SOTA results surpassing others in both accuracy (+15.79%) and forgetting reduction (8.49% lower Forgetting rate).
- Our method Bisecle consistently outperforms others, indicating strong robustness even when training data is limited.

The two components of Bisecle with complementary angles:
- multi-directional supervision mechanism improves knowledge preservation.
- contrastive prompt learning scheme is designed to isolate task-specific knowledge to facilitate efficient memory storage, and to explicitly mitigate update conflict.

Excited to share that Bisecle is accepted at #NeurIPS2025.
🧠 Bisecle: Binding and Separation in Continual Learning for Video Language Understanding.
Preprint: arxiv.org/abs/2507.00469
Code: github.com/cruiseresear...
Inspired by the rapid binding and pattern separation mechanisms in the hippocampus

Reposted by Flora D. Salim

Congrats to team members Prof Flora Salim, Breeze Chen & Wilson Wongso from ADM+S and other team members Xiaoqian Hu & Yue Tan from UNSW who placed 3rd in the highly competitive KDD Cup 2025 Meta CRAG-MM Challenge @florasalim.bsky.social
#KDD2025 #WearableAI #VLLMs admscentre.org/4mu7lFx
Award-winning tech enhances information retrieval in wearable AI devices - ADM+S Centre
A team of researchers from ADM+S at UNSW has been awarded third place for single-source augmentation in the highly competitive KDD Cup 2025 Meta CRAG-MM Challenge, ranking alongside top institutions s...
admscentre.org

Multimodal CL #postdocjob #jobs 🦘🎓

* Potential for LLM steering:  The research explored the potential to manipulate ToM-related information within the LLMs to generate more aligned and contextually appropriate responses.

The first author - 1st year student Mehdi Jafari is attending his first academic conference #ACL2025.

* ToM-informed Alignment Improves Response Quality: Empirical evaluations of LLaMA-3 models (3B and 8B) demonstrated that incorporating ToM principles into the conversational agents improved response quality significantly, achieving win rates of 63% and 67% respectively.

Key Findings:
* LLMs Can Represent and Retain ToM-related Constructs: The study investigated whether LLMs could represent and retain ToM-related constructs and found evidence supporting this ability.

* ToM-informed Alignment Improves Response Quality:

In Beyond Words, we explore:
a) The extent to which the activation space of LLMs represents ToM of interlocutors,
b) Whether these representations form a consistent model of ToM,
and
c) How can we leverage ToM-related features to generate more aligned responses?

Current LLMs often generate contextually appropriate responses, but they don’t truly understand the user's goals, beliefs, or misunderstandings. 
Using ToM, we can analyse interlocutor behaviours based on the understanding of their mental and emotional states.

Beyond Words: Integrating Theory of Mind into Conversational Agents for Human-Like Belief, Desire, and Intention Alignment (#ACL2025 Findings)

aclanthology.org/2025.finding...

Codes: github.com/cruiseresear...

Findings in the thread below.

I’m recruiting a 1.5 year postdoc in multimodal continual learning external-careers.jobs.unsw.edu.au/cw/en/job/53...

Check our recent work on this topic. Bisecle: Binding and Separation in Continual Learning for Video Language Understanding
arxiv.org/abs/2507.00469

Know anyone suitable? Pls repost
Postdoctoral Research Associate / Senior Research Associate in Multimodal Learning
Conduct research field of deep learning focusing on novel continual multimodal learning methods.
external-careers.jobs.unsw.edu.au

Reposted by Flora D. Salim

If you reviewed for #ICLR, please make sure to read other reviewers' comments too and reflect on whether you may have missed something.

The paper will need to have a single decision; the point of this exercise is not just about addressing each reviewer's concerns individually.

Reposted by Flora D. Salim

(Not quite) end of term drinks @florasalim.bsky.social

Reposted by Flora D. Salim

Overleaf is down. Ah well... theoretically, only 30,000+ people hammering it 2 days before NeurIPS deadline.

Reposted by Flora D. Salim

Reposted by Flora D. Salim

NVIDIA Dynamo. Exciting announcement by Jensen Huang #gtc2025. A new open source VMWare-like inference framework for reasoning. It breaks up prefill and decode steps efficiently. DeepSeek R1 requests can be boosted by 30x. Already used by Perplexity, Meta, etc
developer.nvidia.com/blog/introdu...

I’m in San Jose this week for NVIDIA GTC. I’m a panelist for “The Role of AI and Accelerated Computing in Understanding and Mitigating Urban Climate Change” session.
We'll discuss how AI transforms climate modeling, weather forecasting, and high-resolution urban simulations.
Anyone else attending?

We know many cases of automated hiring gone wrong in the past. Automated firing? Doesn’t look good. No guarantee on transparency. The agency will be left to an unnamed LLM: “info would be fed into an unspecified LLM that would assess whether an employee was necessary” www.wired.com/story/doge-a...
DOGE Is Working on Software That Automates the Firing of Government Workers
Operatives working for Elon Musk’s DOGE appear to be editing the code of AutoRIF—software designed by the Defense Department that could assist in mass firings of federal workers, sources tell WIRED.
www.wired.com
Francis Collins, the NIH Director for 12 years, led the Human Genome Project and other NIH efforts for 32 years, resigned today. Key words from his resignation letter
www.nytimes.com/2025/03/01/u...

😔😢

Reposted by Flora D. Salim

Public Environmental Data Partners archives climate datasets, uploading copies to public repositories and cataloging where and how to find them if they go missing from government websites. Scholars explain:
How to find climate data and science the Trump administration doesn’t want you to see
Several groups are working to preserve webpages, tools and data – some of which have already gone missing from government webpages since the start of the Trump administration.
buff.ly

The Brick-by-Brick 2024 challenge focuses only on the multi-label classification problem, which we consider to be harder, and the holy grail for automation and management of net-zero and sustainable buildings.
Round 2 of Brick by Brick 2024 has commenced! To join: www.aicrowd.com/challenges/b...
AIcrowd | Brick by Brick 2024 | Challenges
Automating Building Data Classification
www.aicrowd.com

The task also tackles issues like imbalanced data and sparse labels, all while addressing real-world problems like building optimization and sustainability.
Our NeurIPS 2024 paper includes both a multi-label classification benchmark and a zero-shot forecasting benchmark.
neurips.cc/virtual/2024...
NeurIPS Poster Building Timeseries Dataset: Empowering Large-Scale Building AnalyticsNeurIPS 2024
neurips.cc

BTS is more challenging than existing TS datasets and a lot more interesting because BTS captures the complexities of real-world operations: 1) Temporal Irregularity; 2) Spatial Heterogeneity; 3) Long-tail Distribution.
-- requires models to manage hierarchical dependencies and ensure consistency.

** Knowledge Graph (KG)
BTS also includes a KG that captures the relationships between TS and their physical, logical, and virtual entities.
Making it a great case for Hierarchical Multi-Label Classification. The TS are to be classified across nested categories (e.g. Point>Sensor>Air Quality>CO2).