https://daviddebot.github.io/
Site: dtai.cs.kuleuven.be/projects/nes...
Video: youtu.be/3uLVxwlcSQc?...
@daviddebot.bsky.social, @gabventurato.bsky.social, @giuseppemarra.bsky.social, @lucderaedt.bsky.social
#ReinforcementLearning #AI #MachineLearning #NeurosymbolicAI
(8/8)
🔷 Code: github.com/ML-KULeuven/...
🔷 Based on MiniHack & Stable Baselines3
🔷 Define new shields in just a few lines of code!
🚀 Let’s make RL safer & smarter, together!
(7/8)
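As a rough illustration of the shielding idea behind the repo (this is a hand-rolled sketch, not the actual API of the released code): a probabilistic logic shield can be viewed as reweighting the policy's action probabilities by each action's probability of being safe, in the spirit of Yang et al.'s probabilistic logic shielding. The function name `shield` and the array layout are assumptions for illustration.

```python
import numpy as np

def shield(policy_probs, safety_probs):
    """Reweight action probabilities by each action's probability of being safe.

    Sketch of the probabilistic-logic-shielding idea (Yang et al.):
    pi_shielded(a|s) is proportional to P(safe | s, a) * pi(a|s).
    Names and shapes here are illustrative, not the repo's API.
    """
    weighted = policy_probs * safety_probs
    total = weighted.sum()
    if total == 0.0:  # no action deemed safe: fall back to the base policy
        return policy_probs
    return weighted / total

# Example: action 2 is almost certainly unsafe (e.g. it steps into lava),
# so its mass is redistributed over the remaining safe actions.
probs = shield(np.array([0.25, 0.25, 0.25, 0.25]),
               np.array([1.0, 1.0, 0.0, 1.0]))
```

"Defining a new shield" then amounts to supplying the logical rules that produce `safety_probs` for a given state.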
Use our interactive web demo!
🔷 Modify environments (add lava, monsters!)
🔷 Test shielded vs. non-shielded agents
🖥️ Play with it here: dtai.cs.kuleuven.be/projects/nes...
(6/8)
🔷 Faster training ⌛
🔷 Safer exploration 🔒
🔷 Better generalization 🌍
(5/8)
The shield:
✅ Exploits symbolic data from sensors 🌍
✅ Uses logical rules 📜
✅ Prevents unsafe actions 🚫
✅ Still allows flexible learning 🤖
A perfect blend of symbolic reasoning & deep learning!
(4/8)
There, RL agents face:
✅ Lava cliffs & slippery floors
✅ Chasing monsters
✅ Locked doors needing keys
Findings:
🔷 Standard RL struggles to find an optimal, safe policy.
🔷 Shielded RL agents stay safe & learn faster!
(3/8)
⚠️ It can take dangerous actions
⚠️ It lacks safety guarantees
⚠️ It struggles with real-world constraints
Yang et al.'s probabilistic logic shields fix this, enforcing safety without breaking learning efficiency! 🚀
(2/8)
1) Predict concepts from the input
2) Neurally select a rule from a memory of learned logic rules ➨ Accuracy
3) Evaluate the selected rule with the concepts to make a final prediction ➨ Interpretability (4/7)
⚡ State-of-the-art accuracy that rivals black-box models
🚀 Pure probabilistic semantics with linear-time exact inference
👁️ Transparent decision-making so human users can interpret model behavior
🛡️ Pre-deployment verifiability of model properties (3/7)