Author | Lightnews

Dylan Cope

@dylancope.bsky.social

1.4K followers 660 following 21 posts

Researching multi-agent RL, emergent communication, and evolutionary computation.

Postdoc at FLAIR Oxford. PhD from Safe and Trusted AI CDT @ KCL/Imperial. Previously visiting researcher at CHAI U.C. Berkeley.

dylancope.com

he/him
London 🇬🇧


Dylan Cope

@dylancope.bsky.social

@ordinarythings.bsky.social has a better understanding of the social impacts of AI than many of the people in the industry, and is doing a great job clearly explaining these issues in an entertaining way. This is the kind of public outreach the world needs more of.

June 27, 2025 at 9:09 AM

Dylan Cope

@dylancope.bsky.social

"People want more friends, sure. But if your solution to that is to build a product that makes it easier and more pleasurable to talk to no one, then fuck you. You are a misery merchant no better than a drug dealer"

youtu.be/NuIMZBseAOM?...

Will AI Slop Kill the Internet? | SlopWorld

YouTube video by Ordinary Things

youtu.be

June 27, 2025 at 9:05 AM

Dylan Cope

@dylancope.bsky.social

The way cars race up to the zebra crossing in SF is wild. I see the painted line a couple metres back from the crossing, but that seems to be a mere suggestion.

June 1, 2025 at 3:48 AM

Dylan Cope

@dylancope.bsky.social

I'm blushing

January 10, 2025 at 2:55 PM

Dylan Cope

@dylancope.bsky.social

Mmh I don't know if I would say they're the Sagans of our time. I think it's people like Vsauce, Hank Green, 3Blue1Brown, Physics Girl, SmarterEveryDay, Simone Giertz, Veritasium, MinutePhysics, etc.

December 5, 2024 at 9:45 PM

Dylan Cope

@dylancope.bsky.social

I think some people are annoyed and the baby bird response is a form of condescension. I don't like it.

I think it's good to be considerate and express gratitude if a reviewer has put in time. But you also have to make actual arguments.

December 4, 2024 at 12:22 AM

Dylan Cope

@dylancope.bsky.social

I never knew a photo of someone holding a hedgehog could feel so inspirational. This looks like it should be on a political poster or something!

November 27, 2024 at 1:30 PM

Dylan Cope

@dylancope.bsky.social

When people are interested in learning about how to train agents to communicate (emergent communication), I always recommend this paper as a first read: dl.acm.org/doi/10.5555/...

Attached meme summarises the main pitfall to be wary of!

November 25, 2024 at 1:43 PM

Dylan Cope

@dylancope.bsky.social

Chiming into the conversation on peer-review. I think this is a good point that we need to take seriously. Science denialism has gotten a huge boost recently and many grifters benefit from well-meaning debates that they can twist into anti-intellectual narratives.

M A Osborne @maosbot.bsky.social · Nov 24

I have seen some anti-peer-review takes on here, and, believe me, I understand the frustrations, but I would urge caution before doing or publicly saying anything too drastic. We live in an age rife with misinformation and anti-intellectualism; not everyone wants academia to survive

November 24, 2024 at 7:07 PM

Dylan Cope

@dylancope.bsky.social

👋🏻

November 24, 2024 at 2:56 PM

Reposted by Dylan Cope

Eugene Vinitsky 🍒

@eugenevinitsky.bsky.social

Kind of a broken record here but proceedings.neurips.cc/paper_files/...
is totally fascinating in that it postulates two underlying, measurable structures that you can use to assess if RL will be easy or hard in an environment

We introduce the effective horizon, a property of MDPs that controls how difficult RL is. Our analysis is motivated by Greedy Over Random Policy (GORP), a simple Monte Carlo planning algorithm (left) that exhaustively explores action sequences of length k and then uses m random rollouts to evaluate each leaf node. The effective horizon combines both k and m into a single measure. We prove sample complexity bounds based on the effective horizon that correlate closely with the real performance of PPO, a deep RL algorithm, on our BRIDGE dataset of 155 deterministic MDPs (right).

November 23, 2024 at 6:18 PM

Dylan Cope

@dylancope.bsky.social

I think the LLMs would generally write JAX that isn't compatible with jit, with lots of non-concrete shape issues. But if you know a couple of patterns for doing branchless conditionals in SIMD settings, it's not too hard to fix.

Or you could try aggressively prompting the LLMs 😂

November 23, 2024 at 3:39 PM
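
A minimal sketch of the kind of branchless conditional mentioned in the post above, assuming the common `jnp.where` pattern. The function `clipped_step` and its arguments are hypothetical illustrations, not taken from the author's code. Under `jax.jit`, a Python `if` on a traced value fails because the value is not concrete, whereas selecting between two already-computed results keeps every shape static.

```python
import jax
import jax.numpy as jnp


@jax.jit
def clipped_step(position, velocity, wall):
    """Advance one step and stop at the wall, without a Python `if`."""
    proposed = position + velocity
    # Traced boolean array; using it in a Python `if` would raise under jit.
    hit_wall = proposed > wall
    # jnp.where computes both options and selects element-wise, so the
    # output shape is known at trace time.
    new_position = jnp.where(hit_wall, wall, proposed)
    new_velocity = jnp.where(hit_wall, 0.0, velocity)
    return new_position, new_velocity


# Hypothetical inputs: three agents moving towards a wall at x = 10.
positions = jnp.array([0.0, 4.5, 9.0])
velocities = jnp.array([1.0, 1.0, 2.0])
new_pos, new_vel = clipped_step(positions, velocities, 10.0)
```

Both branches are always evaluated here, which is usually acceptable for cheap element-wise work; for expensive branches, `jax.lax.cond` may be a better fit.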

Dylan Cope

@dylancope.bsky.social

For my domains it is night and day! Easily 10x speed-ups. I've been using JAX for the last 8 months, and I was using RLlib before which was very slow for my purposes.

Writing custom environments in JAX can be a bit of a pain though.

November 22, 2024 at 5:12 PM

Dylan Cope

@dylancope.bsky.social

Currently I'm using:

- Custom gymnax env
- PureJAXRL
- PPO
- GRU RNNs
- wandb
- praying that my choice of hyperparameters is fine

November 22, 2024 at 1:15 PM

Dylan Cope

@dylancope.bsky.social

My hopeful interpretation is that the tweet is getting less engagement because we're all over here now, and not looking at Twitter!

But it also wouldn't remotely surprise me if Musk is suppressing mentions of Bluesky over there.

November 22, 2024 at 11:48 AM

Dylan Cope

@dylancope.bsky.social

I really hope it lasts! Feels very refreshing to see so many interesting things on the feed.

November 22, 2024 at 1:17 AM

Dylan Cope

@dylancope.bsky.social

Post a non-religious photo you think of as holy.

November 21, 2024 at 5:20 PM

Dylan Cope

@dylancope.bsky.social

Put differently - LLM pre-training is imitation learning, and so maybe they will imitate our ability to adapt OOD?

Imo the problem is that IL is notoriously bad OOD. Not yet convinced "just scale" fixes the fundamental issue of biased demo data/compounding errors.

November 20, 2024 at 7:52 PM

Dylan Cope

@dylancope.bsky.social

Managed to stump it with a drop that relies on correcting your balance with the wall. Wasn't too hard for me to get it but the agents don't get it!

November 20, 2024 at 12:37 PM

Dylan Cope

@dylancope.bsky.social

Could you add me! :)

November 20, 2024 at 9:51 AM

Reposted by Dylan Cope

Natasha Jaques

@natashajaques.bsky.social

One of my first posts on twitter was "fuck twitter". I'd just like to reiterate that sentiment today, as I join bluesky

November 19, 2024 at 12:37 AM

Dylan Cope

@dylancope.bsky.social

Please get the others from Novara on too 😅

November 14, 2024 at 1:47 PM

Dylan Cope

@dylancope.bsky.social

It works better than Twitter!

November 14, 2024 at 1:41 PM
