Our best method (Uncertainty Sampling) picked examples with the most uncertain predictions. This identified challenging examples, but without too much label noise
Our best method (Uncertainty Sampling) picked examples with the most uncertain predictions. This identified challenging examples, but without too much label noise
See Standard-OOD scores below (avg), where the simplest LLM-generated data (Short & Simple Generation) performed best, with substantial improvements
See Standard-OOD scores below (avg), where the simplest LLM-generated data (Short & Simple Generation) performed best, with substantial improvements
This involved sampling methods to choose more complex examples in our training data, and generating new synthetic examples
Some methods were pretty fun, e.g. asking an LLM to assess the difficulty of training examples
This involved sampling methods to choose more complex examples in our training data, and generating new synthetic examples
Some methods were pretty fun, e.g. asking an LLM to assess the difficulty of training examples
We find that fine-tuned LLMs are substantially more robust than commonly used encoder models, despite being fine-tuned on x50 less data.
This is especially the case on challenging OOD datasets (see Challenge-OOD avg below)
We find that fine-tuned LLMs are substantially more robust than commonly used encoder models, despite being fine-tuned on x50 less data.
This is especially the case on challenging OOD datasets (see Challenge-OOD avg below)
Have a look :)
arxiv.org/abs/2505.20209
Have a look :)
arxiv.org/abs/2505.20209
E.g. for an entailment NLI example, each hypothesis atom should also be entailed by the premise.
Very nice idea 👏👏
E.g. for an entailment NLI example, each hypothesis atom should also be entailed by the premise.
Very nice idea 👏👏
My local train line in Bristol is so heart warming. My highlight was commuting to the obscure, industrial St Andrews Road station (pictured) when I was a receptionist at the nearby firefighters hotel / training centre
My local train line in Bristol is so heart warming. My highlight was commuting to the obscure, industrial St Andrews Road station (pictured) when I was a receptionist at the nearby firefighters hotel / training centre
I did this journey a long time ago so I'm sure the train will have totally transformed now. I chose the cheapest 'hard seat' class, and remember curling up on the floor to sleep at night.
But wow the journey was fun, and I was sitting with the funnest maths students.
I did this journey a long time ago so I'm sure the train will have totally transformed now. I chose the cheapest 'hard seat' class, and remember curling up on the floor to sleep at night.
But wow the journey was fun, and I was sitting with the funnest maths students.
I stoped a few times along the way, but going up into the artic circle was pretty exciting, and taking the train through snowy blizzards was also new for me.
Picture from google.
I stoped a few times along the way, but going up into the artic circle was pretty exciting, and taking the train through snowy blizzards was also new for me.
Picture from google.
I did this trip in winter, and got to see Canada in the snow. Apart from the scenery, I love when the train stops for a few hours somewhere and you can run out and explore.
I did this trip in winter, and got to see Canada in the snow. Apart from the scenery, I love when the train stops for a few hours somewhere and you can run out and explore.
I love Amtrak long distance trains, with their amazing little sleeper rooms and dining cars. The views were great, but the real highlight on Amtrak is always the people I meet there
I love Amtrak long distance trains, with their amazing little sleeper rooms and dining cars. The views were great, but the real highlight on Amtrak is always the people I meet there
From Beijing to Moscow took me ~7 days, where I lived off instant noodles and vodka from the dining car. You get to see China gradually change into Mongolia and then into Russia.
From Beijing to Moscow took me ~7 days, where I lived off instant noodles and vodka from the dining car. You get to see China gradually change into Mongolia and then into Russia.
London is a great city to live in. I love all the green spaces personally, and the public transport (with the fantastic new Elizabeth line), but this city has just about everything.
London is a great city to live in. I love all the green spaces personally, and the public transport (with the fantastic new Elizabeth line), but this city has just about everything.
The location is brilliant! Imperial is right by Hyde Park / Kensington Gardens, which are enormous green parks. My commute involves walking through them on my way to uni and I love it.
Perfect for running too if that's the sort of thing you're into.
The location is brilliant! Imperial is right by Hyde Park / Kensington Gardens, which are enormous green parks. My commute involves walking through them on my way to uni and I love it.
Perfect for running too if that's the sort of thing you're into.
Imperial has a fantastic reputation, and we came 2nd in the world in the last QS world university rankings.
Help us get #1 from MIT 😊
Imperial has a fantastic reputation, and we came 2nd in the world in the last QS world university rankings.
Help us get #1 from MIT 😊
Just over a week ago and I was in the crazy Miami heat for #EMNLP2024
Just over a week ago and I was in the crazy Miami heat for #EMNLP2024
1) NLI remains a difficult task for LLMs
2) Having more few-shot examples is helpful (in my view, helping LLMs better understand class boundaries)
3) Incorrect predictions are often a result of ambiguous labels
1) NLI remains a difficult task for LLMs
2) Having more few-shot examples is helpful (in my view, helping LLMs better understand class boundaries)
3) Incorrect predictions are often a result of ambiguous labels
and super excited!! Amtrak trains are the fantastic, and I’ve got my own little room with two seats, a bed above, and toilet next to the bed.
The toilet thing is a bit weird though if you have two to a room
and super excited!! Amtrak trains are the fantastic, and I’ve got my own little room with two seats, a bed above, and toilet next to the bed.
The toilet thing is a bit weird though if you have two to a room
Chocolates went down well too!
Massive thanks to everyone for all your ideas and feedback
Chocolates went down well too!
Massive thanks to everyone for all your ideas and feedback
It's cool work about creating inherently interpretable models, and (as always) I will have chocolate to give out 😀
Paper is here: aclanthology.org/2024.emnlp-m...
It's cool work about creating inherently interpretable models, and (as always) I will have chocolate to give out 😀
Paper is here: aclanthology.org/2024.emnlp-m...