Work on:
- Compositionality, syntax (language structure)
- Web Agents: Synthetic data, tree search, exploration (language interpretation)
We propose methods for training LLMs with open-ended, unsupervised interaction on live websites:
✅ OSS SoTA on WebVoyager
✅ world's smallest high-performing web-agent
Try it here: nnetnav.dev
We propose methods for training LLMs with open-ended, unsupervised interaction on live websites:
✅ OSS SoTA on WebVoyager
✅ world's smallest high-performing web-agent
Try it here: nnetnav.dev
⏰When? today, 9am-5:30pm
📍West Ballroom B
s2r-at-scale-workshop.github.io
#NeurIPS2024
Look at the @neuripsconf.bsky.social tutorials in 2024!
neurips.cc/virtual/2024...
14 tutorials; 6 have "LLM" in the title; 4 more cover foundation models, with large NLP coverage. That's > 70% 😲
Look at the @neuripsconf.bsky.social tutorials in 2024!
neurips.cc/virtual/2024...
14 tutorials; 6 have "LLM" in the title; 4 more cover foundation models, with large NLP coverage. That's > 70% 😲
We are thrilled to release #AgentLab, a new open-source package for developing and evaluating web agents. This builds on the new #BrowserGym package which supports 10 different benchmarks, including #WebArena.
We are thrilled to release #AgentLab, a new open-source package for developing and evaluating web agents. This builds on the new #BrowserGym package which supports 10 different benchmarks, including #WebArena.
Go attend, and use the link below to ask all of your burning questions about LLM reasoning, agents and compositionality!
🔥Got spicy questions? Submit & vote here:
app.sli.do/event/dJNU63...
Go attend, and use the link below to ask all of your burning questions about LLM reasoning, agents and compositionality!
🔥Got spicy questions? Submit & vote here:
app.sli.do/event/dJNU63...
🔥Got spicy questions? Submit & vote here:
app.sli.do/event/dJNU63...
These folks care about their sub-field and i learn something new every time!
These folks care about their sub-field and i learn something new every time!
These folks care about their sub-field and i learn something new every time!
Context: I have a probe trained to predict dependency relations, and would like to train another one on a semantics only task (for research purposes)
Context: I have a probe trained to predict dependency relations, and would like to train another one on a semantics only task (for research purposes)
It's crazy how the classic "sample and rerank" baseline from machine translation and IR, got re-branded as "scaling up inference-time compute".
It's crazy how the classic "sample and rerank" baseline from machine translation and IR, got re-branded as "scaling up inference-time compute".