https://flow-ai.com
[Fischli/Weiss, Fondazione Prada]
[Fischli/Weiss, Fondazione Prada]
arxiv.org/abs/2411.04118
arxiv.org/abs/2411.04118
This paper goes a step further by focusing on reducing the compute required to build a dataset and train an LLM for a low-resource language.
huggingface.co/papers/2411....
This paper goes a step further by focusing on reducing the compute required to build a dataset and train an LLM for a low-resource language.
huggingface.co/papers/2411....
In this work we take the first steps towards asking whether LLMs can cater to diverse cultures in *user-facing generative* tasks.
[1/7]
In this work we take the first steps towards asking whether LLMs can cater to diverse cultures in *user-facing generative* tasks.
[1/7]
"LLM-as-a-judge" can replace fully manual judgments to accurately capture run-level effectiveness. It also does not appear to increase correlation with fully manual assessments.
"LLM-as-a-judge" can replace fully manual judgments to accurately capture run-level effectiveness. It also does not appear to increase correlation with fully manual assessments.
This document is for anyone who would like to get better at prompting post-trained LLMs. We assume that readers have had some basic interactions with some sort of LLM (e.g. Gemini), but we do not assume a rigorous technical understanding.
github.com/varungodbole...
This document is for anyone who would like to get better at prompting post-trained LLMs. We assume that readers have had some basic interactions with some sort of LLM (e.g. Gemini), but we do not assume a rigorous technical understanding.
github.com/varungodbole...
AI: go.bsky.app/SipA7it
RL: go.bsky.app/3WPHcHg
Women in AI: go.bsky.app/LaGDpqg
NLP: go.bsky.app/SngwGeS
AI and news: go.bsky.app/5sFqVNS
You can also search all starter packs here: blueskydirectory.com/starter-pack...
AI: go.bsky.app/SipA7it
RL: go.bsky.app/3WPHcHg
Women in AI: go.bsky.app/LaGDpqg
NLP: go.bsky.app/SngwGeS
AI and news: go.bsky.app/5sFqVNS
You can also search all starter packs here: blueskydirectory.com/starter-pack...
1. Diversity,
2. Quality responses, and
3. Verification.
AI-Assisted Generation of Difficult Math Questions
Shah et al.
When you do this stuff, plz release the data ;) - "plan to release"...
1. Diversity,
2. Quality responses, and
3. Verification.
AI-Assisted Generation of Difficult Math Questions
Shah et al.
When you do this stuff, plz release the data ;) - "plan to release"...