For (i), MASH outperforms efficient search baselines, esp. for multi-hop datasets (7.6% accuracy boost), even matching search baselines w/o any search penalties!
For (i), MASH outperforms efficient search baselines, esp. for multi-hop datasets (7.6% accuracy boost), even matching search baselines w/o any search penalties!
LLMs learn to use search tools to answer questions they would otherwise hallucinate on. But can this also teach them what they know vs not?
We introduce MASH that trains LLMs for search and gets abstentions for free!
LLMs learn to use search tools to answer questions they would otherwise hallucinate on. But can this also teach them what they know vs not?
We introduce MASH that trains LLMs for search and gets abstentions for free!