but often lead to inaccuracies/distractions in LLM outputs
They added unnecessary information to the questions to distract the LLMs
This led to much lower accuracy, even for o1
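
To make the kind of modification described above concrete, here is a minimal sketch of adding an irrelevant sentence to a question without changing its answer. The helper `add_distractor`, the sample question, and the distractor sentence are all hypothetical and made up for illustration; they are not taken from the study's data or code.

```python
def add_distractor(question: str, distractor: str) -> str:
    """Insert an irrelevant sentence just before the final sentence of a question."""
    # Assumption: the last sentence is the actual question being asked,
    # and sentences are separated by ". ".
    body, sep, final = question.strip().rpartition(". ")
    if not sep:
        # Single-sentence input: put the distractor in front instead.
        return f"{distractor.strip()} {question.strip()}"
    return f"{body}. {distractor.strip()} {final}"


# Made-up example question; the correct answer (102) is unaffected by the distractor.
original = "Ana picks 44 apples on Friday and 58 on Saturday. How many apples does Ana have?"
modified = add_distractor(original, "Five of the apples are slightly smaller than average.")
print(modified)
```

Comparing a model's answers on `original` and `modified` is one simple way to check whether the extra sentence distracts it.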