When these agents cite sources they claim the report is based on, how much can we actually trust them? In our new #ACL2025 paper, ...
When these agents cite sources they claim the report is based on, how much can we actually trust them? In our new #ACL2025 paper, ...
In my new blog post, I revisit the brief history of 𝗛𝗼𝘁𝗽𝗼𝘁𝗤𝗔, why it defined ...
1/
1/
Big tech executives and business analysts are racing to share eye-catching statements like "AI will write XX% of the code at MetaCorp by 20YY." How much truth is there to these, and what implications might this have?
🧵
Big tech executives and business analysts are racing to share eye-catching statements like "AI will write XX% of the code at MetaCorp by 20YY." How much truth is there to these, and what implications might this have?
🧵
* VP of AI at a unicorn public company
* Applied AI Engineer II at a F50 company
* Applied Scientist II at a F10 co.
* Research Intern at a F50 co.
* Senior Principal Applied Scientist at a F500 co.
* Senior Director of Applied Science at a F500 co.
Who am I?!
* VP of AI at a unicorn public company
* Applied AI Engineer II at a F50 company
* Applied Scientist II at a F10 co.
* Research Intern at a F50 co.
* Senior Principal Applied Scientist at a F500 co.
* Senior Director of Applied Science at a F500 co.
Who am I?!
🧵
🧵
1/n
1/n
🧵
🧵
Computer scientists in the era of #AI boom: I'm not so sure about that...
Computer scientists in the era of #AI boom: I'm not so sure about that...
1/n
1/n
1/n
1/n
1/n
1/n
A core challenge in developing a reliable #AIAgent is the ability to simulate potential outcomes of agent actions, especially at inference time, to guide robust planning and search.
1/n
A core challenge in developing a reliable #AIAgent is the ability to simulate potential outcomes of agent actions, especially at inference time, to guide robust planning and search.
1/n