We've updated last week's Data Insight with 3 additional benchmarks to judge capabilities.
The result? Across all four benchmarks, small open models lag frontier performance by less than a year. 🧵
x.com/EpochAIRese...
titotal.substack.com/p/a-deep-cri...
michaelnotebook.com/optimism/ind...
Part 1: condensedconcepts.blogspot.com/2024/09/the-...
Part 2: condensedconcepts.blogspot.com/2024/09/the-...
CAC "chose for its B-24 bomber a somewhat mysterious [airfoil] section devised by a lone inventor named David R. Davis... The B-24 went on to become the most numerous and one of the most successful bombers of World War II."