In 2019, Tenney et al. found that BERT rediscovers the classical NLP pipeline, offering linguistic insights into early LMs.
But models today are >>100x larger, trained on vastly more data…
Do they still rediscover the classical NLP pipeline? 🧵
In 2019, Tenney et al. found that BERT rediscovers the classical NLP pipeline, offering linguistic insights into early LMs.
But models today are >>100x larger, trained on vastly more data…
Do they still rediscover the classical NLP pipeline? 🧵