Ayush Thakur
ayushthakur.bsky.social
Ayush Thakur
@ayushthakur.bsky.social
MLE @ Weights and Biases
Back in the days, WMT14 en-de dataset with 400k training samples was used a lot for NMT tasks. The reason for that is German is morphologically richer than other subsets in that benchmark.
November 25, 2024 at 10:55 AM