Multilingual LLM training/eval/safety @ SRI lab
ayukh.com
For example, more ready-to-use data (e.g. web-scraped texts) for LLM fine-tuning, more Ukrainian-native benchmarks for evals, etc. This screenshot is from the INCLUDE paper by Cohere, which includes Ukrainian exams and is thus a new resource for evals 🙂
Since 2022, Ukrainian NLP efforts have increased dramatically, and so has the number of Ukrainian texts available online.