Interested in NLP for low-resource languages/terms, tokenization, and linguistics
Check out the data and code here: github.com/andhmak/rule...
4/4
Check out the data and code here: github.com/andhmak/rule...
4/4
After normalizing we even find cultural insights which were previously obscured!
3/4
After normalizing we even find cultural insights which were previously obscured!
3/4
By applying rule-based, linguistically informed transformations to the input before passing it to a LLM, with targeted few-shot prompting, we can obtain high-quality normalized outputs.
2/4
By applying rule-based, linguistically informed transformations to the input before passing it to a LLM, with targeted few-shot prompting, we can obtain high-quality normalized outputs.
2/4