tenojo.bsky.social
@tenojo.bsky.social
Reposted
(Nojonen, Korsu, Ginter, Laippala & Kanerva 2025) introduce TCBLex, a lexical database of Finnish literary works read by children (7-15y). Data consists of 14 sub-lexicons and over 11 million tokens, annotated and lemmatized.
Paper: link.springer.com/article/10.3...
Data: doi.org/10.5281/zeno...
TCBLex - A lexical database of Finnish literary texts for children - Behavior Research Methods
This work introduces TCBLex, a lexical database of Finnish literary works read by children between the ages of 7 and 15. We explain in detail the work done to build the corpus TCBLex is based on, incl...
link.springer.com
October 20, 2025 at 8:48 AM