Raphael Schumann
schumann.bsky.social
Natural Language Processing PhD Student @ Heidelberg University.

https://schumann.pub

#NLP #NLProc #ML #AI
Reposted by Raphael Schumann
A starter pack for #NLP #NLProc researchers! 🎉

go.bsky.app/SngwGeS
November 4, 2024 at 10:01 AM
Reposted by Raphael Schumann
#EMNLP has a nice set of tokenization/subword modeling papers this year.

It's a good mix of tokenization algorithms, tokenization evaluation, tokenization-free methods, and subword embedding probing. Lmk if I missed some!

Here is a list with links + presentation time (in chronological order).
November 11, 2024 at 10:38 PM
First time ML/NLP Bluesky feels alive.
November 7, 2024 at 9:39 PM
TIL that the ACL Wiki has/had a state-of-the-art overview:

aclweb.org/aclwiki/Stat...
State of the art - ACL Wiki
aclweb.org
November 27, 2023 at 9:12 AM
Turns out that with the right attention_mask and position_ids you can prefill tokens AND pad batches in huggingface transformers. This speeds up inference, especially if each instance has the same system prompt prepended. Code below ↓
October 13, 2023 at 11:34 AM
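The code from the original post is not preserved here. As a rough illustration of the idea only, the sketch below builds an attention_mask and position_ids for a batch that shares one cached prefix (e.g. a system prompt) and is left-padded after it. The helper name build_prefill_inputs and the choice of 0 as the dummy position for padding slots are assumptions for illustration, not the author's code.

```python
from typing import List, Tuple


def build_prefill_inputs(
    prefix_len: int, suffix_lens: List[int]
) -> Tuple[List[List[int]], List[List[int]]]:
    """Hypothetical helper: masks and positions for a shared, already-cached prefix.

    prefix_len  -- number of prefix tokens whose keys/values are already cached
    suffix_lens -- number of real (non-pad) tokens in each left-padded suffix
    """
    max_len = max(suffix_lens)
    attention_mask: List[List[int]] = []
    position_ids: List[List[int]] = []
    for n in suffix_lens:
        pad = max_len - n
        # The mask spans prefix + padded suffix: the cached prefix is attended,
        # the left padding is masked out, the real suffix tokens are attended.
        attention_mask.append([1] * prefix_len + [0] * pad + [1] * n)
        # Positions cover only the suffix: real tokens continue counting right
        # after the prefix; padding slots get a dummy position (assumed 0).
        position_ids.append([0] * pad + list(range(prefix_len, prefix_len + n)))
    return attention_mask, position_ids


mask, pos = build_prefill_inputs(prefix_len=3, suffix_lens=[2, 4])
print(mask)  # [[1, 1, 1, 0, 0, 1, 1], [1, 1, 1, 1, 1, 1, 1]]
print(pos)   # [[0, 0, 3, 4], [3, 4, 5, 6]]
```

In transformers, lists like these would presumably be converted to tensors and passed as attention_mask and position_ids together with the prefix's past_key_values, so that the padding never shifts the positions of the real tokens.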