Raphael Schumann
@schumann.bsky.social
Natural Language Processing PhD Student @ Heidelberg University.

https://schumann.pub

#NLP #NLProc #ML #AI
Same boat as your AC
March 2, 2025 at 11:13 AM
Could you add me please?
January 14, 2025 at 6:31 PM
CBOW vs. Skip-gram
December 20, 2024 at 11:59 AM
Great work! Are you going to release the models?
December 14, 2024 at 11:16 AM
This helped a lot!
November 7, 2024 at 9:27 PM
I even make sure to delete paths containing my username from the code in the supplementary material
January 5, 2024 at 3:49 PM
It also works with Flash Attention 2, although I don't see additional speedups. I don't think Flash Attention is optimized for generation.
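For reference, a minimal sketch of enabling Flash Attention 2 in recent transformers versions (assumptions: the flash-attn package is installed, a CUDA GPU is available, and the chosen model architecture supports FA2); the padded prefill call itself stays unchanged:

import torch
from transformers import AutoModelForCausalLM

# assumption: any FA2-supported causal LM works here; the model name is only an example
model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-hf",
    attn_implementation="flash_attention_2",
    torch_dtype=torch.float16,
).to("cuda")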
October 13, 2023 at 11:35 AM
Conceptually it is clear that this works, but I wasn't aware that huggingface transformers passes this through correctly.
Github Gist to reproduce:
gist.github.com/raphael-sch/...
Using padding and prefill during inference in huggingface transformers - run_padding_prefill.py
October 13, 2023 at 11:35 AM
You have to place the padding tokens in between the prefill and input tokens (example with 3 prefilled tokens):
input_ids: [0, 0, X, X, X, X]
position_ids: [0, 0, 3, 4, 5, 6]
attn_mask: [1, 1, 1, 0, 0, 1, 1, 1, 1]
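A minimal sketch of the same idea (not the author's gist): the model name, the prompt strings, and the eos-token-as-padding choice are assumptions; the tensor layout mirrors the example above.

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "gpt2"  # assumption: any causal LM that accepts position_ids works the same way
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name).eval()

# 1) Prefill a shared prefix and keep its KV cache (here: 3 tokens, positions 0..2).
prefill_ids = tokenizer("The quick brown", return_tensors="pt").input_ids
with torch.no_grad():
    prefill_out = model(input_ids=prefill_ids, use_cache=True)
past_key_values = prefill_out.past_key_values
n_prefill = prefill_ids.shape[1]

# 2) New input: padding tokens sit *between* the cached prefill and the real tokens.
pad_id = tokenizer.eos_token_id  # assumption: use eos as the padding token
new_ids = tokenizer(" fox jumps over the", return_tensors="pt").input_ids
n_pad = 2
input_ids = torch.cat([torch.full((1, n_pad), pad_id), new_ids], dim=1)

# Padding positions are arbitrary (they are masked out); the real tokens continue at
# n_prefill, n_prefill + 1, ... as in the example above ([0, 0, 3, 4, 5, 6]).
position_ids = torch.cat(
    [torch.zeros(1, n_pad, dtype=torch.long),
     torch.arange(n_prefill, n_prefill + new_ids.shape[1]).unsqueeze(0)], dim=1)

# The mask covers prefill (1s) + padding (0s) + new tokens (1s),
# i.e. [1, 1, 1, 0, 0, 1, 1, 1, 1] for 3 prefill / 2 pad / 4 new tokens.
attention_mask = torch.cat(
    [torch.ones(1, n_prefill, dtype=torch.long),
     torch.zeros(1, n_pad, dtype=torch.long),
     torch.ones(1, new_ids.shape[1], dtype=torch.long)], dim=1)

with torch.no_grad():
    out = model(input_ids=input_ids,
                position_ids=position_ids,
                attention_mask=attention_mask,
                past_key_values=past_key_values,
                use_cache=True)
next_token = out.logits[0, -1].argmax()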
October 13, 2023 at 11:35 AM