Mozhdeh Gheini
mgheini.bsky.social
Mozhdeh Gheini
@mgheini.bsky.social
USC Graduate Student | USC ISI NLP Researcher | 3x Apple Intern | Self-proclaimed Michelin 3-star Foodie | she/her
Are there any good pointers on when/why one would decide to run pre-training from scratch (and follow it with post-training ofc) to create a fresh LLM? Is it simply about shifting the knowledge cutoff or more than that? Do we know how/if that happens nowadays? What are the deciding factors?
January 7, 2025 at 2:40 AM
Reposted by Mozhdeh Gheini
i was annoyed at having many chrome tabs with PDF papers having uninformative titles, so i created a small chrome extension to fix it.

i'm using it for a while now, works well.

today i put it on github. enjoy.

github.com/yoavg/pdf-ta...
January 5, 2025 at 10:22 PM
Reposted by Mozhdeh Gheini
USC NLP folks are on Bluesky!
Follow my amazing colleagues here

go.bsky.app/KUwSZ6W
November 12, 2024 at 5:44 PM