Mozhdeh Gheini
mgheini.bsky.social
Mozhdeh Gheini
@mgheini.bsky.social
USC Graduate Student | USC ISI NLP Researcher | 3x Apple Intern | Self-proclaimed Michelin 3-star Foodie | she/her
I must also add that I’m assuming there’s no breakthrough architecture/pre-training/post-training method that pushes us to start everything from scratch. I’m simply asking about the decision factors in greenlighting such a full restart in the current status quo.
January 7, 2025 at 2:47 AM
Given how bad I am at it, it’s out of my league too; still fun though 😅
December 6, 2024 at 5:54 AM
Were you doing the NYT’s crossword? That’s how it happened for me. Also, if you want a bonus one, “doe” :)
December 5, 2024 at 1:33 AM
f’ as in fine-tuned from f, not the derivative of f 😅
December 3, 2024 at 4:53 AM
I got confused there yoo. Maybe something like “further condition the model’s output” (instead of update the model)?
So if the model is f(x), before the dashed line it’s f’(x), and after that it’s f(x|prompt/context).
December 3, 2024 at 4:49 AM