Tyler Chang
tylerachang.bsky.social
Tyler Chang
@tylerachang.bsky.social
PhD student at UC San Diego.
He/him/his.

https://tylerachang.github.io/
Play with it yourself: see influential pretraining examples from our method for facts, factual errors, commonsense reasoning, arithmetic, and open-ended generation: github.com/PAIR-code/pr...
December 13, 2024 at 6:57 PM
Our method, TrackStar, refines existing gradient-based approaches to scale to much larger settings: over 100x more queries and a 30x larger retrieval corpus than previous work at this model size.
December 13, 2024 at 6:57 PM
We scaled training data attribution (TDA) methods ~1000x to find influential pretraining examples for thousands of queries in an 8B-parameter LLM over the entire 160B-token C4 corpus!
medium.com/people-ai-re...
December 13, 2024 at 6:57 PM