Jaydeep Borkar
banner
jaydeepborkar.bsky.social
Jaydeep Borkar
@jaydeepborkar.bsky.social
Visiting Researcher at Meta NYC🦙 and PhD student at Northeastern. Organizer at the Trustworthy ML Initiative (trustworthyml.org). s&p in language models + mountain biking.

jaydeepborkar.github.io
We observe a phenomenon we call assisted memorization: we find that most PII (email ID) isn’t extracted after it is first seen. But fine-tuning further on data that contains n-grams that overlap with these PII finally leads to their extraction. This is a key factor (in our settings, up to 1/3).
March 2, 2025 at 7:20 PM