Lightnews — Scholar-powered news

Jaydeep Borkar

@jaydeepborkar.bsky.social

Visiting Researcher at Meta NYC🦙 and PhD student at Northeastern. Organizer at the Trustworthy ML Initiative (trustworthyml.org). s&p in language models + mountain biking.

jaydeepborkar.github.io

Posts Replies Media Videos

Jaydeep Borkar

@jaydeepborkar.bsky.social

We observe a phenomenon we call assisted memorization: we find that most PII (email ID) isn’t extracted after it is first seen. But fine-tuning further on data that contains n-grams that overlap with these PII finally leads to their extraction. This is a key factor (in our settings, up to 1/3).

March 2, 2025 at 7:20 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news