Google TechTalks•January 27, 2026

Large Language Models (LLMs)Privacy Data Security

Privacy Ripple Effects from Adding or Removing Personal Information in Language Model Training

YouTube · IzIsHFCqXGo

Quick Read

Research reveals how dynamic LLM training, including PII additions and removals, creates 'assisted memorization' and 'privacy ripple effects,' making sensitive data extractable even when initially unmemorized.

●PII can become extractable later in training, even if not initially memorized ('assisted memorization').

●Adding more PII increases overall leakage, and removing PII can expose new sensitive data.

●Overlapping data fragments (engrams) are a key driver of PII leakage.

Summary

This research explores the dynamic nature of PII extractability in Large Language Models (LLMs) during continuous training, data additions (opt-ins), and data removals (opt-outs). It introduces 'assisted memorization,' a phenomenon where PII not initially extractable becomes extractable later in training, primarily driven by training on overlapping engrams. The study demonstrates that adding more PII to training data significantly increases overall PII extractability, with a super-linear effect for top-k sampling. Crucially, removing memorized PII does not eliminate privacy risks; instead, it often exposes new layers of previously unextracted PII, dubbed a 'privacy onion' effect. The findings highlight that current static memorization audits are insufficient and emphasize the need for continuous evaluation considering the ripple effects of data changes.

As LLMs are increasingly trained on sensitive data and continuously updated, understanding the dynamic risks of PII leakage is critical. This research challenges the assumption that data seen early in training becomes less vulnerable or that removing some PII resolves privacy concerns. It provides concrete evidence that PII extractability is a complex, evolving problem influenced by data composition, training dynamics, and even the choice of decoding methods, necessitating more sophisticated privacy audits and data handling strategies for LLM developers and deployers.

Takeaways

❖LLMs are increasingly trained on sensitive data (e.g., EHRs, private emails) and are prone to memorizing and regurgitating it.
❖The research introduces 'assisted memorization,' where PII not extractable at an early checkpoint becomes extractable at a later one.
❖Assisted memorization is not merely delayed memorization; it is triggered by training on data containing overlapping engrams (partial duplicates) with the PII.
❖Removing overlapping engrams significantly reduces assisted memorization, indicating their causal role.
❖A logistic regression model predicts assisted memorization based on engram statistics, last name counts, and domain counts.
❖Adding more PII to training data (PII opt-ins) leads to a substantial increase in overall PII extraction, particularly with top-k sampling.
❖The inclusion of new PII also increases the risk of extraction for *existing* PII that was already in the training set.
❖Removing memorized PII (PII opt-outs) can inadvertently trigger the extraction of a new 'layer' of previously unextracted PII, akin to a 'privacy onion' effect.
❖The choice of decoding method significantly impacts PII extraction; top-k sampling extracts many more emails than greedy decoding.
❖Memorization audits should evaluate examples not currently extracted, as they may become vulnerable later due to assisted memorization.

Insights

1Assisted Memorization: PII Can Become Extractable Later in Training

The research identifies 'assisted memorization,' where PII seen early in training is not immediately memorized or extractable, but becomes so at a later checkpoint as training progresses. This challenges the assumption that older data is less vulnerable to extraction.

Examples seen early in training may still remain vulnerable to extraction at later stages. Models on average show equal or more assisted memorization than immediate memorization, with the effect increasing with model scale (e.g., GPT-2 large vs. small).

2Overlapping Engrams Drive Assisted Memorization

Assisted memorization is primarily triggered by training on new data that contains overlapping engrams (sequences of tokens) with the previously unmemorized PII. These partial duplicates or related fragments 'assist' the model in later recalling the full PII.

When a model is trained on data with overlapping engrams (e.g., 'John McCarthy' when 'Elisabeth.McCarthy' was previously unmemorized), the original PII becomes extractable. Removing these overlapping engrams from training batches reduced assisted memorization from 177 emails to only 10.

3Adding PII Increases Overall and Existing Data Extraction Risk

Incrementally adding more PII to an LLM's training data significantly increases the total extractability of PII. This effect is super-linear for top-k sampling. Furthermore, adding new PII also increases the risk of extraction for PII that was already present in the training data.

As the percentage of PII in the training data increased from 10% to 100%, total memorization for top-k sampling rose from 57 emails (at 50% PII) to 283 emails (at 100% PII). For a fixed data set (e.g., D40%), the number of memorized emails increased as more PII was added to subsequent training models (e.g., from 43 emails in M4 to higher counts in M5 and beyond).

4PII Removal Can Expose New Layers of Sensitive Data

Attempting to remove specific memorized PII by retraining the model can inadvertently cause a new set of previously unextracted PII to become vulnerable to memorization. This 'privacy onion' effect suggests that PII on the verge of memorization surfaces once a 'first layer' of memorized PII is removed.

After removing an initial set of memorized emails and retraining, a new set of emails became extractable. This pattern continued for multiple rounds of removal, showing successive 'layers' of PII becoming vulnerable. Perplexity analysis confirmed that these newly extracted emails were already 'close' to memorization in the original model.

5Decoding Methods Significantly Impact PII Extraction

The choice of decoding and sampling method used for LLM generation plays a critical role in PII extractability. Top-k sampling consistently leads to significantly higher rates of PII extraction compared to greedy decoding.

Top-k sampling extracted many more emails than greedy decoding across all experiments. Top-k also generated more email addresses on average, suggesting its wider token pool contributes to increased leakage.

Key Concepts

Assisted Memorization

A phenomenon where Personally Identifiable Information (PII) is present in a model's training data and initially not extractable, but becomes extractable at a later stage of continuous training, often due to exposure to related or overlapping data fragments (engrams).

Layered Memorization (Privacy Onion Effect)

The observation that PII extractability in LLMs can be layered. Removing a set of currently memorized PII (the 'outer layer') can reveal and make extractable a new set of previously unmemorized PII (an 'inner layer'), implying that privacy risks are not fully mitigated by simple removal.

Lessons

Implement continuous monitoring for PII extractability, not just one-time audits, as 'assisted memorization' means data can become vulnerable long after initial training.
Scrutinize training data for overlapping engrams, as these partial duplicates are a major driver of assisted memorization and can expose sensitive information.
Exercise caution when adding new PII to an LLM's training data, as it can super-linearly increase the extractability of *all* PII, including existing data.
Recognize that PII removal efforts may inadvertently expose other sensitive data; a 'privacy onion' effect means new layers of PII can become extractable.
Select decoding and sampling methods carefully for LLM deployment, as methods like top-k sampling significantly increase the likelihood of PII extraction compared to greedy decoding.

Quotes

"Assisted memorization actually happens when you know a PII which is seen in step one is actually not memorized in step one but it actually ends up getting memorized in step two or later."

JD Borcar

"If we remove the first layer then a second layer becomes vulnerable to memorization. If we remove the second layer then a third layer becomes vulnerable to memorization."

JD Borcar

"Only evaluating examples that get extracted may actually create a false sense of privacy because assisted memorization tells us that examples that don't get extracted at current checkpoint may get extracted later."

JD Borcar

Q&A

Related Episodes

Google TechTalks• Jan 27, 2026

Threat Models for Memorization: Privacy, Copyright, and Everything In-Between

"Relaxing threat models for machine learning memorization, even with natural data or benign users, creates unexpected privacy and copyright vulnerabilities in AI models."

Machine LearningPrivacyLarge Language Models+2

Just Trish• Jun 30, 2026

Alissa Violet Is Pregnant?! + Nobody Showed Up to VidCon... | Just Trish Ep. 289

"This episode unpacks the latest celebrity drama, from Alissa Violet's alleged pregnancy and FaZe Banks' controversial stream to the evolving landscape of fan culture, AI's influence on tech and media, and the hosts' unfiltered takes on modern relationships."

Pop CultureCelebrity GossipSocial Media+2

Julian Dorey Podcast• Jun 26, 2026

“Psychopaths in Power!” - HEATED Debate on Peter Thiel, Lutnick & Scariest AI Outcome | Pomp • 440

"A heated debate unpacks the societal and economic implications of AI, challenging official inflation data and questioning the ethics of powerful tech figures amidst a disappearing middle class."

Artificial IntelligenceEconomicsInflation+2

This Past Weekend w/ Theo Von• Jun 10, 2026

Matt Rife | This Past Weekend w/ Theo Von #662

"Comedian Matt Rife discusses his rapid rise in stand-up, balancing acting with touring, his investment in the Ed and Lorraine Warren paranormal museum, and his struggles with severe insomnia."

Comedy CareerStand-UpActing+2

Privacy Ripple Effects from Adding or Removing Personal Information in Language Model Training

Quick Read

Summary

Takeaways

Insights

1Assisted Memorization: PII Can Become Extractable Later in Training

2Overlapping Engrams Drive Assisted Memorization

3Adding PII Increases Overall and Existing Data Extraction Risk

4PII Removal Can Expose New Layers of Sensitive Data

5Decoding Methods Significantly Impact PII Extraction

Key Concepts

Assisted Memorization

Layered Memorization (Privacy Onion Effect)

Lessons

Quotes

Q&A

Recent Questions

Related Episodes

Threat Models for Memorization: Privacy, Copyright, and Everything In-Between

Alissa Violet Is Pregnant?! + Nobody Showed Up to VidCon... | Just Trish Ep. 289

“Psychopaths in Power!” - HEATED Debate on Peter Thiel, Lutnick & Scariest AI Outcome | Pomp • 440

Matt Rife | This Past Weekend w/ Theo Von #662