Back to blog

Gmail Will Use Your Emails to Train AI: Understand the Controversy and How to Protect Yourself

Hello HaWkers, recent news has stirred the tech community and raised intense debates about digital privacy. Google announced changes to its terms of service that allow the use of Gmail email content to train their artificial intelligence models.

After the negative backlash, the company issued clarifications, but concerns persist. What exactly is happening and how can you protect your data?

What Was Announced

The Google terms of service update includes clauses that allow the use of user content to "improve AI services." The vague language left many users concerned.

Main Points of the Announcement

  • Usable data: Emails, attachments, and metadata can be processed
  • Declared purpose: Improve features like Smart Compose and summaries
  • Opt-out available: Users can disable in settings
  • Timeline: Changes take effect in January 2026
  • Scope: Affects personal and Workspace accounts (with different settings)

Context: This change occurs while Google accelerates Gemini development and competes directly with OpenAI and Anthropic in the AI market.

Google's Response

After the initial negative backlash, especially in tech communities, Google issued a statement trying to clarify the situation.

What Google Claims

Guarantees offered:

  • Data will not be used to train public models
  • Processing is internal and secure
  • Users can opt out
  • Workspace Enterprise has separate controls

What Experts Question

Concerns raised:

  • Terms language is still ambiguous
  • "Improve services" can have broad interpretation
  • Opt-out is not the default setting
  • Google's privacy track record doesn't inspire total confidence

💡 Important point: The difference between "using data to improve services" and "training AI models" can be subtle but significant in legal and practical terms.

Why This Matters For Developers

If you're a developer, you probably use Gmail for professional communications, receive codes by email, authentication tokens, and discuss projects with colleagues.

Sensitive Data at Risk

What might be in your emails:

  • API credentials and tokens
  • Discussions about system architecture
  • Code shared in attachments
  • Client information
  • Contracts and confidential documents

Implications For Companies

Compliance considerations:

  • GDPR/CCPA may be affected
  • NDA contracts may be compromised
  • Client data in emails
  • Sector regulations (healthcare, finance)
Sector Risk Regulation
Healthcare High HIPAA, GDPR
Finance High PCI-DSS, SOX
Tech Medium NDAs, IP
General Medium GDPR, CCPA

How to Disable Data Use For AI

If you want to ensure your data isn't used to train AI, here's a step-by-step guide.

On Personal Gmail

Step by step:

  1. Access myaccount.google.com
  2. Go to "Data and privacy"
  3. Scroll to "History and personalization"
  4. Find "Web & App Activity"
  5. Uncheck "Include voice and audio activity"
  6. Look for options related to "AI" or "Machine Learning"

Additional settings:

  • Disable "Ad personalization"
  • Review "Data sharing"
  • Configure "Auto-delete" for old data

On Google Workspace

For administrators:

  1. Access the Admin Console
  2. Go to Apps > Google Workspace > Settings
  3. Look for "AI and Machine Learning"
  4. Configure data usage policies
  5. Set defaults for the entire organization

Note: Workspace Enterprise has more granular controls than personal accounts.

Alternatives to Gmail

If you decide to migrate away from the Google ecosystem, there are privacy-focused alternatives.

Privacy-Focused Providers

ProtonMail:

  • End-to-end encryption
  • Servers in Switzerland
  • Open-source code
  • Free with paid plans

Tutanota:

  • Complete encryption
  • Servers in Germany
  • Calendar and contacts included
  • Affordable price

Fastmail:

  • No tracking or AI
  • Excellent filters and organization
  • Custom domain support
  • Productivity focus

Alternatives Comparison

Service Starting Price Encryption Storage
ProtonMail Free E2E 500MB
Tutanota Free E2E 1GB
Fastmail $3/month In transit 2GB
Gmail Free In transit 15GB

The Larger Discussion About Privacy and AI

This episode is part of a concerning trend in the tech market: massive data collection to train AI models.

Industry Patterns

Common practices:

  • Meta uses Instagram/Facebook posts for AI
  • Microsoft used LinkedIn data
  • Apple promises not to use iCloud data
  • OpenAI trained with internet data

The Consent Problem

Legal and ethical questions:

  • Terms of service are updated unilaterally
  • Opt-out is not the default
  • Users rarely read complete terms
  • Limited alternatives for essential services

🔥 Reflection: We're at an inflection point where personal data has become raw material for AI. The decisions we make now will define the balance between innovation and privacy.

Necessary Regulation

What regulators are considering:

  • Explicit consent for AI use
  • Right to exclusion from training datasets
  • Transparency about data use
  • Significant penalties for violations

Digital Privacy Best Practices

Regardless of what you decide about Gmail, some practices help protect your online privacy.

Basic Digital Hygiene

Recommended actions:

  1. Use different emails for different services
  2. Avoid sending sensitive information by email
  3. Use encryption when possible (PGP)
  4. Review app permissions regularly
  5. Configure two-factor authentication

For Developers Specifically

Security practices:

  • Never send credentials by email
  • Use password managers (1Password, Bitwarden)
  • API tokens should go through secure channels
  • Sensitive code should use private repositories
  • Consider self-hosted solutions for team communication

Recommended Tools

Secure communication:

  • Signal for messages
  • Keybase for encrypted chat
  • Matrix/Element for teams
  • Mattermost self-hosted

Password management:

  • 1Password
  • Bitwarden (open-source)
  • KeePassXC (local)

Conclusion

The Gmail controversy about using emails to train AI is an important reminder that digital privacy requires constant vigilance. Even if Google has backed down on some points, the trend of using user data for AI is clear and probably irreversible.

For developers, the message is clear: treat email as a potentially compromised channel and adopt appropriate security practices. Consider privacy-focused alternatives for sensitive communications and stay informed about changes in terms of service.

If you're interested in security and privacy, I recommend checking out another article: Web Security: OWASP Top 10 in 2025 where you'll discover the main vulnerabilities and how to protect yourself.

Let's go! 🦅

Comments (0)

This article has no comments yet 😢. Be the first! 🚀🦅

Add comments