Email Extractor
Extract all email addresses from any text instantly.
Why use it
Clean and extract emails in seconds
RFC 5322 compliant
Detects all valid email formats including new TLDs and subdomains.
100% private
Your text never leaves the browser. No server submissions, no logs.
No duplicates
Automatic deduplication. Each email appears only once in the result.
Instant
Extraction in milliseconds, regardless of text length.
How it works
Three steps, no hassle
Paste your text
Paste the content you want to extract emails from: plain text, HTML, source code, lists.
Automatic extraction
The extractor detects all valid email addresses instantly using the RFC 5322 pattern.
Copy the clean list
Get the deduplicated email list, one per line, ready to use.
FAQ
Got questions?
The extractor uses a pattern based on RFC 5322, the standard defining the format of Internet email addresses. The pattern detects the local part (before @), which can contain letters, numbers, dots, hyphens, and plus signs, followed by @ and the domain. The domain can include subdomains and accepts any valid TLD, including newer TLDs longer than 3 characters like .academy, .photography, or .email.
The extractor recognizes addresses with subdomains in the domain part (user@mail.company.com) and with TLDs of any length (user@example.photography). It is not limited to .com, .net, .org — it accepts all ICANN-registered TLDs, including the 1500+ generic and country-code TLDs currently available. Addresses with Unicode characters in the domain (internationalized) are not detected, as they require prior Punycode conversion.
Yes. The email extractor automatically removes duplicate addresses from the result. Comparison is case-insensitive in practice: while the local part is technically case-sensitive per RFC 5321, all modern mail servers treat it as case-insensitive. The final result contains each unique address exactly once, in order of appearance.
Processing happens entirely in your browser — the text you paste is never sent to any server. This is especially important when working with text containing personal information. Note that extracting emails from websites without permission may violate the site's privacy policy and, in the European Union, the GDPR, which protects email addresses as personal data and limits their collection and use without consent.
The most common use cases are: cleaning and consolidating contact lists from multiple sources, extracting emails from documents exported from CRMs or ERPs, collecting addresses from newsletter replies or form submissions, analyzing log files for user emails, checking what emails exist in a text block before importing it, and extracting contacts from plain text files or CSV exports with irregular formatting.
Email format per RFC 5322 and email scraping ethics
The format of email addresses is defined by RFC 5322 (Internet Message Format), published in 2008 as an update to RFC 2822, which itself updated the original RFC 822 from 1982. The specification distinguishes the local part (before @) from the domain. The local part can contain alphanumeric characters and special characters like dots, hyphens, plus signs, and quotes, though in practice modern servers only accept a subset. The domain must be a valid hostname with at least one dot.
Bulk email collection (email scraping) carries significant legal and ethical implications. GDPR in the European Union classifies email addresses as personal data and requires a legal basis for processing. The CAN-SPAM Act in the United States regulates the use of collected emails for marketing. The Computer Fraud and Abuse Act (CFAA) has been used to prosecute unauthorized data scraping. The most ethical use is cleaning and processing data you already have authorization for.
In data cleaning workflows, the email extractor is a common tool in ETL (Extract, Transform, Load) pipelines. The most common input formats are: CRM CSV exports with emails mixed in notes fields, text files with copied correspondence, free-form submissions, and JSON or XML database exports. Deduplication is the most critical step before importing into email marketing tools like Mailchimp, ActiveCampaign, or HubSpot.