Skip to content

URL Extractor

Extract every URL from a Markdown document — filtered and deduped

Pull links, images, mailto addresses, and bare URLs from any Markdown content. Filter by kind, dedupe with occurrence counts, and export as a Markdown list, plain text, CSV, or JSON.

Filters

URLKind×
https://example.com/docsexternal2
https://api.example.comexternal1
https://cdn.example.com/logo.pngimage1
mailto:hi@example.commailto1
./notes.mdrelative1
https://github.com/example/repoexternal1

How to use the URL extractor

  1. Paste Markdown or load a .md file.
  2. Toggle the filters to include / exclude mailto, images, and relative paths.
  3. Pick a format (Markdown list, plain text, CSV, or JSON).
  4. Copy or download the result.

Why extract URLs?

  • SEO audits. Pull every outbound link to verify they all point to expected destinations.
  • Link-bank for newsletters. A long post often references 20+ resources; extract them once for a "links" section.
  • Bug reports. Capture the URL footprint of a doc before refactoring to confirm nothing is dropped.
  • AI prompts. Pasting a flat URL list into a research assistant is more reliable than asking it to skim a long document.

Output examples

- https://example.com/docs
- https://example.com/api
- mailto:hi@example.com

CSV:

url,kind,count,lines
"https://example.com/docs",external,3,"4;12;18"
"mailto:hi@example.com",mailto,1,"7"

Companion tools

Privacy & data

Extraction is a small client-side regex pass. We don't upload or log content.

Frequently asked questions

Which URLs does it find?
Markdown link syntax `[text](url)`, image syntax `![alt](url)`, mailto: links, and bare http(s) URLs in body text. The extractor lists each unique URL with its kind and how many times it appears.
Why dedupe?
Many documents reference the same URL multiple times (a 'docs' link cited from several sections). Deduplication keeps the output compact and shows you the unique audit surface with occurrence counts.
What export formats are available?
Markdown bullet list (paste into another doc), plain text (one URL per line), CSV (url, kind, count, lines — for spreadsheets), and JSON (for scripting). Toggle the format in the top-right select.
Will it follow links?
No — this tool only extracts. For follow-up validation, use the [Markdown Link Checker](/tools/markdown-link-checker) which classifies and pings links.
How do I get just the external URLs?
Uncheck the mailto, images, and relative filters. The kind column shows what's left.
Is my Markdown uploaded?
No. Extraction runs entirely in your browser.