Structured article extraction, PDF parsing, and AI-ready summaries through one fast API.
curl "https://instaparser.com/api/1/article" \
-H "Authorization: Bearer YOUR_API_KEY" \
-H "Content-Type: application/json" \
-d '{"url": "https://example.com/post"}'
from instaparser import InstaparserClient
client = InstaparserClient("YOUR_API_KEY")
article = client.Article("https://example.com/post")
print(article.title) # "How We Built Our Search Engine"
print(article.content) # "<p>Clean, structured HTML...</p>"
import { InstaparserClient } from 'instaparser-api';
const client = new InstaparserClient({'apiKey': 'YOUR_API_KEY'});
const article = await client.article({'url': 'https://example.com/post'});
console.log(article.title); // "How We Built Our Search Engine"
console.log(article.content); // "<p>Clean, structured HTML...</p>"
{
"title": "How We Built Our Search Engine",
"author": "Jane Kim",
"date": "2026-02-15T09:00:00Z",
"content": "<p>Clean, structured HTML of the article body...</p>",
"word_count": 1847,
"thumbnail": "https://example.com/images/hero.jpg",
"excerpt": "A deep dive into building search infrastructure...",
"url": "https://example.com/post",
"domain": "example.com"
}
Everything you need to extract, parse, and summarize web content. All included in every plan.
Structured article content from any URL — title, author, body, images, metadata. No selectors. No scraping. Just content.
POST /api/1/article
Upload a PDF or pass a URL. Get clean, structured text back. Handles scanned documents, multi-column layouts, and tables.
POST /api/1/pdf
GET /api/1/pdf?url={url}
Get a concise summary of any URL's content in a single API call. Great for previews, link cards, and trimming LLM context windows.
POST /api/1/summary
Other APIs fight with bot detection and proxies to access pages. Instaparser focuses on understanding page content — so you get clean, accurate structured data without managing infrastructure.
Built on the same engine that powers Instapaper — refined across billions of articles since 2008. Edge cases you'll encounter on the messy web, we've already solved.
Every response includes title, author, date, body content, images, and metadata as structured JSON. No post-processing, no markdown wrangling, no guessing.
Simple volume-based tiers. All three APIs included in every plan. No per-feature charges, no credit multipliers, no surprises.
See what the API returns. No signup required.
Built on the engine that powers Instapaper — serving millions of readers since 2008.
All plans include all three APIs. Pick the volume you need.