PDF Parser
PDF Document Processing Guide
The Document Parser API extracts text content from PDF files.
Endpoint
POST /documents/parse
PDF-Specific Configuration
Supported PDF Features
Single and multi-page PDFs
Text-based PDFs
Scanned PDFs (using OCR)
Password-protected PDFs are not supported
Maximum file size: 50MB
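The size limit and the lack of support for password-protected files can be checked on the client before uploading. The following is a minimal sketch, not part of the API: the function name `preflight_problems` is hypothetical, "50MB" is read conservatively as 50 × 10^6 bytes (the service may count in MiB instead), and the encryption test is a rough byte-search heuristic that can misclassify some files.

```python
# Assumption: "50MB" is interpreted as 50 * 10**6 bytes (the stricter reading).
MAX_BYTES = 50 * 1000 * 1000


def preflight_problems(data: bytes) -> list[str]:
    """Return reasons an upload is likely to be rejected (empty if none found)."""
    problems = []
    if len(data) > MAX_BYTES:
        problems.append("file exceeds the 50MB limit")
    # Heuristic: encrypted PDFs reference an /Encrypt dictionary in the trailer.
    # The byte string could also occur elsewhere, so treat a hit as a warning.
    if b"/Encrypt" in data:
        problems.append("file appears to be password-protected")
    return problems
```

Calling `preflight_problems(open("report.pdf", "rb").read())` returns an empty list when neither check fires; the server remains the authority on what it accepts.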
Example Request
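As an illustration only, a request to the endpoint might be built as below. Everything except the method and path is an assumption: the host `api.example.com`, the bearer-token header, and sending the PDF as a raw `application/pdf` body are all hypothetical, and the real API may expect multipart form data or a different authentication scheme.

```python
import urllib.request

# Hypothetical host; this guide specifies only the path /documents/parse.
BASE_URL = "https://api.example.com"


def build_parse_request(pdf_bytes: bytes, api_key: str) -> urllib.request.Request:
    """Build (but do not send) a POST request carrying the PDF bytes."""
    # Assumptions: raw PDF body and bearer-token auth. Adjust to the real API.
    return urllib.request.Request(
        url=f"{BASE_URL}/documents/parse",
        data=pdf_bytes,
        method="POST",
        headers={
            "Content-Type": "application/pdf",
            "Authorization": f"Bearer {api_key}",
        },
    )
```

Once the actual host and authentication details are confirmed, the request can be sent with `urllib.request.urlopen(build_parse_request(data, key))`.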
Response Format
Each page's content is separated by a page break marker:
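Because pages are delimited by a marker, the extracted text can be split back into per-page strings on the client. The sketch below is one possible approach and takes the marker as an argument, since its exact string depends on what the API returns; the helper name `split_pages` and the `<<PAGE_BREAK>>` value in the usage note are placeholders, not the API's actual marker.

```python
def split_pages(text: str, marker: str) -> list[str]:
    """Split extracted text into per-page strings on the page break marker."""
    # Strip surrounding whitespace so newlines around the marker are not kept.
    return [page.strip() for page in text.split(marker)]
```

For example, `split_pages(extracted_text, "<<PAGE_BREAK>>")` would yield one string per page, in document order, if that placeholder were the marker in use.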
PDF-Specific Limitations
Forms and fillable fields are processed as static text
Complex layouts may affect text ordering
Headers and footers are included in the extracted text
Images within PDFs are not processed
PDF versions supported: 1.0 to 2.0
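The supported version range can be checked from the file header, which begins with `%PDF-` followed by the version number. This is a minimal sketch with hypothetical helper names; it reads only the header, whereas a PDF's document catalog may declare a later version through its `/Version` entry, so the result is an approximation.

```python
import re


def header_version(data: bytes):
    """Return (major, minor) from the %PDF-x.y header, or None if absent."""
    match = re.match(rb"%PDF-(\d)\.(\d)", data)
    if match is None:
        return None
    return int(match.group(1)), int(match.group(2))


def version_supported(data: bytes) -> bool:
    """True if the header version falls within the documented 1.0 to 2.0 range."""
    version = header_version(data)
    # Tuple comparison orders versions as (major, minor).
    return version is not None and (1, 0) <= version <= (2, 0)
```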