Doc Parser
Word Document Processing Guide
Document Parser API supports extraction of text from Microsoft Word documents (.doc and .docx).
Endpoint
POST /documents/parse
Word-Specific Configuration
Supported Word Features
DOC format (.doc)
DOCX format (.docx)
Text content
Tables (converted to text)
Headers and footers
Maximum file size: 50MB
Example Request
Response Format
Each page's content is separated by a page break marker:
Word-Specific Limitations
Macros are ignored
Comments are not included
Track changes are processed in their current state
Complex formatting may be simplified
Images are not processed
Last updated
Was this helpful?