PNG & JPG Parser

Image Processing Guide

The Document Parser API extracts text from JPEG, PNG, TIFF, and HEIC images using OCR.

Endpoint

POST /documents/parse

Supported Image Types

JPEG/JPG

{
    "url": "string",
    "type": "jpg",          // or "jpeg"
    "output": "plain|markdown"
}

PNG

{
    "url": "string",
    "type": "png",
    "output": "plain|markdown"
}

TIFF/TIF

{
    "url": "string",
    "type": "tiff",         // or "tif"
    "output": "plain|markdown"
}

HEIC

{
    "url": "string",
    "type": "heic",
    "output": "plain|markdown"
}
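
The request bodies above differ only in the `type` value, so a client could derive it from the file extension in the URL. The following is a minimal sketch under that assumption, not part of the API: the helper name `build_parse_request` and the extension table are hypothetical, and only the `url`, `type`, and `output` fields come from the schemas above.

```python
import posixpath
from urllib.parse import urlparse

# Extensions mapped to the "type" values listed in this guide.
SUPPORTED_TYPES = {
    ".jpg": "jpg",
    ".jpeg": "jpeg",
    ".png": "png",
    ".tif": "tif",
    ".tiff": "tiff",
    ".heic": "heic",
}


def build_parse_request(url: str, output: str = "plain") -> dict:
    """Build a request body for POST /documents/parse (hypothetical helper)."""
    if output not in ("plain", "markdown"):
        raise ValueError(f"unsupported output format: {output!r}")
    # Look only at the URL path so a query string does not hide the extension.
    extension = posixpath.splitext(urlparse(url).path)[1].lower()
    try:
        image_type = SUPPORTED_TYPES[extension]
    except KeyError:
        raise ValueError(f"unsupported image extension: {extension!r}") from None
    return {"url": url, "type": image_type, "output": output}
```

For example, `build_parse_request("https://example.com/scan.jpg", "markdown")` would produce the body used in the example request. URLs without a recognizable extension would still need an explicit `type`.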

Image Processing Features

Text extraction using OCR
Multi-page support for TIFF
Various image resolutions supported (minimum 300 DPI)
Color, grayscale, and black/white images
Maximum file size: 50MB

Example Request

curl --location 'https://api.yetanotherapi.com/documents/parse' \
--header 'x-api-key: YOUR_API_KEY' \
--header 'Content-Type: application/json' \
--data '{
    "url": "https://example.com/scan.jpg",
    "type": "jpg",
    "output": "markdown"
}'
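
The same request could be issued from Python using only the standard library. This is a sketch rather than an official client: the function names and the 60-second timeout are assumptions, while the URL, headers, and body fields mirror the curl example above.

```python
import json
import urllib.request

API_URL = "https://api.yetanotherapi.com/documents/parse"


def make_parse_request(api_key: str, image_url: str, image_type: str,
                       output: str = "markdown") -> urllib.request.Request:
    """Prepare (but do not send) the POST request from the curl example."""
    body = json.dumps({"url": image_url, "type": image_type, "output": output})
    return urllib.request.Request(
        API_URL,
        data=body.encode("utf-8"),
        headers={"x-api-key": api_key, "Content-Type": "application/json"},
        method="POST",
    )


def parse_image(api_key: str, image_url: str, image_type: str,
                output: str = "markdown", timeout: float = 60.0) -> dict:
    """Send the request and decode the JSON response body."""
    request = make_parse_request(api_key, image_url, image_type, output)
    # urlopen raises urllib.error.HTTPError on 4xx/5xx responses.
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)
```

Calling `parse_image("YOUR_API_KEY", "https://example.com/scan.jpg", "jpg")` would send the request shown above. Building the request separately from sending it keeps the payload easy to inspect without network access.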

Response Format

For single-page images:

{
    "requestId": "string",
    "status": "COMPLETED",
    "data": "Extracted text content"
}

For multi-page TIFF:

{
    "requestId": "string",
    "status": "COMPLETED",
    "data": "Page 1 content\n\n=== Page Break ===\n\nPage 2 content"
}
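
Because a multi-page result arrives as one string, a client may want to split it back into pages. The sketch below assumes the separator always appears exactly as in the example above (`=== Page Break ===` surrounded by blank lines) and that any status other than `COMPLETED` means no usable text. Neither is confirmed by this guide, and `split_pages` is a hypothetical helper name.

```python
# Separator as it appears in the multi-page TIFF example (assumed to be stable).
PAGE_BREAK = "\n\n=== Page Break ===\n\n"


def split_pages(response: dict) -> list[str]:
    """Return the extracted text as a list with one entry per page."""
    status = response.get("status")
    if status != "COMPLETED":
        raise ValueError(f"response not completed: status={status!r}")
    # A single-page result contains no separator and yields a one-item list.
    return response["data"].split(PAGE_BREAK)
```

Applied to the multi-page example, this would yield one entry for "Page 1 content" and one for "Page 2 content". Applied to the single-page example, it would return a list with a single entry.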

Image-Specific Limitations

Image quality affects OCR accuracy
Minimum resolution required: 300 DPI
Handwritten text may not be accurately recognized
Complex backgrounds can affect accuracy
Text must be properly oriented
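
The 300 DPI minimum can be checked before upload when the physical size of the scanned page is known, since DPI is pixel count divided by length in inches. The following is an illustrative sketch only: the function names are hypothetical, and how the API itself measures resolution is not described in this guide.

```python
MIN_DPI = 300  # minimum resolution stated in this guide


def effective_dpi(pixels: int, inches: float) -> float:
    """Dots per inch along one dimension: pixel count divided by length."""
    if inches <= 0:
        raise ValueError("physical length must be positive")
    return pixels / inches


def meets_minimum_resolution(width_px: int, height_px: int,
                             width_in: float, height_in: float) -> bool:
    """Check that both dimensions reach MIN_DPI for the given page size."""
    return (effective_dpi(width_px, width_in) >= MIN_DPI
            and effective_dpi(height_px, height_in) >= MIN_DPI)
```

For a US Letter page (8.5 by 11 inches), a 2550 by 3300 pixel scan works out to exactly 300 DPI in both dimensions, while a 1275 by 1650 pixel scan is only 150 DPI and would fall below the minimum.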
