Structured JSON
This guide demonstrates how to create prompts that return precisely structured JSON responses from the LLM Web Scraper API. These prompts are ideal when you need consistent, predictable data structures for integration with other systems.
Key Principles for Structured Prompts
Explicitly Define Keys
Specify exact key names
Define expected data types
Indicate required vs optional fields
Specify Nesting Structure
Define object hierarchies
Specify array structures
Indicate relationships between objects
Include Validation Rules
Specify allowed values
Define number ranges
Indicate format requirements
E-commerce Examples
Product Details Extraction
{
"url": "https://example.com/product",
"prompt": "Extract product information using these exact keys and types: 'productName' (string), 'manufacturer' (string), 'price' (object with keys: 'current' (number), 'original' (number), 'currency' (string)), 'availability' (string: either 'In Stock', 'Out of Stock', or 'Pre-order'), 'specifications' (array of objects with keys: 'name' (string), 'value' (string)), 'features' (array of strings), 'shipping' (object with keys: 'free' (boolean), 'methods' (array of objects with keys: 'name' (string), 'price' (number), 'duration' (string)))"
}
Example Response:
{
"productName": "Professional DSLR Camera X100",
"manufacturer": "PhotoTech",
"price": {
"current": 1299.99,
"original": 1499.99,
"currency": "USD"
},
"availability": "In Stock",
"specifications": [
{
"name": "Sensor Type",
"value": "Full Frame CMOS"
},
{
"name": "Resolution",
"value": "24.2 MP"
},
{
"name": "Shutter Speed",
"value": "1/8000 to 30 sec"
}
],
"features": [
"4K video recording",
"Built-in WiFi",
"Weather-sealed body",
"Dual card slots"
],
"shipping": {
"free": true,
"methods": [
{
"name": "Standard",
"price": 0,
"duration": "3-5 business days"
},
{
"name": "Express",
"price": 29.99,
"duration": "1-2 business days"
}
]
}
}
Multiple Product Comparison
{
"url": "https://example.com/category",
"prompt": "Extract information for all products on the page using this structure: 'products' (array of objects), each product must have: 'id' (string), 'name' (string), 'brand' (string), 'category' (string), 'price' (number), 'rating' (object with keys: 'score' (number 0-5), 'count' (number)), 'specs' (object with keys matching exactly what's found in the product specs table), 'comparisonPoints' (array of objects with keys: 'feature' (string), 'value' (string), 'relativeMerit' (string: 'better', 'worse', or 'same'))"
}
Example Response:
{
"products": [
{
"id": "CAM-X100",
"name": "DSLR X100",
"brand": "PhotoTech",
"category": "Professional Cameras",
"price": 1299.99,
"rating": {
"score": 4.8,
"count": 245
},
"specs": {
"sensorType": "Full Frame",
"resolution": "24.2MP",
"weight": "780g"
},
"comparisonPoints": [
{
"feature": "Image Quality",
"value": "Exceptional",
"relativeMerit": "better"
},
{
"feature": "Battery Life",
"value": "1200 shots",
"relativeMerit": "same"
}
]
}
]
}
Article Examples
Structured Article Analysis
{
"url": "https://example.com/article",
"prompt": "Analyze the article using this exact structure: 'metadata' (object with keys: 'title' (string), 'author' (string), 'publishDate' (string in YYYY-MM-DD format), 'category' (string), 'readingTime' (number in minutes)), 'content' (object with keys: 'summary' (string), 'mainPoints' (array of strings), 'conclusions' (array of strings)), 'analysis' (object with keys: 'tone' (string: 'positive', 'negative', 'neutral'), 'audience' (string), 'expertise_level' (string: 'beginner', 'intermediate', 'advanced')), 'references' (array of objects with keys: 'text' (string), 'source' (string))"
}
Example Response:
{
"metadata": {
"title": "The Future of AI in Healthcare",
"author": "Dr. Sarah Johnson",
"publishDate": "2024-01-15",
"category": "Technology",
"readingTime": 12
},
"content": {
"summary": "Comprehensive analysis of AI's growing role in healthcare delivery and diagnosis",
"mainPoints": [
"AI improving diagnostic accuracy by 40%",
"Reduction in administrative workload",
"Enhanced patient monitoring capabilities",
"Cost savings through automation"
],
"conclusions": [
"AI will be integral to future healthcare",
"Human oversight remains crucial",
"Cost barriers decreasing rapidly"
]
},
"analysis": {
"tone": "positive",
"audience": "Healthcare professionals",
"expertise_level": "intermediate"
},
"references": [
{
"text": "WHO AI in Healthcare Report 2023",
"source": "World Health Organization"
},
{
"text": "AI Diagnostic Accuracy Study",
"source": "Medical AI Journal"
}
]
}
FAQ Page Structured Extraction
{
"url": "https://example.com/faq",
"prompt": "Extract FAQ content using this structure: 'categories' (array of objects with keys: 'name' (string), 'description' (string), 'questions' (array of objects with keys: 'question' (string), 'answer' (string), 'tags' (array of strings), 'related_questions' (array of numbers referencing question indices))). Each answer should be concise and formatted as a single paragraph."
}
Example Response:
Last updated
Was this helpful?