GPTPrompts.AIClaude
Claude
Vision.
Unlock Claude's image analysis capabilities with expert prompts for design review, chart extraction, OCR, and multimodal reasoning.
Setup and Capabilities
Claude processes JPEG, PNG, GIF, and WebP images (under 20MB) with advanced visual reasoning across design, technical, and analytical domains.
Image Processing Features
- • Upload: Direct chat or base64 via API
- • Batch: Up to 100 images via API, 5 on claude.ai
- • Capabilities: Context, spatial reasoning, OCR
- • Limitations: No precise measurements, low-res struggles, privacy-focused (no face ID)
- • Strategy: Prompt images before text, chain follow-ups for depth
General Image Analysis
Comprehensive Scene Breakdown
Analyze image in six dimensions:
- 1. Overall Scene: Main subject, mood, composition
- 2. Objects/Elements: List with positions (top-left, foreground)
- 3. Text Extraction: All readable text verbatim + location
- 4. Colors/Palette: Dominant colors, emotional impact
- 5. Relationships: Element interaction and spatial layout
- 6. Anomalies: Unusual or noteworthy details
Screenshot and UI Analysis
UI/UX Review Framework
Six evaluation areas:
- • Layout: Hierarchy, whitespace, flow direction
- • Typography: Font consistency, readability, emphasis
- • Colors: Scheme, contrast (WCAG), branding
- • Components: Buttons, links, forms—states and labels
- • Issues: Accessibility problems, clutter, confusion
- • Improvements: 5 prioritized fixes with rationale
Error Screenshot Debugging
Extract and analyze error messages systematically:
- • Full error message and stack trace
- • Probable cause identification
- • Environment details (version, OS)
- • Suggested fixes (3 ranked options)
- • Prevention strategies
Diagram and Chart Interpretation
Flowchart Decoder
Systematically reverse-engineer diagrams:
- 1. Overall purpose and process flow
- 2. Decision points and branches
- 3. Inputs and outputs mapping
- 4. Key symbols and meanings
- 5. Bottlenecks and loops
- 6. Pseudocode generation
Chart Data Extraction
Extract structured data from visual charts:
- • Chart type and axes scales
- • Key data points (top 5, extremes)
- • Trends and patterns identified
- • Annotations and legends decoded
- • CSV export of visible data
- • Strategic insights and story
Document and Text Processing
OCR and Content Extraction
Process handwritten and scanned documents:
- • Full verbatim text (line-by-line)
- • Categorization into themes
- • Diagram and sketch identification
- • Action items and decisions
- • Clarity issues flagged
- • Markdown format conversion
Slide Deck Analysis (Multi-Image)
Analyze presentation sequences (supports up to 5 images):
- • Narrative flow and logical progression
- • Visual consistency (theme, colors, fonts)
- • Key messages per slide
- • Data visualization quality
- • Presenter notes suggestions
- • Overall effectiveness rating
Advanced Vision Techniques
Comparison Mode (Multi-Image)
Analyze differences across image sequences:
- • Similarities (objects, layout, colors)
- • Differences (changes, additions)
- • Evolution or sequence patterns
- • Best/worst assessment with rationale
- • Merged insights and visual summary
Spatial Reasoning Query
Analyze layout and spatial relationships:
- • Object inventory with bounding descriptions
- • Relative positions (above, contains, adjacent)
- • Proportions and scale relationships
- • Movement and flow direction implied
- • Simplified wireframe generation
Best Practices and Optimization
| Challenge | Fix | Example |
|---|---|---|
| Low accuracy | High-res, clear images | "Focus on center panel" |
| Hallucinations | "Quote exactly" + verify | "List only visible text" |
| Complex scenes | Break into questions | Image → Objects → Relationships |
| Multi-image | Number and reference | "Image 1 vs Image 2" |