Session 4 • 20 minutes + Q&A

Vision Models for Medical Imaging

AI that can see: analyze images, interpret diagrams, and assist with visual documentation

Image Analysis

• Describe medical images in natural language
• Identify anatomical structures
• Compare before/after images
• Detect patterns and abnormalities

Diagram Interpretation

• Explain complex surgical diagrams
• Analyze flowcharts and decision trees
• Interpret data visualizations
• Extract info from figures in papers

Visual Q&A

• "What valve is shown in this echo?"
• "Describe the surgical approach"
• "What are the key findings in this CXR?"
• "Extract data from this graph"

Clinical Applications

Real-world uses for cardiothoracic surgeons

Imaging Documentation

Use Case: Generating structured reports from imaging studies

"Analyze this chest X-ray and provide a structured report including: cardiac silhouette size, lung fields, mediastinal contours, and any abnormalities. Use standard radiological terminology."

Faster preliminary readsConsistent reportingEducational tool

Surgical Planning

Use Case: Analyzing pre-operative imaging for surgical approach

"Based on this CT angiography, describe the coronary anatomy including: number of vessels, degree of stenosis, calcification burden, and suitability for bypass grafting vs. PCI."

Comprehensive pre-op assessmentTeam communicationPatient education

Research Figure Analysis

Use Case: Extracting data from published figures

"Extract the data points from this Kaplan-Meier survival curve and summarize the key findings including median survival times and p-values."

Rapid literature reviewData extraction for meta-analysisFigure interpretation

Vision AI Tools

GPT-4 Vision

Excellent general image understanding, detailed descriptions

ChatGPT Plus

Claude 3 with Vision

Careful, nuanced analysis, good at complex instructions

Claude Pro

Gemini 2.0 Flash

Fast multimodal processing, good for batch analysis

Google AI Studio

⚠️ Critical Limitations & Safety

Not FDA-approved for diagnosis: These are general-purpose AI tools, not medical devices

Variable accuracy: Performance depends on image quality and specific task

Hallucination risk: AI may confidently describe features that aren't present

Privacy critical: Remove all patient identifiers before uploading to public AI tools

✅ Appropriate Use Cases

✓ Educational tool for trainees
✓ Research figure analysis
✓ Generating draft reports for review
✓ Second opinion for complex cases
✓ Patient education material creation

❌ Inappropriate Use Cases

✗ Primary diagnostic tool
✗ Replacing radiologist interpretation
✗ Making treatment decisions without verification
✗ Processing identifiable patient images on public tools
✗ Automated clinical decision-making

Try It Yourself: Vision AI Playground

Upload de-identified medical images and see how vision models analyze visual content

Suggested Exercise:

Find a de-identified chest X-ray or CT scan (or use a published image from a paper)
Upload to the AI Playground and ask: "Describe the key findings in this image"
Try a more specific prompt: "Analyze this image and identify: [specific structures/findings]"
Compare responses from GPT-4 Vision, Claude, and Gemini
Verify AI descriptions against your own clinical assessment

Privacy Reminder: Only upload de-identified images or images from published literature. Never upload identifiable patient data to public AI tools.

Key Takeaways

✓

Vision AI extends capabilities beyond text to images, diagrams, and visual data

✓

Multiple applications in clinical practice, research, and education

✓

Verification is essential—never rely solely on AI for clinical decisions

✓

Privacy matters—remove identifiers and follow institutional policies

Ready to Go Deeper?

This 90-minute workshop is just the beginning. Our comprehensive full-day program covers advanced applications, hands-on projects, and personalized consultation for your research and clinical workflows.