Step 3: Upload & Process Document - Planning AI Walkthrough
Upload your planning application and watch AI extract information in real-time
Great! You've deployed the demo
Now let's walk through what you just deployed and see it in action.
Start WalkthroughChoose your next step
Generate Evidence Pack
Create your business case documentation with what you've learned.
Generate Evidence PackWalkthrough progress
Step 3 of 4 • 2 minutes
Upload & Process Document
Upload the planning application PDF and watch AI automatically extract applicant information in real-time.
Screenshot updating - please check back soon
Screenshot updating - please check back soon
Expected outcome
- Document uploads successfully
- Processing progress indicator shows real-time status
- Extraction completes in 8-20 seconds
- Extracted fields appear automatically
Upload your document
-
Locate the upload area
On the document processing interface, find the upload section. It should say "Drag and drop your planning application here, or click to browse".
-
Upload the PDF
You can either:
- Drag and drop: Drag the PDF file from your downloads folder and drop it on the upload area
- Click to browse: Click the upload area, then select the PDF from your file browser
-
Watch the upload progress
You'll see a progress bar as the file uploads to AWS S3. For a 1-4 MB file, this typically takes 1-3 seconds.
-
Wait for AI processing
After upload completes, AI extraction begins automatically. You'll see:
- Status changes to "Processing with Amazon Textract..."
- Animated spinner or progress indicator
- Estimated time remaining (8-20 seconds depending on document)
-
Extraction completes
When done, you'll see "Extraction complete!" and the extracted fields appear on screen.
What's happening behind the scenes:
- Your PDF uploads to a secure S3 bucket
- Lambda function triggers Amazon Textract API
- Textract performs OCR and identifies text regions
- AI models extract key-value pairs (applicant name, address, etc.)
- Results return to your browser in structured JSON
What to watch for
Pay attention to these impressive capabilities during processing:
Multi-page processing
AI reads all pages of the PDF, not just page 1. It finds information wherever it appears in the document.
UK address recognition
Accurately identifies UK postcodes, street addresses, and location formats. Handles "Land adjacent to..." descriptions.
Context understanding
Distinguishes between applicant name, agent name, site address, and billing address even when multiple names/addresses appear.
Speed at scale
Processing 200 applications/year takes 50 hours with AI vs 150 hours manually. No degradation in speed regardless of volume.
Processing time expectations
Different document types have different processing times:
| Document Type | File Size | Processing Time |
|---|---|---|
| House Extension | 1.2 MB | 8-12 seconds |
| Change of Use | 2.8 MB | 12-18 seconds |
| New Dwelling | 3.5 MB | 15-20 seconds |
Troubleshooting
File too large error (max 10MB)
If you see "File too large" error:
- Check file size in file properties - must be under 10 MB
- Use a PDF compression tool to reduce file size
- Try splitting very large multi-page applications
- Remove high-resolution images or photos if possible
- Export at lower quality from your planning software
Our sample documents are all under 4 MB and should work without issues.
Wrong format error (PDF, JPEG, PNG only)
If you see "Unsupported file format" error:
- Only PDF, JPEG, and PNG files are supported
- Convert Word documents (.doc, .docx) to PDF first
- Check the file extension is .pdf, .jpg, .jpeg, or .png
- Don't upload compressed files (.zip, .rar)
- Ensure you downloaded the PDF, not just opened it in browser
Processing timeout (over 30 seconds)
If processing takes longer than 30 seconds or stalls:
- Refresh the page and try uploading again
- Try a different sample document to rule out document corruption
- Check AWS CloudWatch logs for Lambda timeout errors
- Verify Textract API quotas haven't been exceeded
- Wait 2-3 minutes then retry - may be temporary API slowness
Normal processing is 8-20 seconds. Anything over 30 seconds indicates an issue.
API errors (500, 503 responses)
If you see server error messages:
- Wait 1-2 minutes and retry - may be temporary service issue
- Check AWS Service Health Dashboard
- Verify Lambda function has IAM permissions for Textract
- Check Lambda execution role includes
textract:AnalyzeDocument - Monitor CloudWatch Logs for detailed error messages
- Contact support if errors persist
Upload succeeds but no extracted fields
If upload completes but extraction returns empty:
- Check the document contains machine-readable text (not scanned handwriting)
- Verify PDF isn't password-protected or encrypted
- Try opening the PDF locally to confirm it's not corrupted
- Check CloudWatch logs for Textract response errors
- Try uploading one of our sample documents to rule out service issues