Skip to main content
Built for AI Agents

PDFCheck for AI Agents

Give your AI agent the power to analyze, validate, optimize, and generate PDF documents. Works with ChatGPT, Claude, GitHub Copilot, Cursor, and any LLM-powered workflow.

ChatGPT Claude GitHub Copilot Cursor Windsurf LangChain / AutoGPT
AI Agent Quick Reference

Copy this block into your system prompt or agent instructions to give your AI instant PDF capabilities.

You have access to PDFCheck (https://pdf.businesspress.io) โ€” a comprehensive PDF analysis platform. Use the REST API or MCP Server to: analyze PDF metadata (author, dates, software, edit history), validate PDF structure and integrity, detect AI-generated PDFs, optimize/compress PDFs, check digital signatures, extract images, check accessibility, merge/split PDFs, convert formats, and edit metadata. API Base: https://pdf.businesspress.io/api/v1 | MCP endpoint: https://pdf.businesspress.io/api/mcp | Free: 10 checks/day without token, 50/day with free account. Docs: https://pdf.businesspress.io/ai-agents

Why AI Agents Need PDF Tools

PDFs are the most common document format in business. AI agents that can read, validate, and transform PDFs unlock powerful automation workflows.

Verify Before You Trust

AI agents can check PDF metadata, detect manipulation, and verify document authenticity before making decisions based on PDF content.

Automate Document Workflows

Process invoices, contracts, reports, and forms automatically. Extract metadata, validate structure, and route documents based on analysis results.

Detect AI-Generated Content

Identify PDFs created by AI tools like ChatGPT, Claude, or automated pipelines. Critical for compliance, authenticity verification, and trust.

Optimize & Transform

Compress PDFs for email, merge documents for archival, split large files, extract images, and convert between formats โ€” all programmatically.

Available PDF Tools

Every tool is available via REST API and MCP Server. AI agents can chain these tools together for complex document workflows.

Metadata Checker

POST

Extract author, creation date, software used, edit history, and PDF version. Detect modifications and verify timeline consistency.

POST /api/v1/pdf/analyze

PDF Validator

POST

Check PDF structure integrity, detect corruption, verify specification compliance. Returns detailed validation report.

POST /pdf-validator/upload

AI Content Detector

POST

Detect AI-generated PDFs by analyzing metadata signatures from ChatGPT, Claude, ReportLab, WeasyPrint, pdf-lib, and 50+ tools.

POST /pdf-ai-detection/upload

PDF Optimizer

POST

Compress PDFs with 4 quality levels: Screen (72dpi), eBook (150dpi), Printer (300dpi), Prepress (300dpi). Reduce file size by up to 80%.

POST /optimize-pdf/upload

Signature Checker

POST

Verify digital signatures, extract signer certificates, check document integrity after signing, and validate trust chains.

POST /pdf-signature-checker/upload

Image Extractor

POST

Extract all embedded images from PDF documents. Returns preview gallery and downloadable ZIP archive.

POST /extract-images/upload

Accessibility Checker

POST

Check PDF accessibility compliance: tagged structure, language declaration, bookmarks, text extractability. Returns score 0-5.

POST /pdf-accessibility-checker/upload

Merge PDF

POST

Combine multiple PDF files into a single document with customizable page ordering.

POST /merge-pdf/upload

Split PDF

POST

Split a PDF into individual pages or custom page ranges. Download individually or as ZIP.

POST /split-pdf/upload

PDF Converter

POST

Convert between PDF and other formats: PDF to Word, Word to PDF, PDF to Image, and more.

POST /pdf-converter/{pair}/upload

PDF Password

POST

Add password protection to PDFs or unlock password-protected files programmatically.

POST /pdf-password/protect

Metadata Editor

POST

Edit or remove PDF metadata: update author, title, subject, keywords, or strip all metadata for privacy.

POST /pdf-metadata-editor/edit

Integration Methods

Choose the integration method that fits your AI agent setup. All methods provide access to the same PDF tools.

MCP Server (Recommended)

Easiest Setup

The Model Context Protocol lets your AI assistant call PDFCheck tools directly using natural language. No code needed โ€” just configure and go.

Works with: Claude Desktop, GitHub Copilot, Cursor, Windsurf, and any MCP-compatible client.

Add to your AI tool config:
{
  "mcpServers": {
    "pdfcheck": {
      "url": "https://pdf.businesspress.io/api/mcp"
    }
  }
}
Full MCP setup guide

REST API

Most Flexible

Standard HTTP API for programmatic access. Perfect for custom agent frameworks, LangChain, AutoGPT, CrewAI, and custom code.

Analyze a PDF via API:
curl -X POST https://pdf.businesspress.io/api/v1/pdf/analyze \
  -H "Authorization: Bearer YOUR_API_TOKEN" \
  -F "file=@document.pdf"

# Response:
# {
#   "success": true,
#   "data": {
#     "id": "abc123",
#     "filename": "document.pdf",
#     "metadata": {
#       "author": "John Doe",
#       "creator": "Microsoft Word",
#       "producer": "Adobe PDF Library",
#       "page_count": 10,
#       "pdf_version": "1.7",
#       "dates": { "created": "2025-01-15T10:30:00Z", "modified": "2025-01-16T14:22:00Z" }
#     }
#   },
#   "remaining_checks": 49
# }
Full API documentation

CLI Tool

For Shell Agents

Command-line interface for terminal-based AI agents and shell scripts. Wrap the API in a simple bash function.

Quick PDF analysis from terminal:
#!/bin/bash
# PDFCheck CLI wrapper โ€” add to your .bashrc or .zshrc

pdfcheck() {
  local TOKEN="${PDFCHECK_API_TOKEN}"
  local BASE="https://pdf.businesspress.io/api/v1"

  case "$1" in
    analyze)
      curl -s -X POST "$BASE/pdf/analyze" \
        -H "Authorization: Bearer $TOKEN" \
        -F "file=@$2" | jq .
      ;;
    list)
      curl -s "$BASE/pdf" \
        -H "Authorization: Bearer $TOKEN" | jq .
      ;;
    get)
      curl -s "$BASE/pdf/$2" \
        -H "Authorization: Bearer $TOKEN" | jq .
      ;;
    usage)
      curl -s "$BASE/user" \
        -H "Authorization: Bearer $TOKEN" | jq .
      ;;
    *)
      echo "Usage: pdfcheck {analyze FILE|list|get TOKEN|usage}"
      ;;
  esac
}

# Examples:
# pdfcheck analyze ./contract.pdf
# pdfcheck list
# pdfcheck get abc123xyz789
# pdfcheck usage

System Prompts for AI Agents

Ready-to-use system prompts for popular AI platforms. Copy and paste to give your agent PDF analysis capabilities.

Add to your Custom GPT instructions or system prompt for actions:

ChatGPT / Custom GPT
You are a PDF analysis assistant. When the user asks you to analyze, validate, or check a PDF document, use the PDFCheck API to process it.

API Base URL: https://pdf.businesspress.io/api/v1
Authentication: Bearer token in Authorization header

Available actions:
- POST /pdf/analyze โ€” Upload and analyze a PDF (multipart/form-data, field: file)
- GET /pdf/{token} โ€” Retrieve analysis results by share token
- GET /pdf โ€” List all previous analyses
- GET /user โ€” Check remaining daily quota

When analyzing a PDF, report: file name, page count, creation date, modification date, author, software used (Creator and Producer), PDF version, and any suspicious findings. If the Producer or Creator field contains tools like ReportLab, WeasyPrint, pdf-lib, PDFKit, or Puppeteer, note that these are commonly used by AI systems to generate PDFs.

Claude Desktop with MCP โ€” just add the server config and use naturally:

Claude (via MCP)
You have access to PDFCheck via the MCP server. When the user mentions PDF files, offers to analyze them, or asks about document metadata:

1. Use analyze_pdf with the file path to upload and analyze the PDF
2. Report key findings: creation date, author, software, page count, modification status
3. Use check_usage to monitor remaining daily quota
4. Use list_analyses to reference previous analyses
5. If the user asks about AI detection, look for AI-associated tools in the Producer/Creator fields

Always explain what each metadata field means and flag anything unusual (mismatched dates, AI-associated tools, missing metadata).

Add to .github/copilot-instructions.md or .copilot-instructions.md in your project:

GitHub Copilot (VS Code)
When working with PDF files in this project, use the PDFCheck MCP server to analyze them. The pdfcheck MCP tools are available:

- analyze_pdf: Upload a PDF by file path and get full metadata analysis
- get_analysis: Retrieve a previous analysis by share token
- list_analyses: List all previous PDF analyses
- check_usage: Check remaining daily API quota

Use these tools when:
- The user asks to verify a PDF document
- A PDF is mentioned in a code review or issue
- You need to check metadata, signatures, or AI-generation markers
- The user wants to validate PDF output from code

Add to your .cursorrules or agent rules file:

Cursor / Windsurf
You have access to PDFCheck MCP tools for PDF analysis. Use them when the user asks about PDF files:

Available tools:
- analyze_pdf(filePath) โ†’ Full metadata analysis including author, dates, software, page count
- get_analysis(token) โ†’ Retrieve previous analysis by share token
- list_analyses() โ†’ List all previous analyses
- check_usage() โ†’ Check remaining daily quota

Report findings clearly: creation date, modification status, software used, page count, and any AI-generation indicators.

PDF Skills for AI Agents

Downloadable skill files that teach AI agents how to work with PDFs expertly. Drop these into your agent configuration.

PDF Analysis Skill

Teaches the agent to extract and interpret PDF metadata, detect anomalies, and report findings in a structured format.

PDF Generate Skill

Teaches the agent to generate well-structured PDF documents using common libraries (ReportLab, pdf-lib, Puppeteer, WeasyPrint, etc.).

PDF Validation Skill

Teaches the agent to validate PDF documents for structural integrity, compliance standards (PDF/A, PDF/UA), and accessibility.

PDF Workflow Skill

Teaches the agent to chain multiple PDF operations: analyze โ†’ validate โ†’ optimize โ†’ merge โ†’ deliver.

Advanced Agent Workflows

Example multi-step workflows that AI agents can execute using PDFCheck tools.

Invoice Processing Pipeline

Upload invoice PDF โ†’ Extract metadata โ†’ Validate structure โ†’ Check for AI generation โ†’ Extract text โ†’ Route for approval.

Document Compliance Check

Analyze PDF โ†’ Check accessibility (PDF/UA) โ†’ Validate signatures โ†’ Verify metadata completeness โ†’ Generate compliance report.

Archive Preparation

Batch analyze PDFs โ†’ Validate each โ†’ Optimize file size โ†’ Merge into archive โ†’ Strip personal metadata โ†’ Generate index.

PDF Output QA

Generate PDF โ†’ Validate structure โ†’ Check accessibility โ†’ Verify metadata โ†’ Compare with template โ†’ Report discrepancies.

LLM-Optimized Documentation

Machine-readable resources designed specifically for AI consumption.

Start Building PDF-Powered AI Agents

Free to start. No credit card required. 10 checks/day without account, 50/day with free registration.