SAP Document AI: Enterprise IDP Platform
On This Page
SAP's unified intelligent document processing solution combining OCR, large language models, and pre-trained transformers for enterprise document automation across SAP's business application ecosystem.

Overview
SAP Document AI consolidates the former Document Information Extraction service into a document processing platform deployed on SAP Business Technology Platform. The solution uses blended AI methods including pre-trained transformers and large language models to minimize training data requirements. Research lineage includes the CharGrid, BERTgrid, and Charmer papers, predating the LLM era, and the platform now supports schema-based zero-shot processing - handling complex documents without task-specific retraining.
Q4 2025 marked a strategic inflection: four capabilities reached general availability simultaneously - multimodal vision extraction, email attachment processing, multi-step document workflows, and SAP Cloud Transport Management integration. The workflow engine GA is the most architecturally significant of the four. By adding classification, routing, and automated processing pipelines, SAP Document AI moved from a point extraction tool into an orchestration layer for document processing, as SAP CTO Philipp Herzig confirmed in the Q4 2025 Business AI roundup.
Scale figures disclosed through a February 2026 Hasso Plattner Founders' Award nomination - 30,000+ customers, billions of documents processed, €2.6 billion in estimated annual business value, and 285x growth in SAP BTP usage for custom document automation since 2020 - are self-reported and unverified by independent analysts. No Gartner, Everest Group, or IDC assessment of Document AI specifically is cited. The product is natively embedded in 32 business processes across SAP S/4HANA, SAP Business Network, SAP Concur, SAP Fieldglass, SAP SuccessFactors, SAP Customer Experience, and SAP BTP, with "dozens more use cases in development."
SAP positions Document AI as an embedded solution within its enterprise application ecosystem rather than a standalone IDP competitor. Integration into 32 business processes means Document AI reaches customers who never evaluated it as a standalone IDP product - a distribution model structurally difficult for point-solution vendors to replicate.
How SAP Document AI Processes Documents
The Q4 2025 GA releases define the current processing architecture across four layers.
Multimodal extraction allows schema administrators to toggle between text-only and combined text-and-image processing. In vision mode, a multimodal model interprets visual elements alongside text - specifically hazard pictograms, stamps, signatures, logos, diagrams, and labels. Per-schema control allows cost and performance tuning. Text-only IDP systems have a known structural limitation on visually complex compliance documents; the vision mode directly targets Safety Data Sheets and Declarations of Conformity as primary use cases. No throughput benchmarks or accuracy comparisons against competing IDP vendors are provided.
Multi-step document workflows let users define processing pipelines - combining extraction, classification, email processing, content-based routing, and automated processing - without external integrations or additional tooling. Workflows trigger automatically via inbound channels or manually via file upload.
Email attachment processing handles attachments alongside or separately from the email body, improving flexibility for email-based document ingestion channels.
Transport management integration with SAP Cloud Transport Management enables export and import of schemas across development, QA, and production instances, addressing enterprise deployment consistency for teams managing multiple environments.
Native integration with SuccessFactors, S/4HANA, and SAP ERP provides multilingual OCR across 110+ languages via next-generation generative AI models, with handwriting detection and barcode recognition. No pricing or plan tier information is available for any of the new features, including vision processing. No availability regions are specified.
SAP has signaled reusable tools to empower AI agents handling complex document workflows across industries as the next announced capability. No launch date or technical specification has been provided.
Use Cases
HR & Employee Onboarding
The embedded edition within SAP SuccessFactors Onboarding automates extraction of key fields from national ID documents - ID type, number, and validity dates - and prompts new hires to validate captured data before submission. SAP claims up to 15% acceleration in overall onboarding cycles and up to 30% improvement in validation accuracy. Both figures are SAP benchmark estimates, not independently verified.
Safety & Compliance
Vision-enabled extraction processes Safety Data Sheets and Declarations of Conformity with hazard pictogram recognition and visual element identification - document types where text-only IDP systems fail structurally. Teams evaluating open-source alternatives for compliance document pipelines may also consider Unstract's no-code LLM platform, which addresses hallucination mitigation for regulated document workflows.
Finance & Procurement
Pre-configured templates for invoices, purchase orders, and remittance advice with continuous learning from user corrections. For SAP-connected enterprise workflows, Hypatos also integrates with SAP for financial document automation and offers a point of comparison on straight-through processing rates. Teams building structured extraction pipelines on top of LLMs may also evaluate LangExtract, Google's open-source Python library for grounded structured extraction from unstructured text.
Public Sector
The City of Hamburg automatically classified 6 million documents for aid application processing, demonstrating the platform's capacity for high-volume government workflows. Organizations evaluating SAP Document AI for similar government-scale deployments may also compare xSuite, a SAP-certified accounts payable automation provider processing 80+ million documents annually across 60 countries, as a point of reference on SAP-native document throughput at scale.
Technical Specifications
| Component | Details |
|---|---|
| Deployment | SAP Business Technology Platform (BTP) |
| AI Technology | Pre-trained transformers, LLMs, vision-enabled multimodal extraction |
| Languages | 110+ languages via next-generation generative AI models |
| File Formats | 35+ formats including PDF, images, Office documents |
| Visual Processing | Pictograms, stamps, signatures, logos, charts, labels |
| Integration | SAP S/4HANA, SuccessFactors, SAP Concur, SAP Fieldglass, SAP Business Network, OpenText VIM |
| Workflow | Multi-step automation with classification, routing, email attachment processing |
| Transport Management | SAP Cloud Transport Management for schema promotion across dev/QA/prod |
| Zero-Shot Processing | Schema-based extraction without task-specific retraining |
| Embedded Processes | 32 native business processes across SAP portfolio |
| Pricing | Not publicly disclosed |
Resources
- SAP Document AI Product Page
- SAP Community - Document AI
- SAP Help Documentation
- SAP Tutorials
2026-01 [news: Q4 2025 Business AI Release Highlights | news.sap.com] Four Document AI capabilities reach GA: vision extraction, email attachment processing, multi-step workflows, transport management integration (https://news.sap.com/2026/01/sap-business-ai-release-highlights-q4-2025/)2026-02 [news: Hasso Plattner Founders' Award Finalists | news.sap.com] Document AI named finalist; self-reported scale metrics disclosed: 30,000+ customers, billions of documents, €2.6B estimated business value, 285x BTP usage growth since 2020 (https://news.sap.com/2026/02/hasso-plattner-founders-award-finalists-scaling-innovation/)2026-02 [news: Q4 2025 Release Highlights - German Edition | news.sap.com] German-language companion post confirming same GA releases with additional architectural framing (https://news.sap.com/germany/2026/02/highlights-q4-2025-release-sap-business-ai-it-und-entwickler/)
Company Information
SAP SE Dietmar-Hopp-Allee 16 69190 Walldorf, Germany Phone: +49 6227 7-47474 Email: info@sap.com Website: https://www.sap.com
"We built SAP Document AI to deliver measurable business value at global scale, securely, responsibly, and embedded in everyday processes, demonstrating SAP's ability to operationalize AI at massive scale."
- Tobias Weller, Chief Product Owner and Team Lead, SAP Document AI