LandingAI
Visual AI company founded by Andrew Ng specializing in agentic document extraction and computer vision through specialized transformer models.
Overview
LandingAI was founded by Andrew Ng — co-founder of Coursera, founding lead of Google Brain, and former chief scientist at Baidu — and is headquartered in Mountain View, California. The company has staked out "agentic document extraction" as a distinct category, positioning its approach against both traditional OCR pipelines and general-purpose LLMs.
The September 2025 launch of Document Pre-trained Transformer-2 (DPT-2) is the clearest expression of that strategy: a model trained specifically on document structure — gridline-free tables, merged cells, non-standard layouts — rather than adapted from a general foundation model. The company claims 99.16% accuracy on DocVQA, a visual question-answering benchmark for real-world documents, though no competitor score on the same test has been published independently, making the figure a floor rather than a competitive position. An AIMultiple evaluation of 60 document images ranked LandingAI first at 69/100 composite, ahead of Mistral OCR, Claude 3.7 Sonnet, OpenAI o3-mini, and Docsumo — but competitor scores were not disclosed, so the margin of the lead is unknown.
The credibility picture has a concrete gap. Deep Analysis Senior Analyst Dan Lucarini explicitly stated, citing LandingAI's own webinar, that ADE does not yet have confidence scoring for data extraction — which he calls a requirement for any enterprise production deployment. LandingAI's own product page lists "confidence scoring and audit-ready traceability per extracted field" as a current capability. The discrepancy is unresolved: either the feature shipped between the webinar and the page update, or the marketing page is ahead of the actual product. Lucarini's open invitation for LandingAI to brief analysts and resolve the question had not been answered as of his publication date.
LandingAI was also absent from the ISG Buyers Guide for IDP Platforms published February 11, 2026 — the same week it received editorial analyst coverage — suggesting it has not yet met the revenue, customer count, or analyst engagement thresholds for formal enterprise evaluation inclusion. The company sits alongside Reducto AI (valued at $600M), Unstructured, and automat as Bay Area IDP startups collectively raising approximately $200M in a market that added 100 new vendor names in the past 12 months alone. LandingAI's own funding figure is not publicly disclosed.
Strategic partnerships signal expansion beyond pure document processing. ABB Robotics made a strategic investment in September 2025 to integrate LandingLens into robotics software, targeting 80% reduction in vision AI deployment time. In January 2026, LandingAI partnered with Snowflake to deliver ADE as a Snowflake Native App for energy sector applications, where CEO Dan Maloney noted that critical operational and regulatory intelligence remains "locked outside the analytics stack."
Developer adoption is the near-term growth lever. A Financial AI Hackathon drew over 1,000 developers globally, producing winning solutions in loan underwriting, fraud detection, compliance automation, and invoice processing — one entry combining DPT-2 with AWS Bedrock in a multi-agent architecture with RAG and deterministic rule engines. A free DeepLearning.AI course taught by LandingAI staff covers ADE's visual parsing approach and an AWS production pipeline — ADE triggers on S3 uploads, loads parsed documents into Amazon Bedrock Knowledge Base, and queries them via Strands Agents — positioning ADE within the enterprise cloud stack without requiring a direct enterprise sales relationship. Adoption figures cited by the company (90% reduction in information search times, billions of pages processed) are vendor-stated and unconfirmed by independent sources.
How LandingAI Processes Documents
LandingAI's Agentic Document Extraction (ADE) replaces the conventional OCR-then-LLM pipeline with a single visual AI pass. The DPT-2 model reads document layout as a visual object — preserving spatial relationships between text, tables, and figures — rather than converting the page to plain text first and losing structure in the process. AIMultiple noted that ADE "can extract complicated and mixed data (text and table on the same page) without any prompting," which distinguishes it from prompt-engineered general models.
The extraction pipeline produces structured Markdown and JSON with visual grounding: each extracted field is traceable to its bounding box in the source document. This cell-level provenance supports RAG pipelines and audit workflows. ADE Split handles multi-document PDFs by separating them using layout-aware visual AI before extraction begins.
Integration requires 3 lines of SDK code across Python, TypeScript, and JavaScript. The platform is available via cloud, on-premises, and virtual private environment deployment, with a Zero Data Retention option where applicable. Pricing is credit-based per page processed; full details are at docs.landing.ai/ade/ade-pricing. Security certifications are referenced at landing.ai/security-at-landingai but specific certification names are not disclosed — a source gap.
The unresolved confidence scoring question sits at the center of the production readiness debate. If the feature is live as the product page states, ADE closes a critical enterprise gap. If it is not, the platform remains a strong development and prototyping tool that requires additional validation layers before production deployment in regulated environments.
Use Cases
Financial Document Processing
Financial institutions use ADE to process 10-K forms, financial statements, and regulatory filings. The platform's smart 10-K auditor implementation demonstrates visual grounding for audit trails and element traceability across complex nested tables. Hackathon winners extended this to loan underwriting, fraud detection, and invoice processing across multiple formats, currencies, and languages — combining the ADE SDK with AWS Bedrock in multi-agent architectures with RAG and deterministic rule engines.
Andrew Ng has framed the opportunity directly: "In financial services — and in many other places — we have so much data... we retain the invoices, the financial documents, the 10Ks to the K1s, but we have so much data that for a long time has just been sitting around in our data warehouses or sometimes even on our laptops unprocessed."
Healthcare Document Intelligence
Dr. Declan Kelly from Eolas Medical reported: "ADE has significantly outperformed other document extractors we've used. It has helped us build an Agentic RAG answer engine, based on unique healthcare institutional content, to offer instant, validated support to medical professionals at the point of care." The platform's visual grounding and source traceability are particularly relevant for clinical documentation where provenance matters for compliance.
Energy Sector Operations
The January 2026 Snowflake Native App partnership targets energy organizations managing large volumes of unstructured operational and regulatory documents. Deploying ADE as a Snowflake Native App brings document extraction directly into the analytics stack, eliminating the data movement that previously kept document intelligence separate from operational data.
Manufacturing Visual Inspection
Through the ABB Robotics partnership, LandingLens is integrated into robotics software for quality control applications. Sami Atiya, President of ABB Robotics, stated: "Installation and deployment time is done in hours instead of weeks, allowing more businesses to automate smarter, faster and more efficiently." This use case extends LandingAI's footprint beyond document processing into physical production environments.
Technical Specifications
| Feature | Specification |
|---|---|
| Core Products | Agentic Document Extraction (ADE), LandingLens, VisionAgent |
| Core Technology | Document Pre-trained Transformer-2 (DPT-2), Visual AI, Computer Vision |
| Document Capabilities | Zero-shot parsing, semantic chunking, visual grounding, source traceability, ADE Split multi-document separation |
| Form Field Recognition | Checkboxes, signatures, barcodes, QR codes, attestations, ID cards, logos |
| Platform Type | API-first; cloud, on-premises, virtual private environment; Zero Data Retention option |
| Benchmark Performance | 99.16% DocVQA accuracy (vendor-stated, no competitor comparison published); 69/100 AIMultiple composite score, ranked #1 of 5 tools across 60 images |
| AIMultiple Test Scope | 30 flowcharts + 30 tables; metrics: node/edge/decision accuracy (flowcharts), title/header/row/cell accuracy (tables); competitor scores not published |
| Processing Scale | Billions of pages processed (vendor-stated, unconfirmed by independent sources) |
| Performance Claims | 90% reduction in information search time (vendor-stated); 80% reduction in vision AI deployment time via ABB partnership |
| Developer Tools | Python, TypeScript, JavaScript SDKs; 3-line integration; 5.2k GitHub stars |
| Pricing Model | Credit-based per page; monthly and annual subscriptions; free trial, no credit card required |
| Known Gap | Confidence scoring for data extraction: flagged as absent by Deep Analysis analyst (sourced from LandingAI webinar); listed as present on LandingAI product page — status unresolved |
| Enterprise Analyst Coverage | Absent from ISG Buyers Guide for IDP Platforms, February 2026 |
Resources
- Website
- Agentic Document Extraction
- LandingLens
- VisionAgent
- GitHub
- Document AI Course — DeepLearning.AI
- AIMultiple Benchmark
- Deep Analysis: LandingAI and the Future of IDP
- LandingAI: Competitive Analysis
- Agentic Document Processing Guide
- Agentic Capability Overview
Company Information
Headquarters: Mountain View, CA, USA
Founder: Andrew Ng
Users: 30,000+
Developer Community: 1,000+ hackathon participants (Financial AI Hackathon)
Funding: Not publicly disclosed