LandingAI

Visual AI company founded by Andrew Ng specializing in agentic document extraction and computer vision through specialized transformer models.

Overview

LandingAI was founded by Andrew Ng — co-founder of Coursera, founding lead of Google Brain, and former chief scientist at Baidu — and is headquartered in Mountain View, California. The company has staked out "agentic document extraction" as a distinct category, positioning its approach against both traditional OCR pipelines and general-purpose LLMs.

The September 2025 launch of Document Pre-trained Transformer-2 (DPT-2) is the clearest expression of that strategy: a model trained specifically on document structure — gridline-free tables, merged cells, non-standard layouts — rather than adapted from a general foundation model. The company claims 99.16% accuracy on DocVQA, a visual question-answering benchmark for real-world documents, though no competitor score on the same test has been published independently, making the figure a floor rather than a competitive position. An AIMultiple evaluation of 60 document images ranked LandingAI first at 69/100 composite, ahead of Mistral OCR, Claude 3.7 Sonnet, OpenAI o3-mini, and Docsumo — but competitor scores were not disclosed, so the margin of the lead is unknown.

The credibility picture has a concrete gap. Deep Analysis Senior Analyst Dan Lucarini explicitly stated, citing LandingAI's own webinar, that ADE does not yet have confidence scoring for data extraction — which he calls a requirement for any enterprise production deployment. LandingAI's own product page lists "confidence scoring and audit-ready traceability per extracted field" as a current capability. The discrepancy is unresolved: either the feature shipped between the webinar and the page update, or the marketing page is ahead of the actual product. Lucarini's open invitation for LandingAI to brief analysts and resolve the question had not been answered as of his publication date.

LandingAI was also absent from the ISG Buyers Guide for IDP Platforms published February 11, 2026 — the same week it received editorial analyst coverage — suggesting it has not yet met the revenue, customer count, or analyst engagement thresholds for formal enterprise evaluation inclusion. The company sits alongside Reducto AI (valued at $600M), Unstructured, and automat as Bay Area IDP startups collectively raising approximately $200M in a market that added 100 new vendor names in the past 12 months alone. LandingAI's own funding figure is not publicly disclosed.

Strategic partnerships signal expansion beyond pure document processing. ABB Robotics made a strategic investment in September 2025 to integrate LandingLens into robotics software, targeting 80% reduction in vision AI deployment time. In January 2026, LandingAI partnered with Snowflake to deliver ADE as a Snowflake Native App for energy sector applications, where CEO Dan Maloney noted that critical operational and regulatory intelligence remains "locked outside the analytics stack."

Developer adoption is the near-term growth lever. A Financial AI Hackathon drew over 1,000 developers globally, producing winning solutions in loan underwriting, fraud detection, compliance automation, and invoice processing — one entry combining DPT-2 with AWS Bedrock in a multi-agent architecture with RAG and deterministic rule engines. A free DeepLearning.AI course taught by LandingAI staff covers ADE's visual parsing approach and an AWS production pipeline — ADE triggers on S3 uploads, loads parsed documents into Amazon Bedrock Knowledge Base, and queries them via Strands Agents — positioning ADE within the enterprise cloud stack without requiring a direct enterprise sales relationship. Adoption figures cited by the company (90% reduction in information search times, billions of pages processed) are vendor-stated and unconfirmed by independent sources.

How LandingAI Processes Documents

LandingAI's Agentic Document Extraction (ADE) replaces the conventional OCR-then-LLM pipeline with a single visual AI pass. The DPT-2 model reads document layout as a visual object — preserving spatial relationships between text, tables, and figures — rather than converting the page to plain text first and losing structure in the process. AIMultiple noted that ADE "can extract complicated and mixed data (text and table on the same page) without any prompting," which distinguishes it from prompt-engineered general models.

The extraction pipeline produces structured Markdown and JSON with visual grounding: each extracted field is traceable to its bounding box in the source document. This cell-level provenance supports RAG pipelines and audit workflows. ADE Split handles multi-document PDFs by separating them using layout-aware visual AI before extraction begins.

Integration requires 3 lines of SDK code across Python, TypeScript, and JavaScript. The platform is available via cloud, on-premises, and virtual private environment deployment, with a Zero Data Retention option where applicable. Pricing is credit-based per page processed; full details are at docs.landing.ai/ade/ade-pricing. Security certifications are referenced at landing.ai/security-at-landingai but specific certification names are not disclosed — a source gap.

The unresolved confidence scoring question sits at the center of the production readiness debate. If the feature is live as the product page states, ADE closes a critical enterprise gap. If it is not, the platform remains a strong development and prototyping tool that requires additional validation layers before production deployment in regulated environments.

Use Cases

Financial Document Processing

Financial institutions use ADE to process 10-K forms, financial statements, and regulatory filings. The platform's smart 10-K auditor implementation demonstrates visual grounding for audit trails and element traceability across complex nested tables. Hackathon winners extended this to loan underwriting, fraud detection, and invoice processing across multiple formats, currencies, and languages — combining the ADE SDK with AWS Bedrock in multi-agent architectures with RAG and deterministic rule engines.

Andrew Ng has framed the opportunity directly: "In financial services — and in many other places — we have so much data... we retain the invoices, the financial documents, the 10Ks to the K1s, but we have so much data that for a long time has just been sitting around in our data warehouses or sometimes even on our laptops unprocessed."

Healthcare Document Intelligence

Dr. Declan Kelly from Eolas Medical reported: "ADE has significantly outperformed other document extractors we've used. It has helped us build an Agentic RAG answer engine, based on unique healthcare institutional content, to offer instant, validated support to medical professionals at the point of care." The platform's visual grounding and source traceability are particularly relevant for clinical documentation where provenance matters for compliance.

Energy Sector Operations

The January 2026 Snowflake Native App partnership targets energy organizations managing large volumes of unstructured operational and regulatory documents. Deploying ADE as a Snowflake Native App brings document extraction directly into the analytics stack, eliminating the data movement that previously kept document intelligence separate from operational data.

Manufacturing Visual Inspection

Through the ABB Robotics partnership, LandingLens is integrated into robotics software for quality control applications. Sami Atiya, President of ABB Robotics, stated: "Installation and deployment time is done in hours instead of weeks, allowing more businesses to automate smarter, faster and more efficiently." This use case extends LandingAI's footprint beyond document processing into physical production environments.

Technical Specifications

Feature	Specification
Core Products	Agentic Document Extraction (ADE), LandingLens, VisionAgent
Core Technology	Document Pre-trained Transformer-2 (DPT-2), Visual AI, Computer Vision
Document Capabilities	Zero-shot parsing, semantic chunking, visual grounding, source traceability, ADE Split multi-document separation
Form Field Recognition	Checkboxes, signatures, barcodes, QR codes, attestations, ID cards, logos
Platform Type	API-first; cloud, on-premises, virtual private environment; Zero Data Retention option
Benchmark Performance	99.16% DocVQA accuracy (vendor-stated, no competitor comparison published); 69/100 AIMultiple composite score, ranked #1 of 5 tools across 60 images
AIMultiple Test Scope	30 flowcharts + 30 tables; metrics: node/edge/decision accuracy (flowcharts), title/header/row/cell accuracy (tables); competitor scores not published
Processing Scale	Billions of pages processed (vendor-stated, unconfirmed by independent sources)
Performance Claims	90% reduction in information search time (vendor-stated); 80% reduction in vision AI deployment time via ABB partnership
Developer Tools	Python, TypeScript, JavaScript SDKs; 3-line integration; 5.2k GitHub stars
Pricing Model	Credit-based per page; monthly and annual subscriptions; free trial, no credit card required
Known Gap	Confidence scoring for data extraction: flagged as absent by Deep Analysis analyst (sourced from LandingAI webinar); listed as present on LandingAI product page — status unresolved
Enterprise Analyst Coverage	Absent from ISG Buyers Guide for IDP Platforms, February 2026

Resources

Website
Agentic Document Extraction
LandingLens
VisionAgent
GitHub
Document AI Course — DeepLearning.AI
AIMultiple Benchmark
Deep Analysis: LandingAI and the Future of IDP
LandingAI: Competitive Analysis
Agentic Document Processing Guide
Agentic Capability Overview