On This Page

Open-source content services platform acquired by Hyland in 2021, now operating as one of three federated repositories alongside OnBase and Alfresco in Hyland's Content Innovation Cloud.

Nuxeo

$200K/yearStarting price
4.2/5User rating (GetApp/Capterra)
220%QoQ agentic adoption growth (Q4 2025)
2000Year founded

Overview

Founded in 2000 and acquired by Hyland in 2021, Nuxeo is a high-end content services platform built for petabyte-scale document management. Enterprise pricing starts at $200,000 annually, positioning it against SharePoint in the upper ECM segment, with a 4.2/5 rating across review platforms.

Under CEO Jitesh Ghai, Hyland's strategy has shifted from standalone repository competition toward federation. Rather than displacing incumbent content systems, Nuxeo is now positioned as one node in a Content Federation Service that lets AI analysis run across Nuxeo, OnBase, Alfresco, and external systems including SharePoint 365 without requiring data migration. Chief Innovation Strategist John Newton describes the approach as "modernize without migration."

That federation bet is gaining commercial traction at the portfolio level. Hyland reported 220% quarter-on-quarter growth in customer adoption of agentic content services across its Content Innovation Cloud in Q4 2025. Whether that demand concentrates in Nuxeo specifically, or in OnBase or Alfresco, was not disclosed, a gap worth watching in subsequent quarters.

In December 2025, Hyland was named a Leader in the IDC MarketScape for Worldwide Intelligent Document Processing Software 2025-2026. IDC Senior Research Manager Amy Machado stated: "With a cloud-native architecture, modern user experience, and AI-driven capabilities, Hyland is well-positioned to support a broad range of IDP use cases and agentic automation workflows across the entire document lifecycle." The designation covers Hyland's full portfolio; Nuxeo-specific IDP benchmarks were not broken out separately.

In early 2026, Nuxeo shipped LTS 2025.0 with infrastructure modernization across AWS SDK 2.0 (client-side encryption), MongoDB Java driver 5.x, and PDFBox 3.x, alongside a 10% startup performance improvement. The release also removes legacy components, including NTLM authentication and OAuth 1.0, signaling a clean break from pre-cloud-era integrations. Hyland's February 2026 update added simplified administration, automated redaction, and compliance enhancements to Nuxeo, along with a real-time compliance reporting module covering processing accuracy, throughput, and regulatory adherence. Feature-level changelogs for these updates have not been published in available sources.

As Chris G., Senior Solutions Architect in Healthcare, summarizes the platform's core strength: "The performance, at petabyte scale with Nuxeo is impressive. We can ultimately do anything via API which we could via CLI or via Web UI."

How Nuxeo processes documents

Nuxeo's microservices architecture provides cloud-native, containerized document processing built around an API-first design. The LTS 2025.0 release anchors the stack on AWS SDK 2.0 with S3 KMS client-side encryption, MongoDB Java driver 5.x for the document store, and Elasticsearch for configurable faceted search across billions of documents.

AI agent capabilities enter through Hyland's Agent Builder, which integrates with Nuxeo workflows to enable natural language automation within existing environments without rebuilding the underlying repository. The Content Federation Service extends this further: AI analysis can now operate across content held in Nuxeo and external systems, including SharePoint 365, without migrating the underlying data. Michael Campbell, Chief Product Officer at Hyland, described the direction in February 2026: "We're defining this new wave of AI-native content innovation enabling organizations to automate what matters, work anywhere, and scale securely."

OCR is handled through partner addons rather than native development. The primary route is a Tesseract-based addon through Oceane Consulting, available via the Nuxeo Marketplace. For teams evaluating open-source alternatives to partner-delivered OCR, Unstract offers a no-code LLM platform with production-grade extraction that can complement or replace Tesseract-based pipelines in similar enterprise environments. Standards-based BPMN 2.0 workflows handle process automation, and SCIM 2.0 support covers identity management for enterprise directory integration.

The February 2026 compliance updates add automated redaction and a real-time reporting module that surfaces processing accuracy, throughput metrics, and regulatory adherence. For enterprises operating under frameworks such as the Australian Privacy Act or HIPAA, this module addresses a governance gap that previously required custom reporting outside the platform.

Use cases

Enterprise content at petabyte scale

Large organizations use Nuxeo's horizontally scalable architecture to manage billions of documents with Elasticsearch-powered search and fine-grained permissions. The platform's primary competitive displacement is against Microsoft SharePoint in high-volume scenarios. Chris G. from Healthcare migrated specifically because "Microsoft SharePoint has turned out to be extremely non-performant with extremely large volumes of binary document data."

Federated AI across fragmented content estates

Through Hyland's Content Federation Service, Nuxeo serves as one node in a multi-repository AI layer that also spans OnBase, Alfresco, and SharePoint 365. Enterprises that cannot consolidate onto a single content system, the common enterprise reality, can run AI document analysis and generative AI workflows across whichever repositories they already operate, without migrating data. Ian McCain, Vice President at Datum Evolve, a Hyland partner, described the business case: "By combining agentic automation, intelligent document processing, and content federation, we're helping organizations break out of content silos, accelerate critical workflows, strengthen governance, and turn unstructured information into a real operational advantage."

Organizations evaluating structured extraction across federated repositories may also consider LangExtract, Google's open-source Python library for extracting structured information from unstructured text using LLMs with precise source grounding, as a lightweight complement to platform-level federation.

Compliance and regulated industries

The February 2026 updates position Nuxeo directly for regulated sectors. Automated redaction, enhanced compliance features, and real-time reporting address requirements in financial services, healthcare, and government where audit trails and data residency controls are non-negotiable. The Content Federation Service's ability to apply AI processing without physical data migration is particularly relevant for organizations subject to data sovereignty regulations that prohibit moving content across jurisdictions.

Digital asset management

Organizations implement Nuxeo for centralized creative content workflows with AI-assisted metadata tagging, version control, and automated derivative generation through partner marketplace addons. Vendors such as VIDIZMO address adjacent requirements in this space, combining enterprise video management with document AI and redaction capabilities for government and regulated industries that need unified media and document governance.

Technical specifications

Feature Specification
Architecture Cloud-native microservices, AWS SDK 2.0
Database MongoDB Java driver 5.x
Deployment Cloud, on-premises, hybrid via Hyland platform
Scalability Petabyte scale, billions of documents
Search Elasticsearch with configurable facets
Integration REST API, SCIM 2.0, Java, JavaScript, .NET clients; Content Federation Service (SharePoint 365, OnBase, Alfresco)
OCR Partner addons via Tesseract-based services (Oceane Consulting)
Storage S3 with KMS client-side encryption v3
Security Fine-grained permissions, S3 KMS encryption, SCIM 2.0 identity management, auditing, automated redaction
Workflow BPMN 2.0; Hyland Agent Builder natural language automation
Compliance reporting Real-time module covering processing accuracy, throughput, regulatory adherence (added February 2026)
Pricing $200,000/year starting price
Rating 4.2/5 (GetApp/Capterra)

Resources

Company information

Field Detail
Founded 2000
Headquarters Nantes, France
Parent Hyland
Ownership Acquired by Hyland, 2021
Open source Yes (core platform on GitHub)
Deployment Cloud, on-premises, hybrid
Pricing $200,000/year starting price