Dataiku
Universal AI platform with document intelligence capabilities, preparing for a 2026 IPO with $350M ARR and a governance-by-design approach.

Overview
Founded in 2013 in Paris, Dataiku sells an enterprise AI product, the Universal AI Platform, which includes intelligent document processing. The company is preparing for a U.S. IPO in H1 2026 with Morgan Stanley and Citigroup as lead underwriters at a $3.7 billion valuation. Dataiku surpassed $350M ARR in October 2025, up from more than $300M in January 2025.
The company was recognized as a Leader in IDC's MarketScape for Worldwide Unified AI Governance Platforms 2026, with CEO Florian Douetteau emphasizing that "AI governance has shifted from a checkpoint to a foundation." Former Salesforce President Alexandre Dayon joined the Board of Directors in October 2025, strengthening enterprise expertise ahead of the IPO.
Key Features
- Natif.ai IDP Plugin: Processes documents (PDF, TIFF, JPEG) using computer vision, deep learning, and NLP
- Agent Hub: Collaborative workspace for building, sharing, and scaling AI agents with ROI measurement and governance
- AI Factory Accelerator: NVIDIA-powered solution for enterprise AI acceleration with native governance integration
- Governance by Design: Embeds AI governance directly into development workflows rather than adding controls as an afterthought
- VLM and LLM Integration: Extracts and embeds information from text, tables, and images using vision-language models
- Retail Accelerator Pack: Seven ready-to-use retail use cases including entity extraction and LLM-enhanced predictions
Use Cases
Enterprise AI Governance
Organizations use Dataiku's unified governance platform to close a traceability gap: 95% of data leaders say they cannot fully trace AI decisions end-to-end, even though 86% report that AI is embedded in daily operations. The platform builds governance controls directly into AI development workflows.
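To make end-to-end tracing of an AI decision concrete, the sketch below models lineage as a mapping from each artifact to its direct inputs and walks that mapping back to the raw sources. It is an illustration only: the artifact names and the `trace_lineage` and `untraceable` helpers are hypothetical and are not part of Dataiku's product or API.

```python
# Hypothetical lineage records: each artifact maps to the artifacts it was built from.
LINEAGE = {
    "loan_decision": ["credit_model_v3", "applicant_features"],
    "credit_model_v3": ["training_set_2024"],
    "applicant_features": ["crm_export", "scanned_application"],
    "training_set_2024": ["crm_export"],
}


def trace_lineage(artifact: str, lineage: dict[str, list[str]]) -> list[str]:
    """Return every upstream artifact of `artifact` in breadth-first order, without duplicates."""
    seen: list[str] = []
    queue = list(lineage.get(artifact, []))
    while queue:
        current = queue.pop(0)
        if current in seen:
            continue
        seen.append(current)
        # Artifacts with no recorded inputs contribute nothing further.
        queue.extend(lineage.get(current, []))
    return seen


def untraceable(artifact: str, lineage: dict[str, list[str]], known_sources: set[str]) -> set[str]:
    """Return upstream artifacts that have no recorded inputs and are not registered sources."""
    upstream = trace_lineage(artifact, lineage)
    return {a for a in upstream if a not in lineage and a not in known_sources}


upstream = trace_lineage("loan_decision", LINEAGE)
gaps = untraceable("loan_decision", LINEAGE, known_sources={"crm_export"})
```

In this toy data, `gaps` contains `scanned_application`: the decision depends on it, but nothing records where it came from. That is the kind of break in the chain that a governance layer is meant to surface.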
Retail AI Transformation
Retailers use Dataiku's Retail Accelerator Pack to optimize customer experience. As Head of AI Architecture Jed Dougherty notes, "The riskiest place to use GenAI in retail is also the most valuable one: the customer experience."
Document Intelligence Workflows
Teams process document collections through a modular pipeline that converts both native (born-digital) and scanned content into structured data, with embedded governance controls that support regulatory compliance and audit trails.
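The workflow described above can be sketched in plain Python. This is a minimal illustration under stated assumptions, not Dataiku's or Natif.ai's API: the stage functions (`classify_source`, `extract_fields`), the `Document` and `AuditEvent` types, and the extension-based routing rule are all hypothetical, and the extraction step is a stub standing in for OCR or a vision-language model.

```python
from dataclasses import dataclass, field
from datetime import datetime, timezone
from pathlib import Path
from typing import Callable

# Assumption: image formats are treated as scanned content. Real PDFs can also
# be scans, so a production pipeline would inspect the file, not its extension.
SCANNED_SUFFIXES = {".tiff", ".tif", ".jpeg", ".jpg"}


@dataclass
class AuditEvent:
    stage: str
    document: str
    detail: str
    timestamp: str


@dataclass
class Document:
    path: str
    source_type: str = "unknown"  # becomes "native" or "scanned"
    fields: dict = field(default_factory=dict)


# A stage mutates the document and returns a short description for the audit log.
Stage = Callable[[Document], str]


def classify_source(doc: Document) -> str:
    suffix = Path(doc.path).suffix.lower()
    doc.source_type = "scanned" if suffix in SCANNED_SUFFIXES else "native"
    return f"source_type={doc.source_type}"


def extract_fields(doc: Document) -> str:
    # Stub: records which extraction route would be taken instead of running a model.
    method = "ocr" if doc.source_type == "scanned" else "text_layer"
    doc.fields = {"extraction_method": method, "file_name": Path(doc.path).name}
    return f"extracted {len(doc.fields)} fields via {method}"


def run_pipeline(paths: list[str], stages: list[Stage]) -> tuple[list[Document], list[AuditEvent]]:
    """Run every stage on every document, logging one audit event per stage call."""
    documents: list[Document] = []
    audit_log: list[AuditEvent] = []
    for path in paths:
        doc = Document(path=path)
        for stage in stages:
            detail = stage(doc)
            audit_log.append(
                AuditEvent(
                    stage=stage.__name__,
                    document=doc.path,
                    detail=detail,
                    timestamp=datetime.now(timezone.utc).isoformat(),
                )
            )
        documents.append(doc)
    return documents, audit_log


docs, log = run_pipeline(["invoice.pdf", "receipt.tiff"], [classify_source, extract_fields])
```

Because the audit event is written by `run_pipeline` rather than by each stage, a new stage cannot skip logging. This is one simple way to read "governance embedded in the workflow" as opposed to a separate review step.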
Technical Specifications
| Feature | Specification |
|---|---|
| Core Platform | Dataiku Data Science Studio (DSS), Universal AI Platform |
| Document Processing | Natif.ai IDP plugin, modular pipeline |
| AI Governance | Native governance controls, end-to-end traceability |
| Agent Platform | Agent Hub with collaborative workspace |
| Cloud Partnerships | AWS Agentic AI and Healthcare Software Competency, NVIDIA AI Factory |
| File Formats | PDF, TIFF, JPEG, and other file types |
| Deployment | Cloud, on-premises |
| Accelerators | Retail, healthcare, manufacturing use cases |
Company Information
Headquarters: Paris, France (US HQ: New York City)
Founded: 2013
Offices: New York, Denver, Washington DC, Los Angeles, Paris, London, Munich, Frankfurt, Sydney, Singapore, Tokyo, Dubai
Funding: $400M Series E at a $4.6B valuation (2021); $200M Series F at a $3.7B valuation (2022)