On This Page

Berlin-based AI company providing the open-source Haystack framework for building production-ready AI applications, with recent expansion into AI agent integration, German-language processing, and validated on-premises enterprise deployments.

deepset

315+IDP vendors tracked
80+data sources and model providers
18German government AI agent pilots
3product tiers: OSS, Starter, Platform

Overview

deepset develops Haystack, an open-source AI orchestration framework that enables developers to build customizable LLM applications. Founded in 2018 by Milos Rusic, Malte Pietsch, and Timo Möller, the company was named a Gartner Cool Vendor in AI Engineering in 2024 and holds AWS Generative AI Competency status. With over 4,000 Discord community members and 300+ GitHub contributors, Haystack powers production deployments at Airbus, The Economist, NVIDIA, and Comcast.

Haystack has emerged as one of the two dominant players in the retrieval-augmented generation (RAG) framework market alongside LangChain, with deepset positioning it as the production-ready alternative focused on enterprise deployment and maintainability rather than rapid prototyping.

The company's strategic narrative has shifted from "open-source NLP framework" to "enterprise AI orchestration layer." As Milos Rusic stated in February 2026: "The winning layer is AI orchestration, which connects models to company data and workflows, with evaluation, observability, and governance. Through these methods, enterprises can deploy AI with trust and control."

Three developments in early 2026 mark a clear inflection point. deepset rebranded its commercial offering to Haystack Enterprise Platform, unifying open-source and commercial tiers under a single brand. Germany's Federal Ministry for Digital Affairs selected deepset to run 18 AI agent pilots across 17 municipalities, covering document analysis and administrative process automation. And Haystack was chosen as a reference architecture for Deutschland-Stack (D-Stack), Germany's national sovereign technology platform, with secunet Security Networks AG announcing a reference architecture for processing classified information powered by Haystack at SCCON25.

How deepset processes documents

Haystack uses a modular directed acyclic graph (DAG) pipeline architecture where each processing step, including document ingestion, chunking, embedding, retrieval, reranking, and generation, is a serializable, swappable component. Pipelines are defined in Python and exported to YAML for reproducible deployment across Docker, Kubernetes, Ray, and serverless environments.

For RAG workflows, Haystack connects to 80+ data sources and model providers including Hugging Face, Elasticsearch, OpenSearch, OpenAI, Cohere, and Anthropic. Built-in evaluation tooling lets teams measure retrieval quality and generation accuracy before promoting pipelines to production. This distinguishes Haystack from lighter-weight orchestration libraries that leave evaluation entirely to the developer.

Recent framework releases extend this architecture in two directions. Haystack v2.26.0 introduced Jinja2 templating for dynamic agent system prompts, an LLMRanker component for semantic reranking, and improved async performance through concurrent embedding inference with configurable concurrency limits. Haystack v2.27.0 added automatic list joining in pipelines, expanded DocumentStore metadata inspection utilities, and introduced partial support for multimodal models through an image-text-to-text task in HuggingFaceLocalChatGenerator. Version 2.27.0 also fixed a security issue in ChatPromptBuilder where template variables could be interpreted as structured content; they are now automatically sanitized during rendering.

The MCP server integration (July 2025) adds a conversational layer on top of this pipeline architecture. Developers can create, debug, and manage pipelines through natural language commands in Cursor or Claude Desktop using 30 specialized tools covering pipeline creation, index management, and component configuration.

For German-language document workflows, the deepset-mxbai-embed-de-large-v1 model produces 1,024-dimension embeddings from text and images with a 512-token maximum context, priced at $0.01 per million tokens through the mixedbread partnership.

Product tiers

deepset's three-tier structure mirrors the open-source monetization model used by Databricks and HashiCorp: a free tier drives developer adoption, commercial tiers capture enterprises at scale.

Haystack Open Source

Free

100% open-source, self-supported. Apache 2.0 license. Full framework access with community support via Discord and GitHub.

  • Full pipeline framework
  • 80+ integrations
  • Apache 2.0 license
  • Community support

Haystack Enterprise Starter {primary}

Contact for pricing

Pre-built templates plus 4 hours per month of remote consultation with core maintainers, priority updates, and early feature access.

  • Lightweight RAG templates
  • Advanced RAG with hallucination filtering
  • Agentic RAG with external tool use
  • Multimodal pipelines with OCR and image analysis
  • Multi-Agent pipelines with MCP
  • Priority updates

Haystack Enterprise Platform

Contact for pricing

Full-stack cloud or on-premises deployment with visual editor and autoscaling. Targets regulated industries requiring data residency control.

  • Visual pipeline editor
  • Autoscaling infrastructure
  • Cloud or on-premises
  • SOC 2 Type II, ISO 27001, GDPR, HIPAA
  • NVIDIA Blackwell validated design

Pricing is based on organization size rather than per-token or usage metering, which aligns with enterprise procurement preferences and reduces cost unpredictability for high-volume document workflows.

Use cases

Financial services and regulated industries

Financial services firms deploy Haystack's pipeline-centric architecture for production-grade RAG implementations where auditability and maintainability matter. The framework's serializable pipeline format and built-in evaluation tools support the transition from prototype to enterprise-scale document processing. The NVIDIA Blackwell validation opens an additional path for banks and insurers operating under data residency requirements that prohibit cloud processing. No benchmark data or customer references have been disclosed for this configuration; buyers should seek independent corroboration before committing to this deployment path.

German public administration

deepset's selection by Germany's Federal Ministry for Digital Affairs to support 18 AI agent pilots across 17 municipalities represents the most concrete government validation in the company's history. The pilots focus on document analysis, application review, and administrative process acceleration. deepset is also partnering with Kreis Borken as part of the "Agentic AI Hub." Success here could establish Haystack as the reference orchestration layer for European public sector AI, creating a precedent that influences procurement across EU member states.

Sovereign and air-gapped deployments

Enterprises in defense, government, and regulated industries that cannot route documents through cloud APIs can deploy Haystack on NVIDIA Blackwell GPU infrastructure through the validated NVIDIA Enterprise AI Factory design. The Meta Llama Stack integration combines Haystack's orchestration with Llama's deployment infrastructure for sovereign, domain-specific applications. secunet Security Networks AG's reference architecture for processing classified information at SCCON25 demonstrates adoption in high-security environments. As Rusic framed it: "Customizability, deployment flexibility, and Open Source are the cornerstones of Sovereign AI, not only in Germany but across the globe."

Enterprise AI agent development

Developers use deepset's MCP server to build and manage document processing pipelines through conversational AI interfaces in Cursor and Claude Desktop. The 30-tool integration covers pipeline creation, debugging, and index management through natural language, reducing the technical barrier for teams that need to iterate quickly on document workflows without deep framework expertise.

German-language document processing

European organizations use deepset's specialized German embedding model for localized document analysis and semantic search. The 1,024-dimension model processes both text and images, addressing regional language requirements in German-speaking markets at $0.01 per million tokens. This price point makes it viable for high-volume document workflows.

Technical specifications

Feature Specification
Core product Haystack (open-source framework)
Enterprise product Haystack Enterprise Platform, Haystack Enterprise Starter
Architecture Modular DAG pipeline with serializable components
AI agent integration MCP server with 30 specialized tools (July 2025)
Language models German embedding model (1,024 dimensions, 512 max tokens, $0.01/M tokens)
Database integrations SingleStore, Elasticsearch, OpenSearch, 80+ providers
Deployment Docker, Kubernetes, Ray, serverless, REST APIs, on-premises (NVIDIA Blackwell)
Environment support Cloud, VPC, on-premises, air-gapped
On-premises validation NVIDIA Enterprise AI Factory validated design (Blackwell GPUs); vendor-reported, no independent benchmark data available
Compliance SOC 2 Type II, ISO 27001, GDPR, HIPAA
Programming language Python 3.9+
License Apache 2.0 (open-source components)
Framework versions v2.27.0 (multimodal, security fix), v2.26.0 (Jinja2 agents, LLMRanker, async)

Competitive position

deepset competes on three fronts simultaneously. Against general-purpose LLM frameworks like LangChain and LlamaIndex, deepset emphasizes production-readiness, governance, and enterprise support over rapid prototyping. Against monolithic intelligent document processing (IDP) platforms like ABBYY and UiPath, deepset offers modular, open-source alternatives that give engineering teams full control over the processing stack. Against in-house engineering efforts, Haystack provides pre-built components, evaluation tooling, and maintainer support that reduce time-to-production.

The sovereign AI positioning creates a distinct competitive moat in European regulated markets. No major US-headquartered IDP platform has achieved equivalent government validation in Germany. The Deutschland-Stack selection and Federal Ministry pilot program give deepset reference customers that competitors cannot easily replicate.

For teams evaluating open-source document processing frameworks, competing approaches include Docling from IBM Research (MIT license, strong layout preservation) and Unstructured (ETL-focused, 25+ file types). A direct comparison is available in the Unstructured competitive analysis. For broader context on self-hosted options, see the self-hosted document processing guide and the document processing for RAG guide.

Source note: The NVIDIA Blackwell validation and government pilot figures are vendor-reported. No independent benchmarks, third-party audits, or customer references have been published for these claims as of April 2026.

Resources

Company information

Headquarters: Berlin, Germany

Founded: 2018

Founders: Milos Rusic (CEO), Malte Pietsch, Timo Möller

Employees: 51-200