
Docufai is Ripcord's generative AI document discovery platform, enabling users to query scanned and digital documents in natural language and receive instant answers with source citations.


$32M: Series C funding (April 2024)
72%: Revenue growth in 2023
$150M: Total funding raised
$110M: Pre-money valuation (Oct 2023)

Overview

Launched in beta in November 2023, Docufai emerged from Ripcord's Document Intelligence as a Service (DIaaS) platform, which the Hayward, California company built for government and enterprise markets. The product sits on top of Ripcord's cloud-based "Canopy" content platform, which combines proprietary robotics with AI-driven extraction and serves customers including MUFG Bank, Coca-Cola Bottlers, and the IRS.

Docufai's core distinction from other document AI tools is its focus on information retrieval rather than content generation. Users upload documents in PDF, DOCX, PPTX, JPG, PNG, BMP, or TIFF formats, receive automatic summaries, and ask questions in any language regardless of the source document's language. The system integrates OpenAI's models and adds knowledge bases with logical reasoning rules to reduce hallucinated answers, a known failure mode in retrieval-augmented generation systems.
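The format list above can be expressed as a simple pre-upload check. This is a minimal sketch, not Docufai's actual validation logic: the function name `is_supported_upload` is hypothetical, and the `.jpeg` and `.tif` aliases are an assumption that the source does not confirm, so they are off by default.

```python
from pathlib import Path

# Extensions taken directly from the formats listed in the text.
SUPPORTED_EXTENSIONS = {".pdf", ".docx", ".pptx", ".jpg", ".png", ".bmp", ".tiff"}

# Assumption: common alternate spellings. The source does not say whether
# Docufai accepts these, so they are opt-in only.
ASSUMED_ALIASES = {".jpeg", ".tif"}


def is_supported_upload(filename: str, allow_aliases: bool = False) -> bool:
    """Return True if the file extension matches a listed upload format."""
    suffix = Path(filename).suffix.lower()
    if suffix in SUPPORTED_EXTENSIONS:
        return True
    return allow_aliases and suffix in ASSUMED_ALIASES
```

A check like this only inspects the file name; a production system would presumably also verify the file contents.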

In February 2024, Ripcord extended the product line with Docufai Express, which pairs robotic document scanning hardware with the generative AI interface. This combination targets organizations with large volumes of physical paper that need both digitization and immediate query access, a workflow gap that cloud-only competitors cannot address.

Ripcord closed a $32 million Series C in April 2024 led by Kleiner Perkins and Google Ventures, with FUJIFILM Business Innovation also acquiring equity. Total funding across all rounds reached approximately $150 million. The company reported 72% revenue growth in 2023, following 95%-plus growth in 2022. According to TechCrunch's October 2023 reporting, revenue reached $11.8 million in 2022, up from $5.9 million in 2021.
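The revenue figures above can be cross-checked with a few lines of arithmetic. The sketch below confirms that the 2021 and 2022 figures are consistent with the "95%-plus" growth claim (they work out to roughly 100%). It also derives an implied 2023 revenue of about $20.3 million by applying the stated 72% growth; that number is a back-of-envelope inference, not a figure reported by the source.

```python
def growth_rate(previous: float, current: float) -> float:
    """Year-over-year growth as a fraction of the previous value."""
    return (current - previous) / previous


revenue_2021 = 5.9   # USD millions, as reported
revenue_2022 = 11.8  # USD millions, as reported

growth_2022 = growth_rate(revenue_2021, revenue_2022)

# Implied, not reported: 2022 revenue scaled by the stated 72% growth.
implied_revenue_2023 = revenue_2022 * 1.72

print(f"2022 growth: {growth_2022:.0%}")
print(f"Implied 2023 revenue: ${implied_revenue_2023:.1f}M")
```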

How Docufai processes documents

Docufai's pipeline begins when a user uploads a document and the platform generates an automatic summary, extracting key information without requiring the user to read the full file. From there, users ask questions in natural language and receive answers with specific source references pointing back to the originating document and page.
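The question-to-citation step described above can be illustrated with a toy retriever. This is a sketch under stated assumptions, not Docufai's implementation: it ranks pages by plain token overlap with the question, whereas a production system would more likely use embeddings or a search index. The `Page`, `Citation`, and `cite_pages` names are hypothetical, as is the sample data.

```python
from __future__ import annotations

import re
from dataclasses import dataclass


@dataclass(frozen=True)
class Page:
    document: str
    number: int
    text: str


@dataclass(frozen=True)
class Citation:
    document: str
    page: int
    score: int


def tokenize(text: str) -> set[str]:
    """Lowercase the text and return its set of alphanumeric tokens."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def cite_pages(question: str, pages: list[Page], top_k: int = 2) -> list[Citation]:
    """Rank pages by tokens shared with the question; drop pages with no overlap."""
    query = tokenize(question)
    scored = [
        Citation(p.document, p.number, len(query & tokenize(p.text)))
        for p in pages
    ]
    hits = [c for c in scored if c.score > 0]
    hits.sort(key=lambda c: c.score, reverse=True)
    return hits[:top_k]


# Hypothetical sample corpus for illustration only.
pages = [
    Page("lease.pdf", 1, "This lease agreement begins on 1 March 2022."),
    Page("lease.pdf", 2, "Monthly rent is 4,200 dollars, payable in advance."),
    Page("policy.docx", 1, "Employees must complete security training."),
]
citations = cite_pages("What is the monthly rent?", pages)
```

Here `citations` holds a single entry pointing to page 2 of `lease.pdf`, since that is the only page sharing tokens with the question. Keeping the document name and page number alongside each score is what lets an answer point back to its source.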

The accuracy layer relies on OpenAI's language models augmented with integrated knowledge bases and logical reasoning rules. This architecture differs from a raw retrieval-augmented generation setup: the knowledge bases constrain what the model can assert, reducing the risk of fabricated citations or invented figures. Ripcord's CEO Sam Fahmy described the approach in October 2023 as "applying tech, including machine learning and generative AI, to understand and validate the data in the scanned docs."
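One simple kind of rule that could constrain model output is a figure check: every number asserted in an answer must appear in the cited source text. The sketch below shows that single rule only. It is an illustrative assumption about what a "logical reasoning rule" might look like, not a description of Ripcord's knowledge bases, and the function names are hypothetical.

```python
import re

# Matches integers and decimals, with optional thousands separators.
NUMBER = re.compile(r"\d[\d,]*(?:\.\d+)?")


def figures(text: str) -> set[str]:
    """Extract numeric figures, dropping thousands separators."""
    return {match.replace(",", "") for match in NUMBER.findall(text)}


def unsupported_figures(answer: str, cited_text: str) -> set[str]:
    """Return figures asserted in the answer that the cited text does not contain."""
    return figures(answer) - figures(cited_text)


# Hypothetical example data.
source = "Monthly rent is 4,200 dollars, payable in advance."
grounded = unsupported_figures("The monthly rent is 4200 dollars.", source)
fabricated = unsupported_figures("The monthly rent is 4,500 dollars.", source)
```

With this data, `grounded` is empty and `fabricated` contains `"4500"`, so the second answer would be flagged before reaching the user. A real validation layer would need many more rules (units, dates, entity names), but the pattern of checking claims against cited text is the same.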

Multilingual support runs across both input and output. A user can upload a Japanese regulatory filing and ask questions in English, receiving answers in English with citations to the original Japanese text. This cross-language capability is particularly relevant for Ripcord's Asia-Pacific customer base, which includes one of Japan's largest property management companies and the Fujifilm Ripcord joint venture, selected by the Japan International Cooperation Agency in 2023 for digitalization services.

The product roadmap published in October 2023 included document translation, related document discovery, and collaborative notebook features, though current availability of these capabilities has not been independently confirmed.

Use cases

Government document processing

Federal agencies use Docufai for regulatory compliance and records modernization. Ripcord secured an IRS contract worth over $4 million for tax document processing and expanded its US Air Force contract, establishing a track record in high-stakes government workflows. Analysts query policy documents, regulations, and case files conversationally and receive answers with citations. The system handles multilingual government documents while maintaining compliance controls through Ripcord's enterprise platform. Public sector agencies can access the platform through the NASA SEWP V, ITES-SW2, NASPO ValuePoint, NCPA, and OMNIA Partners contracts via the Carahsoft Technology Corp partnership established in October 2023.

Financial services due diligence

MUFG Bank is both an investor and a customer, with an annual contract value of $5 million as of October 2023. A Wells Fargo deal was reported to be in its final stages at that time. M&A and compliance teams use Docufai to query financial statements, contracts, and regulatory filings in natural language, extracting relevant figures and terms without manual document review. The source-citation requirement is particularly important in financial services, where analysts need to trace every extracted figure back to its originating document for audit purposes.

Knowledge worker document discovery

Docufai targets knowledge workers seeking ad-hoc document insights rather than IT teams automating high-volume structured extraction. This positions it differently from traditional intelligent document processing platforms focused on form-based data capture. A legal team reviewing discovery documents, a researcher querying a corpus of reports, or a compliance officer checking policy adherence across multiple filings represents the primary user profile. The free beta, available globally as of April 2024, targets this audience before converting users to paid enterprise tiers.

Technical specifications

| Feature | Specification |
| --- | --- |
| Supported file formats | PDF, DOCX, PPTX, JPG, PNG, BMP, TIFF |
| Core AI | OpenAI integration with knowledge bases and logical reasoning |
| Platform foundation | Ripcord Canopy cloud-based content platform |
| Beta launch | November 2023 |
| Docufai Express launch | February 2024 |
| Query interface | Natural language questions |
| Language support | Multilingual input and output |
| Accuracy approach | Knowledge bases and logical rules constrain LLM responses |
| Availability | Free beta (global); enterprise tier (US customers) |
| Pricing model | $0.08-$0.25 per document image scanned (Ripcord platform); subscription for Docufai with Docufai Express |
| Government contracts | NASA SEWP V, ITES-SW2, NASPO ValuePoint, NCPA, OMNIA Partners |
| Named enterprise customers | MUFG Bank, Coca-Cola Bottlers, IRS, US Air Force |
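The per-image scanning price range listed above translates into a budget range for a given backlog. The sketch below is a rough estimate only: it assumes cost scales linearly with image count and ignores volume discounts, minimums, and the separate Docufai subscription, none of which the source specifies. The function name is hypothetical.

```python
from decimal import Decimal

# Per-image rates from the pricing row above (USD).
LOW_RATE = Decimal("0.08")
HIGH_RATE = Decimal("0.25")


def scanning_cost_range(image_count: int) -> tuple[Decimal, Decimal]:
    """Return the (low, high) scanning cost in USD for a number of images."""
    if image_count < 0:
        raise ValueError("image_count must be non-negative")
    return image_count * LOW_RATE, image_count * HIGH_RATE


low, high = scanning_cost_range(100_000)
print(f"100,000 images: ${low:,.2f} to ${high:,.2f}")
```

For a backlog of 100,000 images, the range works out to between $8,000 and $25,000 under these assumptions.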

Company and funding

Ripcord's differentiation in the document AI market rests on combining proprietary robotics hardware with software-based AI, a combination that cloud-native competitors cannot replicate. The robotics layer automates physically labor-intensive tasks including staple removal, page unfolding, folder tracking, and digital twin creation before documents reach the AI layer. This hardware-software integration is the basis for Docufai Express and helps explain why FUJIFILM Business Innovation, Ripcord's partner in the Fujifilm Ripcord joint venture, joined the Series C as an equity investor.

Todd Bailey, VP of Partner and Strategic Relationships at Ripcord, stated in April 2024: "With our proprietary robotics automating labor-intensive tasks and our AI-driven platform extracting critical insights, we're transforming how businesses handle their documents. And now, with the introduction of Docufai, our GenAI document discovery platform, we're empowering users to unlock the full potential of their data like never before."

The OpenAI integration signals reliance on third-party language model infrastructure rather than proprietary models, which reduces differentiation on the AI layer but accelerates time-to-market. As the broader IDP market consolidates around cloud-native, template-free extraction, Ripcord's bet is that the physical-to-digital pipeline remains a defensible moat for regulated industries with large paper backlogs.

"Supersonic advancements in robotics and AI are fundamentally changing how we interact with and gain knowledge from documents. Ripcord is committed to leading the way in this new era and is incredibly grateful for our investors, customers, and partners for helping fuel our mission."

Sam Fahmy, CEO, Ripcord, April 2024
