Aluma: IDP Software Vendor
On This Page
Cambridge-based intelligent document processing (IDP) vendor with a 1-10 employee team competing against enterprise-scale platforms through vertical specialization and privacy-focused architecture. Unlike cloud-first competitors such as Rossum and ABBYY, Aluma positions itself on data sovereignty: the platform does not store or retain customer documents or extracted data, targeting regulated industries where pharmaceutical logistics, legal claim processing, and HR document management require strict compliance controls.

Recent developments
Aluma launched the latest version of its platform on March 17th with enhanced AI and machine learning (ML) capabilities for document classification and data extraction. The company won "AI Product of the Year" at the DM Awards and retired its legacy API after processing 2.3 million lines, a milestone that signals platform maturity despite the team's small size. A strategic partnership with Filehound for intelligent document automation extends Aluma's channel-driven go-to-market approach.
The most significant recent development is a published pharma logistics case study documenting a production deployment for a client processing approximately 1 million delivery notes and invoices annually. The client evaluated Tungsten Automation, OpenText, and ABBYY before selecting Aluma, citing cost as the deciding factor. The deployment completed in one week and achieved 83% straight-through extraction, exceeding the client's 80% target, with 96% overall accuracy. These figures are self-reported via a vendor case study and carry no independent verification.
Privacy-first architecture
Aluma's privacy-by-design architecture differentiates it in a market dominated by cloud-first vendors. The platform handles document separation, classification, data extraction, redaction, and validation through a combination of OCR and machine learning while maintaining data security by not storing or retaining customer documents. This approach appeals to regulated industries where data sovereignty matters more than feature breadth.
The ML model updates continuously from human corrections, improving accuracy across diverse document layouts over time. This reduces dependency on manual template maintenance, a recurring pain point with rule-based systems at scale, and is particularly relevant for clients handling documents from many different suppliers in varying formats.
Vertical market focus
Pharmaceutical supply chain automation
The pharma logistics case study is Aluma's most detailed public evidence of production performance. A client processing roughly 1 million delivery notes and invoices per year evaluated Tungsten Automation, OpenText, and ABBYY and rejected all three on cost grounds before selecting Aluma. The system extracts header details, shipment dates, and purchase order references from delivery notes, and sums, invoice numbers, and PO references from invoices. Deployment completed in one week. The 83% straight-through processing (STP) rate aligns with industry benchmarks for high-volume structured logistics documents, though it falls short of the 90%+ rates claimed by some enterprise-tier vendors on constrained document types.
Aluma's focus on NLP capabilities beyond traditional OCR for handling format variability positions the platform against specialized competitors in pharmaceutical verticals, where certificates of analysis arrive from global suppliers in dozens of layouts.
Legal and compliance processing
Legal and compliance teams use Aluma to digitize physical records and process electronic files for case management. The platform performs OCR on scanned documents, classifies files by document type and matter, redacts privileged or sensitive information, and organizes content for searchability. Analytics track processing volumes and quality metrics across document collections.
High-volume invoice processing
Accounts payable departments handle invoices from thousands of vendors in varying formats. Aluma separates multi-invoice PDF files, classifies documents by vendor and invoice type, and extracts key fields including invoice number, date, amount, and line items. The platform integrates extracted data with ERP systems while routing exceptions for human review.
Technical capabilities
| Feature | Specification |
|---|---|
| Deployment | Cloud-native, subscription-based |
| Uptime | 99%+ availability claimed |
| Scale | Processes millions of documents |
| Integration | API, SDK, automation tool connectors |
| Data security | No document storage or retention by Aluma |
| Configuration | Minimal setup for complex document types |
| Learning | ML model updates from human corrections |
The platform combines OCR and ICR text extraction, automated document separation, content-based categorization, targeted data field retrieval, sensitive information redaction, barcode recognition for document routing, a validation web UI for human review, and an analytics dashboard for performance monitoring.
Market position
Aluma wins deals at the price-sensitive segment of the market rather than competing on feature parity with enterprise platforms. The pharma logistics case study makes this explicit: a client handling 1 million documents annually, a volume well within the addressable range of Tungsten Automation, OpenText, and ABBYY, rejected all three on cost and selected Aluma. The one-week deployment claim reinforces a rapid time-to-value positioning that enterprise IDP deployments, which typically run months, cannot match.
With 40% of staff having 5-7 years tenure, the small team shows unusual stability for a tech startup. That said, the 1-10 employee size limits the platform's ability to compete on enterprise feature depth against larger IDP platforms such as Hyperscience and Instabase. The emphasis on partner development and pre-sales roles indicates reliance on channel partnerships to scale beyond direct sales capacity.
Getting started
Aluma offers a trial program allowing organizations to test the platform with their own documents. Contact the team through the website to discuss implementation requirements and access trial credentials.