Parsewise — Document Extraction for Investors
AI-powered document extraction platform for investment teams and underwriters, founded by Y Combinator alumni in 2024.
Overview
Parsewise was founded in 2024 by Max Hofer (CEO) and Greg Csegzi (CTO) as part of Y Combinator's Spring 2025 batch. The London-based startup targets investment management, reinsurance, and life sciences sectors with AI-powered document package processing.
Despite Y Combinator backing, Parsewise remains absent from major IDP vendor comparisons and industry roundups as of early 2026, indicating early-stage market positioning compared to established players like ABBYY and UiPath.
The platform differentiates through end-to-end traceability - every extracted data point links to source locations with bounding boxes. Business experts retain granular control over extraction logic rather than relying on black-box AI outputs. Max Hofer brings a PhD in Computer Science and Economics plus Bain experience, while Greg Csegzi comes from Palantir with life sciences and insurance sector expertise.
Key Features and Benefits
- Source Traceability: Bounding box highlighting links every extracted data point to exact document locations
- Expert Control: Business users directly modify extraction logic and validation rules
- Document Package Processing: Handles collections of related documents with cross-referencing
- Multi-Format Support: PDF, DOCX, XLSX, and PPTX processing
- GDPR Compliance: Customer documents never used for model training
Use Cases
Investment Due Diligence
Investment teams process pitch decks, financial statements, and legal documents during due diligence. The platform extracts financial metrics and flags inconsistencies across document packages, allowing analysts to focus on evaluation rather than manual data collection.
Insurance Underwriting
Reinsurance companies automate policy document processing and risk assessment workflows. Transparent traceability allows underwriters to verify extracted underwriting parameters and adjust logic for specific policy types.
Life Sciences Regulatory Review
Clinical trial documentation and regulatory submission processing with audit trail maintenance. Compliance teams verify extracted data through source highlights and adjust validation rules for regulatory requirements.
Technical Specifications
| Feature | Specification |
|---|---|
| Core Technology | AI extraction with human-in-the-loop validation |
| Supported Formats | PDF, DOCX, XLSX, PPTX |
| Key Differentiator | End-to-end traceability with bounding boxes |
| Data Security | Encryption in transit and at rest |
| Compliance | GDPR compliant |
| Training Policy | Customer documents never used for model training |
| Target Industries | Investment management, reinsurance, life sciences |
Resources
Company Information
Headquarters: London, United Kingdom
Founded: 2024
Founders: Max Hofer (CEO), Greg Csegzi (CTO)
Accelerator: Y Combinator Spring 2025