Scale AI
Scale AI is a data annotation and AI training platform provider that underwent a major transformation in 2025 when Meta acquired a 49% stake for $14.8 billion, transitioning from independent vendor to strategic AI infrastructure partner.

Overview
Scale AI operates a data annotation platform supporting images, video, text, audio, LiDAR, and point cloud data types. Founded in 2016, the company initially focused on data labeling services for autonomous vehicles before expanding into document processing with Scale Document AI. In May 2024, the company raised $1 billion from Nvidia, Amazon, and Meta, valuing it at nearly $14 billion.
The strategic landscape shifted dramatically in June 2025 when Meta acquired a 49% stake for $14.8 billion. As part of the deal, founder and CEO Alexandr Wang transitioned to Meta to co-lead the newly formed Meta Superintelligence Labs alongside former GitHub CEO Nat Friedman. The acquisition immediately triggered customer departures including OpenAI cutting ties and Google canceling a planned $200 million spend, while Microsoft and xAI began exploring alternatives.
Scale AI has diversified beyond traditional data labeling into military AI training applications, appearing alongside defense contractors in a market projected to reach $2.17 billion by 2030.
Key Features
- Scale Document AI: Template-free document extraction using adaptive machine learning models
- In-House OCR Engine: Proprietary text recognition based on computer vision and natural language processing
- Data Engine: RLHF, data generation, and model evaluation for training large language models
- Multi-Format Support: Processes images, video, text, audio, LiDAR, point clouds
- Human-in-the-Loop: Global network of domain expert annotators for validation
- Military Training AI: Simulation-based training programs and autonomous drone integration
Use Cases
Autonomous Vehicle Training
Scale AI's original focus area, providing labeled data for self-driving car development including LiDAR point cloud annotation and computer vision training datasets.
Financial Services Document Processing
Banks use Scale Document AI to process loan applications and compliance documents with template-free extraction and human validation for regulatory requirements.
Defense and Military Applications
Scale AI provides AI training solutions for military simulation programs, autonomous drone systems, and cyber threat training scenarios.
Technical Specifications
| Feature | Specification |
|---|---|
| Core Products | Scale Document AI, Data Engine, Scale Rapid, Scale Studio, Scale GenAI |
| Recognition Technology | In-house OCR, computer vision, NLP, adaptive ML models |
| Data Types | Images, video, text, audio, LiDAR, point clouds, documents |
| Extraction Approach | Template-free, adaptive AI |
| Integration | API, SDK, CLI tools |
| Cloud Storage | AWS S3, Google Cloud Storage, Azure Blob Storage |
| Target Industries | Autonomous vehicles, defense, financial services, healthcare |
| Deployment | Cloud-based platform |
| Annual Revenue | $1.5B ARR (as of 2025) |
Resources
Company Information
Headquarters: San Francisco, California, United States
Founded: 2016
Employees: 1,000+ (as of 2024)
Revenue: $870M (2024), $1.5B ARR (2025)
Valuation: $29B (2025)
Key Investment: Meta Platforms purchased 49% stake for $14.8B in June 2025