Skip to content

Scale AI

Scale AI is a data annotation and AI training platform provider that underwent a major transformation in 2025 when Meta acquired a 49% stake for $14.8 billion, transitioning from independent vendor to strategic AI infrastructure partner.

Scale AI

Overview

Scale AI operates a data annotation platform supporting images, video, text, audio, LiDAR, and point cloud data types. Founded in 2016, the company initially focused on data labeling services for autonomous vehicles before expanding into document processing with Scale Document AI. In May 2024, the company raised $1 billion from Nvidia, Amazon, and Meta, valuing it at nearly $14 billion.

The strategic landscape shifted dramatically in June 2025 when Meta acquired a 49% stake for $14.8 billion. As part of the deal, founder and CEO Alexandr Wang transitioned to Meta to co-lead the newly formed Meta Superintelligence Labs alongside former GitHub CEO Nat Friedman. The acquisition immediately triggered customer departures including OpenAI cutting ties and Google canceling a planned $200 million spend, while Microsoft and xAI began exploring alternatives.

Scale AI has diversified beyond traditional data labeling into military AI training applications, appearing alongside defense contractors in a market projected to reach $2.17 billion by 2030.

Key Features

  • Scale Document AI: Template-free document extraction using adaptive machine learning models
  • In-House OCR Engine: Proprietary text recognition based on computer vision and natural language processing
  • Data Engine: RLHF, data generation, and model evaluation for training large language models
  • Multi-Format Support: Processes images, video, text, audio, LiDAR, point clouds
  • Human-in-the-Loop: Global network of domain expert annotators for validation
  • Military Training AI: Simulation-based training programs and autonomous drone integration

Use Cases

Autonomous Vehicle Training

Scale AI's original focus area, providing labeled data for self-driving car development including LiDAR point cloud annotation and computer vision training datasets.

Financial Services Document Processing

Banks use Scale Document AI to process loan applications and compliance documents with template-free extraction and human validation for regulatory requirements.

Defense and Military Applications

Scale AI provides AI training solutions for military simulation programs, autonomous drone systems, and cyber threat training scenarios.

Technical Specifications

Feature Specification
Core Products Scale Document AI, Data Engine, Scale Rapid, Scale Studio, Scale GenAI
Recognition Technology In-house OCR, computer vision, NLP, adaptive ML models
Data Types Images, video, text, audio, LiDAR, point clouds, documents
Extraction Approach Template-free, adaptive AI
Integration API, SDK, CLI tools
Cloud Storage AWS S3, Google Cloud Storage, Azure Blob Storage
Target Industries Autonomous vehicles, defense, financial services, healthcare
Deployment Cloud-based platform
Annual Revenue $1.5B ARR (as of 2025)

Resources

Company Information

Headquarters: San Francisco, California, United States

Founded: 2016

Employees: 1,000+ (as of 2024)

Revenue: $870M (2024), $1.5B ARR (2025)

Valuation: $29B (2025)

Key Investment: Meta Platforms purchased 49% stake for $14.8B in June 2025