Scale AI

Organization

A data labeling company crucial for training large language models. Meta acquired a 49% stake for over $14 billion in what is described as a 'shadow aqua-hire', effectively taking Scale AI's resources for itself after its major customers, OpenAI and Google, cancelled their contracts.


entitydetail.created_at

7/19/2025, 8:28:51 AM

entitydetail.last_updated

7/22/2025, 5:14:29 AM

entitydetail.research_retrieved

7/19/2025, 8:31:56 AM

Summary

Scale AI, Inc. is a San Francisco-based data annotation and AI company founded in 2016, with approximately 900 employees. It specializes in providing high-quality training data, data labeling, and model evaluation services for artificial intelligence applications, including computer vision, autonomous vehicles, and large language models (LLMs). Its research division, the Safety, Evaluation and Alignment Lab, focuses on assessing and aligning LLMs through initiatives like Humanity's Last Exam. Scale AI operates subsidiaries Remotasks and Outlier for data labeling. The company serves a wide range of commercial clients such as Etsy, General Motors, OpenAI, PayPal, Pinterest, Samsung, Toyota, Uber, Microsoft, Meta, Fox, Accenture, Cohere, Brex, and OpenSea. It also collaborates with governments, including the United States on military projects and Qatar for social programs. In a significant development, Meta Platforms announced an agreement in June 2025 to acquire a 49% stake in Scale AI for $14.3 billion, a strategic move by Meta to enhance its AI capabilities amidst intense industry competition.

Referenced in 1 Document
Research Data
Extracted Attributes
  • Name

    Scale AI, Inc.

  • Type

    Data annotation company, AI company

  • Founded

    2016-01-01

  • Mission

    Accelerate the development of AI applications by providing high-quality data

  • Services

    Data labeling, model evaluation, high-quality training data for machine learning models, data annotation (automated, human-only, human-in-the-loop), Reinforcement Learning from Human Feedback (RLHF), data generation, model safety, model alignment

  • Employees

    900

  • Headquarters

    San Francisco, California, United States

  • Divisions/Products

    Safety, Evaluation and Alignment Lab (research arm), Remotasks (subsidiary), Outlier (subsidiary), Scale Generative AI Platform, Scale Data Engine, Scale Rapid, Scale Studio, Scale Pro

Timeline
  • Scale AI, Inc. was founded. (Source: Wikidata, Web Search)

    2016-01-01

  • Scale AI established Remotasks, a crowdworking platform for labeled data creation, particularly for computer vision and autonomous vehicles. (Source: Wikipedia, Web Search)

    2017

  • Scale AI contributed to Meta Platforms’ Purple Llama initiative, a security framework for the development of open generative AI models. (Source: Web Search)

    2023-12-XX

  • Scale AI was selected by the Department of Defense to test and evaluate its LLMs for military purposes under a one-year contract. (Source: Web Search)

    2024-02-XX

  • Meta Platforms agreed to purchase a 49% stake in Scale AI for $14.3 billion. (Source: Summary, Wikipedia, Related Documents)

    2025-06-XX

Scale AI

Scale AI, Inc. is an American data annotation company based in San Francisco, California. It provides data labeling and model evaluation services to develop applications for artificial intelligence. The company’s research arm, the Safety, Evaluation and Alignment Lab, focuses on evaluating and aligning large language models (LLMs), including through initiatives such as Humanity's Last Exam, a benchmark designed to assess advanced AI systems on alignment, reasoning, and safety. Scale AI outsources data labeling through its subsidiaries, Remotasks, which focuses on computer vision and autonomous vehicles, and Outlier, which focuses on data annotation for LLMs. Scale AI's customers in the commercial sector include Etsy, General Motors, OpenAI, PayPal, Pinterest, Samsung, Toyota, and Uber. The company also directly works with world governments, including the United States on multiple military-related projects, and with Qatar to improve the efficiency of its social programs. In June 2025, Meta Platforms agreed to purchase a 49% stake in Scale AI for $14.3 billion.

Web Search Results
  • What is Scale AI? The Ultimate Guide to the Data Engine Powering ...

    ## What is Scale AI? A Comprehensive Definition Scale AI is a data-centric artificial intelligence company that specializes in providing high-quality training data for machine learning models. Founded in 2016, Scale AI has evolved from a data annotation service to a comprehensive data platform that helps companies develop, improve, and deploy AI models across various industries. [...] ## What is Scale AI? A Comprehensive Definition Scale AI is a data-centric artificial intelligence company that specializes in providing high-quality training data for machine learning models. Founded in 2016, Scale AI has evolved from a data annotation service to a comprehensive data platform that helps companies develop, improve, and deploy AI models across various industries. [...] For businesses looking to implement AI solutions, understanding Scale AI’s capabilities represents an important step in developing a comprehensive AI strategy. As the company continues to evolve and expand its offerings, it remains at the forefront of solving one of the most significant challenges in artificial intelligence: turning raw data into intelligent systems that deliver real-world value. Other Articles: The Future of Energy AI and Climate Tech AI and Sports Broadcasting

  • Scale AI - Wikipedia

    Scale AI, Inc. is an American data annotation company based in San Francisco, California. It provides data labeling and model evaluation services to develop applications for artificial intelligence. [...] In 2017, Scale AI established Remotasks, a crowdworking platform to support the creation of labeled data for machine learning, particularly in areas such as computer vision and autonomous vehicles. The subsidiary has facilities in Southeast Asia and Africa. [...] In December 2023, Scale AI was among a list of companies that contributed to Meta Platforms’s Purple Llama initiative, a security framework for the purpose of development of open generative AI models. In February 2024, Scale AI was selected by the Department of Defense to test and evaluate its LLMs for military purposes under a one-year contract.

  • How To Scale AI In Your Organization - IBM

    Scaling AI involves expanding the use of machine learning (ML) and AI algorithms to perform day-to-day tasks efficiently and effectively, matching the pace of business demand. To achieve this, AI systems require robust infrastructure and substantial data volumes to maintain speed and scale. [...] Scaling AI involves an iterative process that requires collaboration across multiple teams, including business experts, IT and data science professionals. Business operation experts work closely with data scientists to make sure that AI outputs align with organizational guidelines. Retrieval augmented generation (RAG) can optimize AI outputs based on organizational data without modifying the underlying model. [...] # How to scale AI in your organization ## Authors Content Writer IBM Consulting Inbound Content Lead, AI Productivity & IBM Consulting ## How to scale AI in your organization Scaling artificial intelligence (AI) for your organization means integrating AI technologies across your business to enhance processes, increase efficiency and drive growth while managing risks and elevating compliance.

  • Scale AI - Contrary Research

    Scale AI’s core value proposition is built around the data engineering component of this lifecycle. Specifically, Scale AI helps companies with data annotation and labeling of “ground truth” data. This ground truth data refers to correctly labeling data in an expected format, such as tagging a picture of a cat as a “cat” or assisting in differentiating a dog from a cat in an image. [...] Scale AI offers a comprehensive approach to data labeling by offering automated data labeling, human-only labeling, and human-in-the-loop (HITL) labeling, each with distinct advantages. Automated data labeling utilizes custom machine learning models to efficiently label large datasets with well-known objects, significantly accelerating the labeling process. However, it requires high-quality ground-truth datasets to ensure accuracy and struggles with edge cases. [...] As part of the data engine offering, Scale AI offers three distinct data annotation solutions tailored to different needs: Scale Rapid, Scale Studio, and Scale Pro.

  • Scale AI | LinkedIn

    At Scale, our mission is to accelerate the development of AI applications. We believe that to make the best models, you need the best data. The Scale Generative AI Platform leverages your enterprise data to customize powerful base generative models to safely unlock the value of AI. The Scale Data Engine consists of all the tools and features you need to collect, curate and annotate high-quality data, in addition to robust tools to evaluate and optimize your models. Scale powers the most [...] advanced LLMs and generative models in the world through world-class RLHF, data generation, model evaluation, safety, and alignment. Scale is trusted by leading technology companies like Microsoft and Meta, enterprises like Fox and Accenture, Generative AI companies like Open AI and Cohere, U.S. Government Agencies like the U.S. Army and the U.S. Airforce, and Startups like Brex and OpenSea.

Location Data

Scale, DeWitt, Clinton County, Iowa, United States

industrial

Coordinates: 41.8134957, -90.5328116

Open Map