Close Menu
    What's Hot

    Tim Heidecker Wants to Turn Infowars Into Adult Swim for the Internet

    England captain Nat Sciver-Brunt to miss next two Women’s T20 World Cup matches with calf injury | Cricket News

    Floyd Mayweather facing felony charges for passing bad check

    Facebook X (Twitter) Instagram
    Trending
    • Tim Heidecker Wants to Turn Infowars Into Adult Swim for the Internet
    • England captain Nat Sciver-Brunt to miss next two Women’s T20 World Cup matches with calf injury | Cricket News
    • Floyd Mayweather facing felony charges for passing bad check
    • CVS Is Switching to Aluminum Pill Bottles
    • Traveling During a Heat Wave: Tips and Precautions
    • AWS says AI agents can work on their own. It’s also building tools to keep them in line
    • Learning from the Right Sovereign Wealth Funds by Erika Mouynes
    • California’s Vote Count: How Slowness Invites Suspicion Even When It’s Not Sketchy
    interluknewsinterluknews
    • Home
    • Business
      • Corporate News
      • Industry Insights
      • Startups & Entrepreneurship
      • Technology & Innovation
    • Economy
      • Economic Policy
      • Financial Analysis
      • Inflation & Interest Rates
      • Trade & Markets
    • Global
      • Conflicts & Security
      • Diplomacy
      • Global Trends
      • International Affairs
    • Lifestyle
      • Fashion
      • Food & Dining
      • Personal Development
      • Travel
    • Opinion
      • Columns
      • Editorials
      • Expert Opinions
      • Reader Voices
    • More
      • Politics
        • Elections
        • Government & Policy
        • International Relations
        • Political Analysis
      • Sports
        • Cricket
        • Football / Soccer
        • International Sports
        • Local Sports
      • Technology
        • Artificial Intelligence
        • Cybersecurity
        • Gadgets & Reviews
        • Tech News
      • South Africa News
    Facebook X (Twitter) Instagram
    interluknewsinterluknews
    Technology & Innovation

    Collecting robot training data is dirty, unglamorous work. Some AI labs are already paying XDOF to do it

    adminBy adminJune 17, 2026No Comments5 Mins Read
    Share Facebook Twitter Pinterest Copy Link Telegram LinkedIn Tumblr Email
    Collecting robot training data is dirty, unglamorous work. Some AI labs are already paying XDOF to do it
    Share
    Facebook Twitter LinkedIn Pinterest Email

    Two weeks ago, OpenAI said it would relaunch the robotics program it shuttered in 2021 — the latest signal that the biggest AI labs are racing to teach machines to operate in the physical world. But building capable robots requires something the AI industry doesn’t yet have, which is the training data to match that used for language models.

    That gap is creating a new kind of infrastructure business. Unlike LLMs that were trained on a vast sea of publicly available text, robots need data that captures physical interaction, and that kind of data barely exists. YouTube videos and footage captured by gig workers are low-fidelity and hard to reconcile with the physical world.

    XDOF (pronounced “ecks-doff”), emerging from stealth today, is betting that the next great bottleneck in AI isn’t models or chips, but the data feedback loop needed to teach robots how to interact with the physical world.

    The startup aims to build the data pipelines, collection tools, and annotation systems that frontier labs and robotics companies can’t easily build themselves — and has raised $70 million from Thrive Capital, Spark Capital, a16z, Lux, and WndrCo to do it. Co-founder and CEO Philippe Wu says XDOF, which has about 60 employees, is already working with 20 customers including several frontier AI labs, but cannot name them.

    “All of the top labs are trying to pursue robotics,” Wu said. “We’ve already seen some of the downfalls of falling a little bit behind in the language model race … you don’t want to be in this type of situation where you pursue this technology too late, and everyone is in this boat where physical AI is the next frontier.”

    Wu ran into this problem himself as a PhD student at UC Berkeley. His focus was on enabling robots to learn skills from large-scale data sets. There was just one problem.

    “We didn’t have large-scale data to work with,” he told TechCrunch. “There was this chicken-and-egg problem — we first needed to actually collect data before we could even ask how to train a foundation model for robotics.”

    Wu and his future XDOF co-founder and CTO, Fred Shentu, worked on a project called GELLO, a low-cost teleoperation system that lets a human operator control a robotic arm to generate training data. “It ended up becoming a very influential paper in robotics, because a lot of people had similar needs and bottlenecks, and many started leveraging this type of device for data collection,” Wu said.

    Spotting the opportunity, Wu, Shentu, and third co-founder and Chief Operating Officer Nemo Jin launched XDOF in October 2024 to provide a data ecosystem for companies pursuing robotics models. Mindful that data provision alone can be a dead-end business, the company is also focused on data cleaning, tooling, and annotation — creating a self-reinforcing feedback loop for robot trainers.

    As a starting point, the company is partnering with UC Berkeley’s AI Research lab to release what it believes is the largest collection of high-quality robot training data ever assembled, dubbed ABC. It includes 130,000 trajectories of robot manipulation data, 300 hours of simulation, and 100 hours of evaluations. That kind of scaled-up pre-training data has never been available to academia before.

    “We’ve seen in language, image generation, and other fields, that when models and data are released, the community achieves things that you wouldn’t necessarily have expected,” David McAllister, a Berkeley PhD student who helped organize the release, told TechCrunch.

    The team has already used the data to train robots on benchmark tasks like folding T-shirts and flattening boxes, or loading AirPods into their cases.

    Unlimited degrees of freedom

    The company plans to work across three tiers of a data pyramid. The most valuable tier is teleoperation data collected on the actual robot being deployed; next comes teleoperated robots gathering more general data, as with GELLO; and finally “egocentric” data gathered by humans performing everyday tasks, for which XDOF plans to build its own wearable sensors.

    “Your camera choice is going to affect the quality of your data — which is going to affect how your hand-tracking algorithm performs,” Wu said. “If you don’t design the hardware well from the start, the data you collect might have very specific problems that you didn’t anticipate.”

    The company plans to hire and train armies of teleoperators and egocentric data operators around the world — a labor-intensive model that raises an obvious question: Why aren’t the major labs doing this data production work themselves?

    “You need a warehouse of hundreds of thousands of square feet with hundreds of robots,” Wu said. “You need to maintain these robots, calibrate their physical parameters, and properly train operators.”

    It’s a build-out that requires focus, capital, and operational scale that most AI labs would rather outsource — which is precisely the market XDOF is betting on.

    The name XDOF is a play on the robotics term “degrees of freedom,” which describes the number of independent motions a robot can perform. Your arm, from shoulder to wrist, has seven degrees of freedom. Humanoid robotics company Figure.AI’s latest robot has 30. The X in the company’s name captures its ambition: “Arbitrary degrees of freedom, unlimited degrees of freedom,” Wu says.

    When you purchase through links in our articles, we may earn a small commission. This doesn’t affect our editorial independence.

    collecting data dirty Labs paying Robot training unglamorous work XDOF
    Follow on Google News Follow on Flipboard
    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email Copy Link
    Previous ArticleAllowing Iran to Charge Fees in Strait of Hormuz Would Set ‘Dangerous Precedent,’ Maersk CEO Says
    Next Article Telegram challenges India app ban, calls move unconstitutional | Social Media News
    admin
    • Website

    Related Posts

    Tim Heidecker Wants to Turn Infowars Into Adult Swim for the Internet

    June 17, 2026

    AWS says AI agents can work on their own. It’s also building tools to keep them in line

    June 17, 2026

    Opinion | We Ran the Numbers. Remote Work Is Bad for Us.

    June 17, 2026
    Leave A Reply Cancel Reply

    Demo
    Latest Posts

    Tim Heidecker Wants to Turn Infowars Into Adult Swim for the Internet

    England captain Nat Sciver-Brunt to miss next two Women’s T20 World Cup matches with calf injury | Cricket News

    Floyd Mayweather facing felony charges for passing bad check

    CVS Is Switching to Aluminum Pill Bottles

    Latest Posts

    Subscribe to News

    Get the latest sports news from NewsSite about world, sports and politics.

    Advertisement
    Demo

    We are a digital news platform delivering timely, accurate, and insightful coverage of politics, global affairs, business, economy, sports, and more. Our mission is to keep readers informed with reliable news, clear analysis, and stories that truly matter.
    We're social. Connect with us:

    Facebook X (Twitter) Instagram Pinterest YouTube

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    Type above and press Enter to search. Press Esc to cancel.

    Powered by
    ...
    ►
    Necessary cookies enable essential site features like secure log-ins and consent preference adjustments. They do not store personal data.
    None
    ►
    Functional cookies support features like content sharing on social media, collecting feedback, and enabling third-party tools.
    None
    ►
    Analytical cookies track visitor interactions, providing insights on metrics like visitor count, bounce rate, and traffic sources.
    None
    ►
    Advertisement cookies deliver personalized ads based on your previous visits and analyze the effectiveness of ad campaigns.
    None
    ►
    Unclassified cookies are cookies that we are in the process of classifying, together with the providers of individual cookies.
    None
    Powered by