Humyn Labs Commits $20 Million for Physical AI Data Infrastructure

The investment will scale data collection in the Global South to train next-generation robotics.

4/13/2026
Ghita Khalfaoui

Artificial intelligence firm Humyn Labs has announced a significant $20 million commitment to expand its data collection infrastructure for physical AI systems. This strategic investment, self-funded through company revenue, aims to bridge the critical gap between laboratory-tested AI and real-world deployment. Co-founded by Manish Agarwal and Ishank Gupta, the company will focus on gathering high-quality visual, movement, and voice data to power the next generation of robotics.


Bridging the Real-World Deployment Gap

Many advanced AI systems face a 'deployment gap': performance drops sharply in uncontrolled commercial settings because the data encountered there differs from the data the system was trained on. A model that succeeds in the lab can become markedly less reliable when it encounters new lighting, backgrounds, or textures. Humyn Labs addresses this problem, known as 'distribution shift,' by sourcing varied training data to make AI more robust and dependable for commercial use.

The company's core strategy involves scaling egocentric, source-first data collection across diverse global regions, including India and Latin America. This method captures first-person human activity, providing rich, context-aware datasets on how people navigate and interact with their surroundings. These datasets are crucial for training physical AI systems to operate effectively in complex, real-world environments.

Enhancing Voice and Simulation Capabilities

Recognizing voice as a critical interface for human-robot interaction, Humyn Labs is expanding its capabilities to support 33 languages, dialects, and accents. The initiative is intended to help AI systems understand and respond to human commands accurately, with contextual and cultural nuance. The investment will bolster the infrastructure needed to capture these complex voice patterns for more natural interactions.

To further accelerate development, the company will establish dedicated Robotics Labs for building high-fidelity simulation environments and world models. These labs will allow AI systems to be trained and tested in virtual settings before being deployed in the physical world. This approach integrates real-world data with advanced training frameworks, making the deployment process faster and more efficient.

Strategic Growth and Market Outlook

Humyn Labs is financing this expansion through its own revenue and currently operates at an annualized run rate of approximately $4 to $5 million. Co-founder Manish Agarwal noted a strong sales pipeline of around $50 million and projections of reaching $100 million in annual recurring revenue by late 2026. The company targets top-tier technology labs, where successful proofs-of-concept can scale into substantial contracts.

The founders position Humyn Labs as a key enabler of next-generation AI, providing the foundational human data infrastructure that all real-world systems will require. This move comes as the global AI training data market is projected to exceed $23 billion by 2034. By focusing on the technically demanding segment of physical AI validation, the company aims to operate at the industry's frontier.


Humyn Labs' $20 million investment marks a strategic push to solve one of AI's most persistent challenges: real-world applicability. By focusing on diverse, high-quality egocentric and voice data, the company is building the essential infrastructure for more reliable and adaptable physical AI. This initiative not only strengthens its market position but also accelerates the transition of intelligent systems from controlled labs to the complexities of our daily world.