Clairva Raises $500K Pre-Seed Funding from Venture Catalysts
Clairva is an AI data infrastructure startup building licensed, provenance-backed datasets for training artificial intelligence models.

AI data infrastructure startup Clairva has raised $500,000 in a pre-seed funding round led by Venture Catalysts through its angel investment network.
The company plans to utilise the fresh capital to expand its licensed data sourcing network, strengthen collaborations with content owners and institutions, enhance its data enrichment and validation capabilities, and accelerate business development efforts with AI companies globally.
Founded in 2025 by Sunil Nair, Sabari Raju, Dushyant Verma, and Amit Parashar, Clairva is building a licensed data infrastructure platform focused on creating provenance-backed datasets for artificial intelligence applications, including foundation models, embodied AI, robotics, and autonomous systems.
As AI models become increasingly dependent on high-quality training data, access to datasets with verified ownership rights, clear provenance, and diverse real-world representation has emerged as a key challenge. Clairva aims to address this gap by partnering with content creators, production houses, studios, archives, institutions, and contributor networks to source, license, organise, and prepare datasets for AI development.
The startup is currently focusing on India, Southeast Asia, and other Global South markets, with an emphasis on improving representation of regional languages, environments, behaviours, gestures, workflows, and real-world scenarios that remain underrepresented in many existing AI datasets.
Clairva is also developing technology solutions across the AI data pipeline, including licensed dataset ingestion, rights and provenance management, automated data enrichment, metadata generation, object and action tagging, temporal segmentation, quality validation, and dataset packaging.


