Manuscripts & Inscriptions Digitization Foundation

A unified platform that hosts India's digitized manuscripts and inscriptions — building a searchable, AI-ready repository for civilizational intelligence.

What We Do

India's Civilizational Data, Reclaimed

India holds the world's largest untapped dataset — 10 million manuscripts locked away in fragmented repositories across the country. MIDF consolidates this fragmented wealth into an AI-ready repository, transforming our invisible heritage into a functional, future-ready asset.

MIDF operates as an execution engine that actively governs the digitization supply chain—deploying strategic capital and enforcing rigorous technical protocols to ensure every historical asset is preserved to uncompromising, conservation-grade standards.

The Problem

10 million manuscripts scattered across incompatible, unsearchable digital silos. Physical manuscripts degrade daily. Knowledge is invisible to AI models and researchers.

Our Solution

A unified platform with OCR, transcription, annotation, and multi-script support — powered by AI-ready metadata and maintained through strict quality standards.

Our Differentiation

Ownership. Custodians retain full ownership of their content. We provide the vault; they hold the key. Plus, we actively manage digitization through grants and standards.

The Problem

The Critical Execution Vacuum

Government bodies have successfully preserved millions of physical items, but without aggressive deep-learning HTR and robust APIs, our 10th-century IP remains locked. While we wait, the rest of the world builds the digital economy on their heritage.

10M+

Total Manuscripts

The estimated volume of India's intellectual heritage sitting in physical archives.

6.8%

Documented (NMM)

Roughly cataloged by national missions, but still physically inaccessible to researchers and the public.

0.8%

Digitized (Images)

Scanned as incompatible JPEGs and PDFs. Locked in digital silos. Machine-invisible.

0.2%

Searchable Data

The tiny fraction that can actually be read, trained upon by AI, or queried by global researchers.

The Solution

A Unified Platform for India's Written Heritage

MIDF is an indigenous, online repository that aggregates India's manuscripts and inscriptions into a searchable, machine-readable knowledge graph — for researchers, institutions, AI models, and the public at large.

1

Beyond static storage — an actively evolving, machine-readable intelligence hub.

2

An institutional partner — architectured to protect sovereignty and foster absolute trust.

Browse 175+ Records on Smriti →

Searchable Archive

Upload, search, OCR/transcription, and annotation — turning raw images into intelligent, queryable data.

Multi-Script Compatible

Supporting multi-script and multi-language compatibility so knowledge is discoverable across all languages.

AI-Ready Repository

Structured metadata and machine-readable formats designed to power the next generation of AI models.

Secure & Trusted

Custodians retain full ownership of their physical and digital content. We provide the vault; they hold the key.

Ancient stone inscription with carved script

The Strategy

Building the Digital Infrastructure

At MIDF, our Infrastructure + Capital architecture transforms invisible heritage into a globally accessible asset.

Digital Platform

We are building an indigenous repository that converts fragmented manuscripts into a dynamic, machine-readable knowledge graph. It delivers multi-script compatibility, universal search, and advanced AI-driven OCR.

Strategic Capital & Execution

Operating as an execution engine, we issue strategic grants to digitize at-risk collections directly at the source. Simultaneously, our dedicated in-house epigraphy & manuscripts scholarly teams leverage advanced technology to decode ancient records at scale.

Enterprise-Grade Standards

True digitization goes beyond static images. We enforce strict, conservation-grade protocols across our entire supply chain, guaranteeing that every uploaded asset is uniform, future-proof, and inherently AI-ready for researchers worldwide.

Mission & Vision

Preserving the Past, Powering the Future

Our Mission

To build the digital infrastructure that standardizes India's fragmented heritage into an accessible, AI-ready asset — ensuring no manuscript is lost to time or neglect.

Our Vision

A future where India's civilizational knowledge is unified on a single unified platform — safe from loss, structurally organized, and discoverable across all languages and scripts.

The Outcome

The Vision & Goals

A future where India's fragmented heritage is unified within an AI-ready, machine-readable repository, ensuring our history is structurally organized, permanently preserved, and easily accessible.

10M+

Manuscripts Unlocked

Transforming vast, invisible physical archives into a functional, future-ready asset for the world.

100%

Machine Readable

Converting flat images into structured data specifically designed for AI model training and complex querying.

Open

Global Access

Empowering researchers, institutions, and the general public with universal access, free of restrictive paywalls.

Support the Mission

Invest in India's digital heritage infrastructure

Our Team

The people behind MIDF

RD

Rahul Dewan

Strategic Advisor & Seed Donor

A technology entrepreneur, angel investor, and founder of SangamTalks. He brings decades of experience in enterprise software architecture and currently leads the ₹1,000 crore Hindu Fund for civilisational causes.

SH

Sudarshan H.S.

Sr. Researcher & Technical Head

A historian of science and scholar of Indian epistemology, with a focus on the global transmission of ancient Indian knowledge. He also leads specialised research and educational workshops on pedagogical systems.

NB

Puranika Narayana Bhatta

Director & CFO

An IIT Bombay and IIM Calcutta alumnus with two decades of global strategy experience. He is also a location technology pioneer at Latlong and a writer whose work spans ancient Samskrita poetry and modern economics.

PP

Pooja Porwal

Chief Executive Officer

She brings seven years of experience in startup strategy and execution, and is supporting the on-ground implementation of MIDF's platform and capital model.

SB

Shreyas Bharadwaj

Director & Strategic Advisor

Part of MIDF's leadership team, contributing to the organisation's strategic direction and institutional development.

We aren't starting from scratch

We have already built the engine and tested the logistics. We just need the fuel to scale the operation nationally.

Pilot 01

eGangotri

82,000+ Texts2 Crore Pages

Rescuing India's manuscript heritage and opening it to the world forever with zero paywalls.

Read More →

Pilot 02

eSahitya

50 Lakh FoliosPalm-Leaf Rescue

Professional digitization of fragile palm-leaf and paper manuscripts, focusing on rare Kannada and Vachana literature.

Read More →

Join The Civilizational Ecosystem

Heritage preservation cannot happen in a vacuum. MIDF is built by operators, but powered by a decentralized community of scholars, developers, and institutions uniting to reclaim our history.

Custodians & Institutions

Libraries, ashrams, and families holding physical texts. Through our Quality Accelerator grants, we provide the capital and tech protocols for you to digitize your collections while retaining absolute ownership.

The Tech Guild

Open-source developers, data scientists, and ML engineers. Join the ecosystem to build custom OCR models, refine vision-language algorithms, and integrate our open API into next-gen LLM training data.

Scholars & Validators

Subject matter experts in ancient scripts and philosophy. We need a network of citizen scholars to validate AI transcriptions, add contextual annotations, and map knowledge graphs.

Join the Network

Select your role to get access to our private beta and latest updates.