Consider two data points from ASER 2024. Only 23.4% of Grade 3 children can read a Grade 2-level text. And by Grade 5, while 48.7% of children can read fluently, only 30.7% can solve a basic division problem — an 18 percentage-point gap that no state in India has yet closed. The World Bank calls the reading dimension of this condition Learning Poverty. But the "N" in FLN — numeracy — is trailing even further behind, largely out of the headline. India, in the space of just five years, has decided it will no longer tolerate either.

The Scale of What Is Happening

India's FLN movement is not a single programme — it is an ecosystem in rapid motion. The National Education Policy 2020 mandated a decisive shift away from rote memorisation, toward foundational literacy and numeracy (FLN) as a non-negotiable baseline for every child. NIPUN Bharat, launched in 2021, gave that mandate a measurable target: universal FLN competency by Grade 3 by 2026–27. The Union Budget 2024–25 allocated over ₹1.48 lakh crore to school education under Samagra Shiksha, with a substantial and growing share ring-fenced for FLN capacity and implementation. Philanthropic co-investment from CSR funders and bilateral agencies adds hundreds of crores more annually.

LLF, Pratham, Room to Read, Central Square Foundation, Kaivalya Education Foundation, and dozens of state-level organisations are simultaneously running FLN and grade-transition interventions across tens of thousands of schools. States like Haryana, Rajasthan, Uttar Pradesh, Madhya Pradesh, and Odisha have embedded FLN frameworks directly into their school education governance.

43–55%
of Indian children unable to read age-appropriate text — Learning Poverty rate (World Bank / PARAKH 2024)
23.4%
of Grade 3 children can read a Grade 2-level text (ASER 2024)
18 pts
Literacy–Numeracy gap by Grade 5: 48.7% read fluently, only 30.7% can divide (ASER 2024)
10
states where LLF alone is working toward 25% learning poverty by 2030
2026–27
NIPUN Bharat target year for universal FLN competency by Grade 3

What all this activity generates is a steady and growing pipeline of evaluation mandates: baselines before programme roll-out, midline checks during implementation, and endlines to assess impact. Each demands a specific set of capabilities — and very few consulting firms in India combine all of them under one roof.

Critically, NEP 2020 defines the foundational stage as ages 3–8 — not Grade 1 onwards. With 83.3% of 4-year-olds now enrolled in some form of pre-primary institution (ASER 2024), largely through Anganwadi centres and Balvatikas, the evaluation clock must start before school entry. Grade 3 outcomes cannot be accurately evaluated without understanding school-readiness at the pre-primary level. The root of Grade 3 failure often lies at Grade 0 — and Early Childhood Care and Education (ECCE) baseline assessments are rapidly emerging as a non-negotiable upstream capability.

The FLN evaluation space in India is no longer niche. It is becoming one of the most consequential fields in development consulting — and the organisations commissioning this work are increasingly sophisticated in what they demand.

— RAYSolute Consultants, Education Evaluation Practice

What Makes FLN Evaluation Genuinely Difficult

Anyone who has attempted to assess foundational learning outcomes in rural government schools will tell you: this is not a standard M&E exercise. The challenges are distinctive, layered, and underestimated by most generalist firms entering the space.

1
Assessment Tool Design Is Not Trivial
Tool Development Challenge

FLN competencies span decoding, reading fluency, comprehension, oral communication, number recognition, arithmetic, and increasingly, higher-order thinking skills like inference, problem-solving, and reasoning. Instruments must be contextualised for local languages — Hindi, Haryanvi, regional dialects — and psychometrically validated. Borrowing wholesale from ASER or PIRLS without adaptation produces tools that mislead rather than illuminate.

2
Sampling Requires Statistical Precision
Design Challenge

With intervention groups of 100–300 schools spread across multiple blocks, getting the sample design right — accounting for clustering effects, Minimum Detectable Effect Size (MDES), intra-cluster correlation, and block-level stratification — is the difference between a finding that stands and one that is challenged at the reporting stage.

3
Field Execution in Rural Contexts Is Demanding
Operations Challenge

Schools are dispersed, often under-resourced, and enumerators must establish inter-rater reliability before data collection even begins. One-visit-per-school models must combine classroom observation, student assessment, and teacher interviews — all without disrupting the school day or biasing teacher behaviour.

4
The Analytical Demand Is Rising
Analysis Challenge

Modern FLN evaluations are no longer satisfied with simple pre-post comparisons. Programme teams and funders want subgroup disaggregation by gender, geography, and prior learning level; teaching-learning practice indices; and the application of evaluation frameworks like OECD-DAC criteria to assess relevance, coherence, efficiency, impact, and sustainability — not just effectiveness.

5
Regional Heterogeneity Is Massive — and Frequently Ignored
Governance Challenge

The PARAKH Rashtriya Sarvekshan 2024 reveals stark intra- and inter-state disparities that generic national frameworks obscure. States like Punjab, Himachal Pradesh, and Kerala are excelling; others trail significantly on the same metrics. A rigorous evaluation does not simply measure student outcomes — it measures the systemic capacity of local governance to sustain those outcomes. Evaluation sampling must account for varying levels of state administrative capacity, block-level PMU strength, and district governance quality, not only student demographics or school type.

6
The Digital Variable Cannot Be Assessed With Paper-Only Frameworks
EdTech Challenge

NIPUN Bharat has placed significant emphasis on platforms like DIKSHA for blended learning delivery, and personal smartphone ownership among the 14–18 age cohort in rural India now stands at 74.2% (ASER Beyond Basics, 2023) — a figure that has been rising steadily year-on-year. The medium of instruction is changing. Modern FLN evaluations must assess the efficacy of digital and blended pedagogical tools alongside traditional classroom observation — including human-computer interaction quality, platform usage fidelity, and whether digital access is translating into learning gain or merely screen time. Evaluators who cannot assess this dimension are already behind the programme design curve.

The FLN Frontier: Grade 4 and Beyond

Having invested heavily in Grades 1–3, the system is now grappling with a critical question: what happens to children who did not achieve FLN benchmarks by Grade 3? They do not disappear — they move into Grades 4 and 5 with compounding deficits, unable to access grade-appropriate curriculum, and increasingly disengaged.

This has given rise to a new category of intervention — one that simultaneously addresses foundational remediation, grade-appropriate competency building, and the development of higher-order thinking skills: critical reasoning, expressive writing, academic vocabulary, oral communication, and mathematical problem-solving.

Skill Domain Current Evaluation Demand Complexity Level
Foundational Literacy (Gr. 1–3) Established — ASER tools widely used; strong baseline High Volume
Foundational Numeracy — "FLN+" (Gr. 1–5) Severely underserved — fractions, place value, multi-digit operations rarely assessed independently of literacy; the 18-pt gap with reading fluency demands dedicated numeracy instruments. Note: "FLN+" refers to the extended numeracy continuum through Grade 5, beyond the Grade 3 NIPUN benchmark. Supply Gap
ECCE / Pre-Primary School-Readiness (Ages 3–6) Emerging — 83.3% of 4-year-olds now enrolled (ASER 2024); NEP 2020 defines the foundational stage as ages 3–8, yet Anganwadi/Balvatika baselines remain rare Supply Gap
Grade-Appropriate Language & Maths (Gr. 4–5) Rapidly growing — few validated tools exist for this cohort Growing Fast
Academic Language & Vocabulary Emerging — new frontier beyond FLN; limited Indian benchmarks Supply Gap
Critical Thinking & Reasoning Emerging — aligned with NEP 2020; very few agencies equipped to assess Supply Gap
Digital Blended Learning Efficacy Growing — NIPUN/DIKSHA platforms and edtech tools require evaluation of digital pedagogy, not just offline outcomes; 74.2% personal smartphone ownership among rural 14–18 cohort (ASER Beyond Basics, 2023) signals accelerating household device access Growing Fast
Teaching-Learning Practice Observation High demand — funders require classroom practice indices, not just outcomes Growing Fast

Demand mapping based on active RFP pipelines and programme evaluation trends observed across India's FLN ecosystem, March 2026.

Evaluating Grade 4–5 and higher-order thinking programmes demands a step up in methodological sophistication. The tools must assess not only reading and arithmetic but also inference, argumentation, and procedural fluency. Observation instruments must capture not just whether a classroom activity occurred but whether it created genuine cognitive demand. This is the challenge that will dominate the evaluation pipeline over the next three to five years.

Numeracy deserves particular attention here. Mathematics is strictly cumulative: if place value, basic operations, and number sense are not solidified in the foundational years, the jump to fractions, decimals, and multi-digit division in Grades 4–5 collapses entirely. ASER 2024 data confirm this — the 18 percentage-point gap between reading fluency and division competency by Grade 5 is not a coincidence; it is the compounding consequence of an ecosystem that has long treated literacy as the primary FLN headline. Funders and programme designers are now urgently seeking evaluators who understand why children can decode text but cannot "read numbers" — and who can design instruments that diagnose why numeracy is failing, not merely whether it is.

What It Takes to Get This Right

The organisations commissioning FLN evaluations — LLF, Pratham, Central Square Foundation, Aga Khan Foundation, and state government bodies — are not looking for a data collection vendor. They are looking for a thought partner who understands the education system, brings technical depth, and can deliver findings that are actionable.

01
Deep Education System Knowledge Foundation
Understanding how government primary schools in North and Central India actually function — teacher pressures, ABRC mentoring realities, multigrade classroom dynamics, and how state policy translates (or does not) into classroom practice. ASER 2024 data show that over 52% of government primary schools have fewer than 60 students, making two-thirds of Grade 1–2 classrooms effectively multigrade. Evaluation tools and classroom observation protocols must be designed for this reality — not for an idealised single-grade setting — and must be capable of assessing Teaching at the Right Level (TaRL) methodologies where children from different grades learn together.
02
Technical Rigour in Tool Development Core Capability
Designing contextualised, validated assessment instruments grounded in recognised frameworks — ASER, PIRLS, ASSET — while building in the local linguistic and pedagogical context that makes tools credible and defensible in field settings.
03
Statistical Precision in Design Core Capability
Sampling designs built on power calculations, MDES thresholds, and clustering adjustments — ensuring evaluation findings can bear the weight of programmatic and policy decisions rather than being challenged at the reporting stage.
04
Field Operations That Produce Clean Data Differentiator
Trained enumerator teams with inter-rater reliability protocols, digital data collection with real-time quality checks, and supervisory field presence that ensures data fidelity throughout the collection window.
05
Analysis and Reporting That Drives Decisions Differentiator
Deliverables that programme teams and funders can act on — disaggregated, contextualised, and framed against policy realities. Not compliance documents. Evidence for decisions.
06
State-Capacity and Governance Lens Differentiator
Reading PARAKH 2024's inter-state variance correctly means understanding that evaluation sampling must account for governance capacity — block PMU strength, district data systems, and administrative absorptive capacity — not just student demographics. In high-heterogeneity states, a single aggregate score actively misleads funders and programme teams.
07
Digital and Blended Learning Evaluation Differentiator
As DIKSHA, edtech pilots, and smartphone-enabled learning become mainstream within NIPUN Bharat, evaluation must assess platform fidelity, digital pedagogy quality, and learning gain attribution in blended environments — not only paper-based outcomes. This is now a core competency, not an optional add-on.

The Opportunity for Programme Teams and Funders

If you are designing an FLN or grade-transition intervention — whether at district, state, or national scale — the evaluation question cannot be an afterthought. The baseline window is often only weeks before implementation begins. Assessment tools that are not validated in advance produce data that cannot be compared meaningfully at endline. Sampling designs without statistical power produce findings that cannot be generalised or defended.

Equally, if you are a state government, CSR funder, or multilateral agency reviewing FLN programmes for continued investment, independent evaluation by a technically credible firm is not a cost — it is the mechanism by which you protect the value of everything else you have funded.

India's FLN moment is real and urgent. The policy window is open. The funding is moving. The programmes are being designed. What this moment needs — and what remains in short supply — is evaluation that is technically excellent, contextually grounded, and genuinely independent.

— Aurobindo Saxena, RAYSolute Consultants

The next frontier — Grades 4 and 5, higher-order thinking, academic language, and reasoning — is already visible on the horizon. The organisations designing these programmes are doing so now. The evaluation mandates will follow immediately. The window to establish credibility and track record in this space is the present.

An Invitation to NGOs, Funders, and Government Programme Teams

To NGOs designing FLN and grade-transition programmes: Your evaluation cannot wait until the intervention is running. The baseline window is your evidence foundation — and it must be built before implementation begins.

To CSR funders and bilateral agencies: Independent, technically rigorous evaluation is not a line item to negotiate down. It is the mechanism by which your investment becomes evidence.

To state government programme managers: The FLN data you need to defend and scale your programme to the next district requires methodology that holds up — not just numbers that look good in a presentation.

RAYSolute Consultants is ready to engage from day one of your design process.

AS
Aurobindo Saxena
Founder & CEO, RAYSolute Consultants

Aurobindo leads RAYSolute Consultants, specialising in outcome evaluation, baseline-endline studies, assessment tool development, and strategic advisory in foundational and grade-transition education. RAYSolute works with NGOs, government bodies, and development funders across India's FLN ecosystem. www.raysolute.com · +91-98913-21279