Why the lesson planning debate is asking the wrong question — and what actually determines whether a teacher improves.
The Debate
The EdTech community has been arguing about lesson planning for years. On one side: standardized, scripted plans that guarantee minimum quality, reduce cognitive load, and protect the 80% of teachers who have never received training on how to plan. On the other: open-ended AI tools that give teachers autonomy — the chance to make something their own, adapt to their classroom, build professional skill.
Both sides have evidence. Both sides have failures. The debate has produced heat. It has not produced answers.
We think that's because both sides are answering the wrong question. The question is not "scripted or generative?" The question is: what gives a teacher enough of a stake in a lesson that they actually deliver it?
A Lesson from Consumer Psychology
In the 1950s, General Mills launched Betty Crocker instant cake mix. Everything was included. You added water, stirred, and baked. Sales were terrible.
Psychologists diagnosed the problem: the process was too complete. Mothers felt they hadn't actually made the cake. The product required no contribution from them — and so they felt no ownership over the outcome. General Mills' fix was counterintuitive. They removed the powdered egg from the mix. Now you had to add a real egg. Sales jumped.
The egg did nothing nutritionally significant. But it gave the maker one genuine act of contribution — and that changed the psychological relationship to the product. It was no longer a factory cake. It was her cake.
This is not a metaphor we are reaching for. It is a precise description of what we have observed in 1,101 schools across Pakistan.
What the Data Shows
We are drawing on three distinct data sources: an independent evaluation by the EdTech Hub, a teacher survey conducted by GSMA/Radicle Development across 211 teachers on our AI coaching platform in Islamabad, and our own Rumi platform database covering 4,335+ lesson plans generated by teachers across Pakistan.
Source 1 — EdTech Hub Independent Evaluation, 2024
The EdTech Hub evaluation of scripted lesson plans across Pakistan government schools found four failure modes that recur regardless of plan quality:
Teachers receive a plan but don't know why it was structured this way — no pedagogical rationale, no discussion. It lands as an instruction, not a tool.
A plan designed for Grade 4 Islamabad schools may not fit a mixed-grade classroom in Quetta. Scripted plans don't know your classroom. Teachers do.
69% of teachers report greater satisfaction with AI-assisted plans over government-issued scripts — but satisfaction doesn't automatically translate into better delivery.
The plan is delivered (or not). No one asks what worked. No one helps the teacher close the gap between what they planned and what happened in the room.
Source 2 — GSMA / Radicle Development Teacher Survey, March 2026
211 teachers. 6 focus groups. Islamabad schools using Taleemabad's AI coaching platform. The numbers are striking — but the nuance matters more.
But buried in the focus group data is the contradiction: teachers who rated the platform highly also wanted more control. More flexibility. The ability to adjust timing, sequence, and emphasis. They were not rejecting the tool. They were claiming it.
The survey's own conclusion: "Overall impact depends on the educator." The tool is not the variable. The teacher's relationship to the tool is the variable.
This is the egg. Not the mix.
Source 3 — Rumi Platform Data, Pakistan
Rumi's lesson planning data tells a story about teacher preference when given real choice. We have two plan formats: pre-generated chapter plans (curriculum-linked, fully scripted, launched March 2026) and user-generated plans (teacher specifies topic, grade, and parameters; AI builds from their inputs).
The ratio is 24:1. Teachers, when given the choice, overwhelmingly generate their own plans. This holds even when the pre-generated plans are higher quality by most objective pedagogical measures.
But here is the more interesting observation: pre-generated plans are growing fast. 10 in March. 162 in the first ten days of April. Teachers who started with user-generated plans are migrating toward structured pre-generated formats — now that those formats exist and feel complete rather than imposed.
The lesson: when the pre-generated plan is designed to accommodate agency rather than eliminate it, teachers welcome it. The issue was never scripted vs generative. It was whose hand is on the plan.
One more signal from the data: teachers generate lesson plans at 1am Pakistan time. Nobody asks them to do this. Tuesdays and Wednesdays account for 49% of all weekly generation. This is not compliance. This is profession.
What We Built and Why
In December 2023, Taleemabad — in partnership with the Ministry of Federal Education and Professional Training — built and launched the National Institute of Excellence in Teacher Education. This is not a partnership with a pre-existing institution. We designed it, built the curriculum and technology backbone, and operate the programme. NIETE is a government programme: MoFEPT launched it officially, the Federal Directorate of Education and Federal College of Education are co-partners, and LUMS School of Education contributed to curriculum design.
NIETE's design reflects exactly the lesson described in this paper. We did not build a platform that delivers scripted plans to teachers. We built a coaching environment in which teachers participate in their own development — submitting audio recordings, receiving AI-generated feedback against the OECD coaching framework, and generating their own lesson plans on Rumi.
The 69% satisfaction advantage of AI-assisted plans over government scripts — cited above from the EdTech Hub evaluation — was documented in NIETE's context. It is the egg in action.
Our Framework
Curriculum-linked, pedagogically structured plans ready to use. Designed for teachers in high-load contexts who lack time or confidence to generate from scratch. The floor, not the ceiling.
Teacher specifies topic, grade, classroom size, and parameters. AI builds the plan. The teacher made the choices. The plan is theirs. This is the egg. This is what 96% of Rumi's lesson plans are.
Teacher submits audio from their class. AI analyzes against OECD coaching framework. Feedback identifies the gap between what was planned and what happened. This is where real improvement occurs — not in planning, but in reflection.
The Argument
The EdTech field has a theory of change problem. Most lesson planning interventions assume that if you improve the plan, you improve the lesson. The evidence — including 89.7% non-engagement with scripted plans — suggests this is mostly false.
Plans don't change classrooms. Teachers change classrooms. And teachers change when three things happen in sequence:
They made a choice about this lesson. It's not something that happened to them — it's something they built, even partially.
What they planned versus what actually happened in the room. Without this, the plan is orphaned from reality.
The next lesson is better because of what they learned from the last one. Not because the platform updated. Because the teacher did.
This is not a one-time event. It is a professional identity. Teaching becomes something you improve at — not something that happens to you.
The GSMA survey confirmed this directly: "Overall impact depends on the educator." Not the tool. The educator. Every teacher who reported improved outcomes had something in common — they were using the tool actively, adapting it, treating the AI as a collaborator rather than an authority.
The 7% who showed no improvement were using the platform passively. They received the output. They did not engage with it. Same tool. Different relationship. Wildly different outcomes.
Recommendations
The format of the plan matters less than whether the teacher made a meaningful choice in creating it. Design for agency, not output quality.
The gap between planning and delivery is where teacher development lives. Coaching tools that close this loop — audio submission, AI analysis, structured reflection — produce measurable improvement. Lesson plan tools alone do not.
Track whether teachers reflect on completed lessons, not just whether they submitted a plan. A teacher who generates a plan at 1am and submits audio coaching two days later is developing. One who ticks a compliance box is not.
Several teachers in the GSMA survey cited "coercive rollout" as a barrier — not the tool itself. The tool gave agency; the deployment took it away. Implementation culture determines whether teachers claim ownership or deflect it.
Pre-generated content should be the starting point of a conversation, not the endpoint of a pipeline. Build in edit steps, branching options, classroom-size variables. Give the teacher something to add. Give them their egg.