AI and EdTech Tools for Teachers: A Complete Evidence-Based Guide
Central hub for AI in education, EdTech tool reviews, AI marking, ChatGPT for teachers, and AI ethics resources.
The UK education system stands at an inflection point. ChatGPT reached 100 million users faster than any previous consumer application. Teachers face a choice: resist AI or integrate it thoughtfully. The evidence favours the latter.
This hub consolidates what research tells us about AI in classrooms. We cover lesson planning tools, marking automation, differentiation engines, and accessibility features. We also cover what the evidence doesn't support—and why some "innovative" tools fail learners.
The key insight: AI is most powerful when it automates the administrative work teachers don't enjoy, freeing time for the human work teachers do best—dialogue, feedback, adaptive teaching.
Research identifies three high-impact areas where AI genuinely helps teaching:
AI can generate starter activities, worked examples, and discussion prompts. A teacher using ChatGPT for lesson planning doesn't spend hours writing materials—instead, they spend 15 minutes refining AI drafts. The time saved compounds: an hour per week across a year is 50+ hours of planning time recovered.
The constraint is quality. Generic AI prompts produce generic lesson plans. Effective use requires a clear pedagogical intent. A teacher asking "Generate a Year 5 fractions lesson" gets something mediocre. A teacher asking "Generate a diagnostic task to assess whether learners understand equivalence vs. quantity" gets something useful.
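To make that intent repeatable, it can help to keep a prompt template rather than retyping requests from scratch. A minimal sketch in Python; the field names are illustrative choices, not a standard:

```python
# A reusable planning prompt that forces pedagogical intent to be stated.
# All field names below are illustrative, not a fixed schema.
PLANNING_PROMPT = """You are helping a UK teacher plan a lesson.
Year group: {year_group}
Topic: {topic}
Pedagogical intent: {intent}
Output wanted: {output_format}
Constraints: {constraints}"""

prompt = PLANNING_PROMPT.format(
    year_group="Year 5",
    topic="Fractions",
    intent="Diagnose whether learners confuse equivalence with quantity",
    output_format="One diagnostic task with three distractors, each mapped to a misconception",
    constraints="10 minutes, no printed worksheet needed",
)
print(prompt)
```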
This aligns with cognitive load theory (Sweller, 1988). AI offloads the extraneous load of material generation, preserving cognitive space for the intrinsic work of pedagogy.
Automated marking of multiple-choice assessments has been standard for 30 years. Modern AI extends this to short-answer and extended writing. Systems can now flag common misconceptions, generate feedback prompts, and rank student work by confidence level.
The evidence is mixed. AI marking systems improve feedback speed but can miss context-specific misconceptions. A study by Chen et al. (2023) found AI feedback was 82% as effective as teacher feedback when trained on rubrics, but fell to 54% when rubrics were vague.
Best practice: Use AI to draft feedback, never as final feedback. A teacher reviewing AI suggestions takes 2 minutes instead of 20. The learner receives richer, faster feedback.
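A minimal sketch of that workflow, assuming the OpenAI Python client; the model name, rubric text, and prompt wording are placeholders to adapt, not recommendations:

```python
# "AI drafts, teacher decides": generate draft feedback for human review.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

def draft_feedback(student_answer: str, rubric: str) -> str:
    """Return DRAFT feedback for a teacher to review, never to send as-is."""
    response = client.chat.completions.create(
        model="gpt-4o-mini",  # placeholder model choice
        messages=[{
            "role": "user",
            "content": (
                f"Rubric:\n{rubric}\n\nStudent answer:\n{student_answer}\n\n"
                "Draft two sentences of feedback: one strength, one next step. "
                "Name the rubric criterion each sentence addresses."
            ),
        }],
    )
    return response.choices[0].message.content
```

The teacher edits the draft before it reaches the learner; that review step is the whole point.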
AI differentiation engines adapt content difficulty, pacing, and modality based on learner performance. A learner struggling with abstract fractions gets concrete pictorial representations. One who masters quickly moves to applied problems.
This is not personalised learning (a discredited concept). It's adaptive learning—a systematic response to observed performance. The mechanism that drives it is pedagogically sound: retrieval practice at the edge of competence (Bjork & Bjork, 1992).
The caution: adaptive systems work best in low-stakes practice, not high-stakes assessment. Learners need some struggle to build robust knowledge.
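The underlying mechanism can be simple. Here is a minimal sketch of a one-step "staircase" adjuster on an assumed five-level difficulty scale; real engines model learner knowledge far more richly:

```python
# Staircase adaptation: one step harder after success, one step easier after failure.
def next_difficulty(current: int, was_correct: bool,
                    lowest: int = 1, highest: int = 5) -> int:
    step = 1 if was_correct else -1
    return max(lowest, min(highest, current + step))

difficulty = 3
for correct in [True, True, False, True]:
    difficulty = next_difficulty(difficulty, correct)
    print(difficulty)  # 4, 5, 4, 5
```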
Not all tools are equal. Schools adopting EdTech often face pressure to choose fast. This framework helps leaders evaluate:
Does the tool align with how learners actually learn? Red flags include gamification as the core selling point, appeals to "learning styles", and no clear account of the learning mechanism.
Green flags include retrieval practice, worked examples, immediate feedback, and metacognitive scaffolds.
Ask for randomised controlled trials (RCTs) or robust quasi-experimental evidence. If the vendor cannot produce evidence, be sceptical. The EEF Teaching and Learning Toolkit is a good baseline for what "evidence" looks like in UK schools.
Be aware of publication bias: vendors are far more likely to publish positive findings than null results. Ask whether the impact was independently evaluated.
A tool that costs £50,000 per year and improves reading fluency by 3% is less valuable than one costing £5,000 that improves it by 5%. Calculate the cost per percentage point of gain, as in the worked example below. This forces honest evaluation.
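The arithmetic is trivial, which is exactly why it is worth writing down:

```python
# Cost per percentage point of gain: lower is better value.
def cost_per_point(annual_cost_gbp: float, gain_points: float) -> float:
    return annual_cost_gbp / gain_points

print(cost_per_point(50_000, 3))  # ~16,667 GBP per point
print(cost_per_point(5_000, 5))   # 1,000 GBP per point: roughly 17x better value
```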
EdTech vendors often design for mainstream first, SEND as an afterthought. This is backwards. AI metacognitive scaffolds are most powerful for learners who struggle to regulate their own learning. If a tool isn't accessible from day one, pass.
Learners with dyslexia, dyscalculia, and autism often struggle with metacognition—the ability to monitor and adjust their own thinking. AI scaffolds address this directly.
Example: A learner with dyscalculia using AI graphic organisers gets real-time visual structure. Instead of working memory overload, they see the problem mapped out. This isn't "personalised learning"—it's removing barriers.
Similarly, AI-powered retrieval practice quizzes adapt difficulty so learners with SEND consistently work within the zone of proximal development (Vygotsky, 1978). Too hard → demoralisation. Too easy → no learning. Adaptive systems keep the zone stable.
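A minimal sketch of that stabilising loop: track recent accuracy and change difficulty only when it drifts outside a target band. The 70–85% window is illustrative, not a research constant:

```python
from collections import deque

class ZoneKeeper:
    """Signal when practice difficulty should change, based on rolling accuracy."""
    def __init__(self, window: int = 10, low: float = 0.70, high: float = 0.85):
        self.results = deque(maxlen=window)
        self.low, self.high = low, high

    def record(self, was_correct: bool) -> str:
        self.results.append(was_correct)
        if len(self.results) < self.results.maxlen:
            return "hold"                  # not enough evidence yet
        accuracy = sum(self.results) / len(self.results)
        if accuracy > self.high:
            return "harder"                # too easy: no learning
        if accuracy < self.low:
            return "easier"                # too hard: demoralisation
        return "hold"                      # inside the zone
```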
Many schools ban ChatGPT. This is defensible as an interim response, but it's not sustainable. Academic integrity in the age of AI requires teaching learners how to use AI ethically.
The principle: Learners should understand AI—how it works, what it's good for, what it's bad at. They should know when AI use is appropriate (brainstorming, checking grammar, explaining concepts) and when it's not (sitting exams, submitting work as their own).
This mirrors how we teach with calculators. We don't ban them; we teach learners when to use them and when mental arithmetic matters. Same with AI.
ChatGPT: the broadest general-purpose tool. Good for lesson planning, explaining concepts, and generating multiple-choice questions. Weaker at maths (it often makes calculation errors), and its training data cuts off in April 2024, so recent knowledge is missing. See our guide to best practices for ChatGPT in teaching.
Google Gemini: multimodal (text, image, and video) and stronger at maths than ChatGPT. It can analyse images, which is useful for marking work or generating worked examples, and real-time web access keeps its knowledge current.
Claude: strong reasoning and long-form writing. Less flashy than ChatGPT but often more reliable, and its larger context window lets it process entire lessons or articles in one pass. Best for detailed feedback and curriculum planning.
Tools like Kahoot, Quizlet, and Classcraft are purpose-built for education. They're less flexible but more classroom-integrated. Evaluating EdTech tools should account for ease of use and integration cost.
AI adoption fails without staff training. Teachers often fear AI because they don't understand it, so professional development for AI in schools should start with demystification.
The EEF has evaluated dozens of EdTech tools, and the pattern is consistent: the strongest EdTech aligns with evidence-based pedagogy, not novelty.
Rolling out new tools poorly wastes time and money. Here's a structure that works:
Phase 1, pilot: select 5–10 enthusiastic teachers. They use the tool in one class. Focus on understanding barriers, not perfect implementation.
Phase 2, training: build on the pilot. Run 90-minute sessions covering how to use the tool, alignment with your pedagogy, and how to support learners with SEND. Practise together.
Phase 3, rollout: all teachers implement in one subject area. Monthly check-ins identify common problems. Quick fixes (usually training or workflow tweaks) are deployed immediately.
Phase 4, evaluation: measure impact on a few key metrics (e.g., retrieval practice completion rate, feedback speed), as in the sketch below. Adjust based on data, not anecdote.
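A minimal sketch of what that measurement can look like; the records and field names are illustrative, not a schema:

```python
# Mean feedback turnaround, one of the few metrics worth tracking consistently.
from datetime import date

feedback_log = [  # illustrative records only
    {"submitted": date(2025, 3, 3), "returned": date(2025, 3, 5)},
    {"submitted": date(2025, 3, 4), "returned": date(2025, 3, 6)},
    {"submitted": date(2025, 3, 5), "returned": date(2025, 3, 10)},
]

turnaround = [(r["returned"] - r["submitted"]).days for r in feedback_log]
print(sum(turnaround) / len(turnaround))  # mean feedback speed: 3.0 days
```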
AI tools often produce a novelty effect: learners are excited for a few weeks, then the effect fades. The research on motivation and learning is clear: extrinsic rewards (points, badges, AI praise) don't sustain effort. Intrinsic motivation, built on competence, autonomy, and belonging, does.
Use AI to support these fundamentals. An AI quiz that gives immediate, honest feedback builds competence. A metacognitive scaffold that helps learners choose their own next step builds autonomy. Neither is about gamification.
Start small. Pick one problem your school is trying to solve—perhaps slow feedback cycles, or differentiation for SEND learners. Find an AI tool that addresses it. Run a 6-week pilot with 10 teachers. Measure one outcome carefully. Decide whether to scale.
The future isn't "AI in schools" or "no AI in schools." It's "thoughtful AI in schools, integrated with pedagogy, evaluated honestly, and used to free up teacher time for the irreplaceable human work of teaching."
These papers provide the foundation for evidence-based adoption of AI tools in schools.