Updated on
March 19, 2026
IB Assessment: A Teacher's Guide
|
March 19, 2026
This IB assessment teacher's guide helps educators apply best-fit grading, adapt rubrics for SEND, and make criteria visible.


Updated on
March 19, 2026
|
March 19, 2026
This IB assessment teacher's guide helps educators apply best-fit grading, adapt rubrics for SEND, and make criteria visible.

Criterion-referenced assessment evaluates pupil performance against predetermined, fixed criteria. This rejects norm-referenced grading, which compares pupils against each other. In the International Baccalaureate (IB) framework, a pupil achieves a specific grade by demonstrating the skills described in that grade band. If every pupil demonstrates the highest level of achievement, every pupil receives the top grade.
This model relies on detailed rubrics containing specific command terms and achievement descriptors. The Middle Years Programme (MYP) uses four criteria (A, B, C, and D) tailored to each subject group. The Diploma Programme (DP) uses subject-specific assessment objectives. Hattie & Timperley (2007) argue that providing transparent, criteria-based goals significantly improves pupil achievement.
Teachers use these rubrics formatively to guide learning and summatively to evaluate final products. The criteria remain consistent, allowing pupils to track their progress. Wiliam (2011) highlights that when pupils understand the success criteria, they become active participants in their own learning. This shifts the focus from accumulating marks to mastering skills.
A Geography teacher introduces a new essay task by projecting the IB rubric. The teacher asks the class to define the command terms 'describe' and 'evaluate'. Pupils write brief definitions in their draft planners. What the teacher does: Projects the rubric and facilitates a class discussion. What pupils produce: Written definitions of command terms.
IB rubrics demand rigorous cognitive engagement. Teachers can use Webb's Depth of Knowledge (DoK) to unpack these demands and sequence lessons. Lower achievement bands typically align with DoK Level 1 and 2, requiring recall and application. Higher bands push pupils into DoK Level 3 and 4, demanding analysis and extended thinking.
Presenting an entire IB rubric often causes cognitive overload. Sweller (1988) demonstrates that working memory has limits, and dense academic language exceeds those limits. Teachers must scaffold these criteria. This is essential for inclusive teaching.
Pupils with Special Educational Needs and Disabilities (SEND) are particularly vulnerable to poorly presented assessment criteria. Abstract rubrics create barriers that mask a pupil's subject knowledge. By integrating Visible Thinking routines and structural scaffolds, teachers make criteria accessible.
A Mathematics teacher maps an IB investigation rubric against Webb's DoK levels on a wall chart. The teacher points to the DoK Level 2 section and asks pupils to complete a calculation exercise. Pupils complete the exercise on mini whiteboards and show them for verification. What the teacher does: Creates a DoK-aligned wall chart and leads a short practice exercise. What pupils produce: Completed calculation exercises on mini whiteboards.
Applying IB criteria requires specific classroom strategies. Teachers must bridge the gap between policy documents and instruction.
The best-fit approach relies on a teacher's judgment to determine the most accurate achievement band. A piece of work rarely aligns perfectly with every bullet point. The teacher must identify the "centre of gravity". This means balancing strong performance against weaker performance.
The teacher reviews an English literature essay alongside the Criterion A rubric. The teacher highlights strong critical analysis (a band 7 descriptor) in green and weak textual referencing (a band 4 descriptor) in yellow. The teacher decides the overall quality sits in the 5-6 band and records a justification. What the teacher does: Highlights strengths and weaknesses in an essay using colour-coding. What pupils produce: Pupils review their own highlighted essays and explain why their work fits a specific band.
IB assessment should be transparent. Making criteria visible involves translating expectations into interactive classroom tools. This connects the summative rubric to formative tasks.
The teacher designs visual checkpoints based on the MYP Design cycle criteria. The teacher places cards representing each criteria strand on the pupils' desks. The teacher instructs the class to move a token onto a card once they have completed that requirement. What the teacher does: Creates physical cards representing each stage of the design cycle. What pupils produce: Pupils move tokens to indicate progress and explain their reasoning.
The language in IB policy documents is rarely suitable for direct pupil consumption. Teachers must translate these rubrics into student-friendly language. This clarifies expectations and reduces anxiety.
The teacher projects the official DP History rubric alongside a simplified version. The teacher points out how "synthesises complex information" translates to "combines facts from three sources to make a new point". The teacher hands out copies of the translated rubric. What the teacher does: Creates and distributes a simplified version of the rubric. What pupils produce: Pupils use the translated rubric to check their work against the criteria.

A misconception is that a pupil must meet every descriptor in a lower band before achieving a higher band. The IB framework uses a best-fit model. A pupil might demonstrate sophisticated evaluation skills (Band 7) while making minor structural errors (Band 4). The teacher uses judgment to award a grade that reflects the overall quality.
Many teachers believe that criterion-referenced rubrics are only for summative marking. In reality, these criteria must drive formative assessment. Waiting until the end of a unit guarantees that pupils will miss feedback opportunities. Formative tasks should isolate specific strands of the rubric.
Another error is assuming that the rubric dictates the sequence of teaching. While the criteria define the end goal, they do not prescribe the pedagogical journey. Teachers have the freedom to design engaging learning experiences.
A Science teacher leads a department meeting to correct the assumption that a spelling error prevents a pupil from achieving the top band in a laboratory report. The teacher shares an example of a report with minor spelling mistakes that still demonstrates exceptional scientific reasoning. The department agrees to apply the best-fit model. What the teacher does: Facilitates a department discussion about applying the best-fit model. What pupils produce: N/A (This is a teacher-focused activity).
Transitioning to effective criterion-referenced assessment requires planning and collaboration.
Step 1: Unpack the Subject Guide. Read the specific IB guide for your subject and programme. Identify the core assessment criteria and highlight the command terms.
Step 2: Translate the Criteria. Rewrite the achievement descriptors into clear "I can" statements. Ensure these translations remain accurate. Share these translated rubrics with your department.
Step 3: Design Formative Scaffolds. Create small activities that target single strands of the rubric. If Criterion B requires pattern recognition, design a ten-minute lesson starter focused on identifying data trends.
Step 4: Conduct Internal Standardisation. Gather your department to review a sample of pupil work before assigning final grades. Discuss the work against the criteria and agree on the best-fit band.
A History department meets to standardise a recent MYP Year 3 assessment. The lead teacher provides three unmarked essays. Each teacher silently marks the first essay using the translated rubric. The teachers then reveal their awarded bands and discuss any discrepancies until they reach a consensus. What the teacher does: Provides sample essays and facilitates a standardisation meeting. What pupils produce: N/A (This is a teacher-focused activity).
The principles of criterion-referenced assessment look different depending on the subject, but the focus on explicit standards remains identical.
In MYP Individuals and Societies, teachers struggle to assess Criterion A (Knowing and understanding) without penalising pupils for lower literacy levels. Writing tasks obscure the pupil's historical or geographical knowledge. The teacher uses Map It graphic organisers to bypass this literacy barrier. The teacher asks pupils to map the causes of a historical event using a relational diagram. Pupils populate the graphic organiser with keywords and connections. What the teacher does: Introduces and models the use of Map It graphic organisers. What pupils produce: Completed Map It graphic organisers demonstrating their understanding of historical events.
In DP Language and Literature, implementing the best-fit judgment can be subjective, particularly for Criterion C (Producing text). Pupils struggle to adopt the required academic register. The teacher scaffolds this process using Build It sentence blocks. The teacher provides sentence starters and structural constraints that force pupils to use specific literary terminology. Pupils manipulate these sentence components before writing their comparative analysis paragraph. What the teacher does: Provides sentence starters and structural constraints. What pupils produce: Comparative analysis paragraphs constructed using sentence blocks.
In MYP Sciences, complex practical rubrics easily overwhelm SEND learners. A standard laboratory report rubric assesses multiple criteria simultaneously. The teacher adapts the assessment by breaking the complex rubric down into visible, single-step checkpoints using a Learning Design Canvas. The teacher gives the pupil a card that only asks for a hypothesis. Once the pupil writes the hypothesis and the teacher verifies it, the teacher hands the pupil the next card requiring the methodology. What the teacher does: Creates a Learning Design Canvas with single-step checkpoints. What pupils produce: A completed laboratory report, scaffolded by the Learning Design Canvas.
Strategies for Supporting SEND Learners in IB Assessment infographic for teachers" loading="lazy">
Formative assessment occurs during learning to provide feedback against specific strands of the criteria. Summative assessment happens at the end of a unit to evaluate the pupil's final performance against the entire rubric. Both must use the same IB criteria.
When a pupil's work sits on the border between two bands, revisit the command terms. Look for the prevailing quality of the work and determine if it leans towards the higher or lower cognitive demand. Document your rationale.
You must find the centre of gravity. If a pupil shows Band 7 analysis but Band 1 communication, the best-fit grade will likely sit in the middle bands. You cannot award a top grade if a fundamental requirement is missing, but you should not award the lowest grade if high-level skills are present.
Never hand a SEND pupil a full page of dense IB text. Break the rubric down into a checklist of single actions. Present only the criteria for the specific band the pupil is working towards.
Departments should standardise at least once per assessment cycle, ideally before teachers begin marking. Regular standardisation calibrates teacher judgment, builds a shared understanding of the criteria, and reduces grading discrepancies.
IB assessment is criterion-referenced and does not use percentage-based grading or point accumulation within a single rubric. You must assign a holistic band score based on the qualitative descriptors.
Draft your translated rubric for your next unit today and share it with a colleague for feedback.
These peer-reviewed studies provide the evidence base for the strategies discussed above.
Context Matters: A Strategy to Pre-train Language Model for Science Education View study ↗
58 citations
Liu et al. (2023)
This research explores using AI language models to automatically score student science writing, including argumentation. For teachers, this could streamline assessment processes and provide consistent feedback on scientific reasoning skills, though human oversight remains essential for nuanced evaluation.
Validity matters more than cheating View study ↗
51 citations
Dawson et al. (2024)
This paper argues that ensuring assessment validity should take priority over preventing cheating. Teachers should focus on designing assessments that accurately measure learning outcomes rather than becoming overly concerned with cheating, as valid assessments naturally reduce opportunities for academic dishonesty.
Two critical thinking models-probing questions and conceptualization-adding 4 skillsets to the teacher's armamentarium. View study ↗
13 citations
Johnsen et al. (2020)
The study presents two critical thinking models using probing questions and conceptualisation techniques originally from medical education. Teachers can adapt these structured questioning approaches to help students develop analytical skills and deeper reasoning across various subjects and assessment contexts.
What matters in design? Cultivating undergraduates’ critical thinking through online peer assessment in a Confucian heritage context View study ↗
40 citations
Zhan (2020)
This research examines how online peer assessment develops critical thinking skills in East Asian educational contexts. Teachers can use peer assessment strategies to enhance student engagement and analytical abilities whilst being mindful of cultural factors that may influence participation.
Teaching Summative Assessment: A Curriculum Analysis of Pre-Service Language Teacher Education in Sweden and Finland View study ↗
Yıldırım et al. (2023)
This study analyses how teacher training programmes in Sweden and Finland prepare educators for summative assessment. It highlights the importance of systematic assessment literacy training, suggesting teachers need structured support to develop effective assessment design and implementation skills.

Criterion-referenced assessment evaluates pupil performance against predetermined, fixed criteria. This rejects norm-referenced grading, which compares pupils against each other. In the International Baccalaureate (IB) framework, a pupil achieves a specific grade by demonstrating the skills described in that grade band. If every pupil demonstrates the highest level of achievement, every pupil receives the top grade.
This model relies on detailed rubrics containing specific command terms and achievement descriptors. The Middle Years Programme (MYP) uses four criteria (A, B, C, and D) tailored to each subject group. The Diploma Programme (DP) uses subject-specific assessment objectives. Hattie & Timperley (2007) argue that providing transparent, criteria-based goals significantly improves pupil achievement.
Teachers use these rubrics formatively to guide learning and summatively to evaluate final products. The criteria remain consistent, allowing pupils to track their progress. Wiliam (2011) highlights that when pupils understand the success criteria, they become active participants in their own learning. This shifts the focus from accumulating marks to mastering skills.
A Geography teacher introduces a new essay task by projecting the IB rubric. The teacher asks the class to define the command terms 'describe' and 'evaluate'. Pupils write brief definitions in their draft planners. What the teacher does: Projects the rubric and facilitates a class discussion. What pupils produce: Written definitions of command terms.
IB rubrics demand rigorous cognitive engagement. Teachers can use Webb's Depth of Knowledge (DoK) to unpack these demands and sequence lessons. Lower achievement bands typically align with DoK Level 1 and 2, requiring recall and application. Higher bands push pupils into DoK Level 3 and 4, demanding analysis and extended thinking.
Presenting an entire IB rubric often causes cognitive overload. Sweller (1988) demonstrates that working memory has limits, and dense academic language exceeds those limits. Teachers must scaffold these criteria. This is essential for inclusive teaching.
Pupils with Special Educational Needs and Disabilities (SEND) are particularly vulnerable to poorly presented assessment criteria. Abstract rubrics create barriers that mask a pupil's subject knowledge. By integrating Visible Thinking routines and structural scaffolds, teachers make criteria accessible.
A Mathematics teacher maps an IB investigation rubric against Webb's DoK levels on a wall chart. The teacher points to the DoK Level 2 section and asks pupils to complete a calculation exercise. Pupils complete the exercise on mini whiteboards and show them for verification. What the teacher does: Creates a DoK-aligned wall chart and leads a short practice exercise. What pupils produce: Completed calculation exercises on mini whiteboards.
Applying IB criteria requires specific classroom strategies. Teachers must bridge the gap between policy documents and instruction.
The best-fit approach relies on a teacher's judgment to determine the most accurate achievement band. A piece of work rarely aligns perfectly with every bullet point. The teacher must identify the "centre of gravity". This means balancing strong performance against weaker performance.
The teacher reviews an English literature essay alongside the Criterion A rubric. The teacher highlights strong critical analysis (a band 7 descriptor) in green and weak textual referencing (a band 4 descriptor) in yellow. The teacher decides the overall quality sits in the 5-6 band and records a justification. What the teacher does: Highlights strengths and weaknesses in an essay using colour-coding. What pupils produce: Pupils review their own highlighted essays and explain why their work fits a specific band.
IB assessment should be transparent. Making criteria visible involves translating expectations into interactive classroom tools. This connects the summative rubric to formative tasks.
The teacher designs visual checkpoints based on the MYP Design cycle criteria. The teacher places cards representing each criteria strand on the pupils' desks. The teacher instructs the class to move a token onto a card once they have completed that requirement. What the teacher does: Creates physical cards representing each stage of the design cycle. What pupils produce: Pupils move tokens to indicate progress and explain their reasoning.
The language in IB policy documents is rarely suitable for direct pupil consumption. Teachers must translate these rubrics into student-friendly language. This clarifies expectations and reduces anxiety.
The teacher projects the official DP History rubric alongside a simplified version. The teacher points out how "synthesises complex information" translates to "combines facts from three sources to make a new point". The teacher hands out copies of the translated rubric. What the teacher does: Creates and distributes a simplified version of the rubric. What pupils produce: Pupils use the translated rubric to check their work against the criteria.

A misconception is that a pupil must meet every descriptor in a lower band before achieving a higher band. The IB framework uses a best-fit model. A pupil might demonstrate sophisticated evaluation skills (Band 7) while making minor structural errors (Band 4). The teacher uses judgment to award a grade that reflects the overall quality.
Many teachers believe that criterion-referenced rubrics are only for summative marking. In reality, these criteria must drive formative assessment. Waiting until the end of a unit guarantees that pupils will miss feedback opportunities. Formative tasks should isolate specific strands of the rubric.
Another error is assuming that the rubric dictates the sequence of teaching. While the criteria define the end goal, they do not prescribe the pedagogical journey. Teachers have the freedom to design engaging learning experiences.
A Science teacher leads a department meeting to correct the assumption that a spelling error prevents a pupil from achieving the top band in a laboratory report. The teacher shares an example of a report with minor spelling mistakes that still demonstrates exceptional scientific reasoning. The department agrees to apply the best-fit model. What the teacher does: Facilitates a department discussion about applying the best-fit model. What pupils produce: N/A (This is a teacher-focused activity).
Transitioning to effective criterion-referenced assessment requires planning and collaboration.
Step 1: Unpack the Subject Guide. Read the specific IB guide for your subject and programme. Identify the core assessment criteria and highlight the command terms.
Step 2: Translate the Criteria. Rewrite the achievement descriptors into clear "I can" statements. Ensure these translations remain accurate. Share these translated rubrics with your department.
Step 3: Design Formative Scaffolds. Create small activities that target single strands of the rubric. If Criterion B requires pattern recognition, design a ten-minute lesson starter focused on identifying data trends.
Step 4: Conduct Internal Standardisation. Gather your department to review a sample of pupil work before assigning final grades. Discuss the work against the criteria and agree on the best-fit band.
A History department meets to standardise a recent MYP Year 3 assessment. The lead teacher provides three unmarked essays. Each teacher silently marks the first essay using the translated rubric. The teachers then reveal their awarded bands and discuss any discrepancies until they reach a consensus. What the teacher does: Provides sample essays and facilitates a standardisation meeting. What pupils produce: N/A (This is a teacher-focused activity).
The principles of criterion-referenced assessment look different depending on the subject, but the focus on explicit standards remains identical.
In MYP Individuals and Societies, teachers struggle to assess Criterion A (Knowing and understanding) without penalising pupils for lower literacy levels. Writing tasks obscure the pupil's historical or geographical knowledge. The teacher uses Map It graphic organisers to bypass this literacy barrier. The teacher asks pupils to map the causes of a historical event using a relational diagram. Pupils populate the graphic organiser with keywords and connections. What the teacher does: Introduces and models the use of Map It graphic organisers. What pupils produce: Completed Map It graphic organisers demonstrating their understanding of historical events.
In DP Language and Literature, implementing the best-fit judgment can be subjective, particularly for Criterion C (Producing text). Pupils struggle to adopt the required academic register. The teacher scaffolds this process using Build It sentence blocks. The teacher provides sentence starters and structural constraints that force pupils to use specific literary terminology. Pupils manipulate these sentence components before writing their comparative analysis paragraph. What the teacher does: Provides sentence starters and structural constraints. What pupils produce: Comparative analysis paragraphs constructed using sentence blocks.
In MYP Sciences, complex practical rubrics easily overwhelm SEND learners. A standard laboratory report rubric assesses multiple criteria simultaneously. The teacher adapts the assessment by breaking the complex rubric down into visible, single-step checkpoints using a Learning Design Canvas. The teacher gives the pupil a card that only asks for a hypothesis. Once the pupil writes the hypothesis and the teacher verifies it, the teacher hands the pupil the next card requiring the methodology. What the teacher does: Creates a Learning Design Canvas with single-step checkpoints. What pupils produce: A completed laboratory report, scaffolded by the Learning Design Canvas.
Strategies for Supporting SEND Learners in IB Assessment infographic for teachers" loading="lazy">
Formative assessment occurs during learning to provide feedback against specific strands of the criteria. Summative assessment happens at the end of a unit to evaluate the pupil's final performance against the entire rubric. Both must use the same IB criteria.
When a pupil's work sits on the border between two bands, revisit the command terms. Look for the prevailing quality of the work and determine if it leans towards the higher or lower cognitive demand. Document your rationale.
You must find the centre of gravity. If a pupil shows Band 7 analysis but Band 1 communication, the best-fit grade will likely sit in the middle bands. You cannot award a top grade if a fundamental requirement is missing, but you should not award the lowest grade if high-level skills are present.
Never hand a SEND pupil a full page of dense IB text. Break the rubric down into a checklist of single actions. Present only the criteria for the specific band the pupil is working towards.
Departments should standardise at least once per assessment cycle, ideally before teachers begin marking. Regular standardisation calibrates teacher judgment, builds a shared understanding of the criteria, and reduces grading discrepancies.
IB assessment is criterion-referenced and does not use percentage-based grading or point accumulation within a single rubric. You must assign a holistic band score based on the qualitative descriptors.
Draft your translated rubric for your next unit today and share it with a colleague for feedback.
These peer-reviewed studies provide the evidence base for the strategies discussed above.
Context Matters: A Strategy to Pre-train Language Model for Science Education View study ↗
58 citations
Liu et al. (2023)
This research explores using AI language models to automatically score student science writing, including argumentation. For teachers, this could streamline assessment processes and provide consistent feedback on scientific reasoning skills, though human oversight remains essential for nuanced evaluation.
Validity matters more than cheating View study ↗
51 citations
Dawson et al. (2024)
This paper argues that ensuring assessment validity should take priority over preventing cheating. Teachers should focus on designing assessments that accurately measure learning outcomes rather than becoming overly concerned with cheating, as valid assessments naturally reduce opportunities for academic dishonesty.
Two critical thinking models-probing questions and conceptualization-adding 4 skillsets to the teacher's armamentarium. View study ↗
13 citations
Johnsen et al. (2020)
The study presents two critical thinking models using probing questions and conceptualisation techniques originally from medical education. Teachers can adapt these structured questioning approaches to help students develop analytical skills and deeper reasoning across various subjects and assessment contexts.
What matters in design? Cultivating undergraduates’ critical thinking through online peer assessment in a Confucian heritage context View study ↗
40 citations
Zhan (2020)
This research examines how online peer assessment develops critical thinking skills in East Asian educational contexts. Teachers can use peer assessment strategies to enhance student engagement and analytical abilities whilst being mindful of cultural factors that may influence participation.
Teaching Summative Assessment: A Curriculum Analysis of Pre-Service Language Teacher Education in Sweden and Finland View study ↗
Yıldırım et al. (2023)
This study analyses how teacher training programmes in Sweden and Finland prepare educators for summative assessment. It highlights the importance of systematic assessment literacy training, suggesting teachers need structured support to develop effective assessment design and implementation skills.
{"@context":"https://schema.org","@graph":[{"@type":"Organization","@id":"https://www.structural-learning.com/#org","name":"Structural Learning","url":"https://www.structural-learning.com/","logo":{"@type":"ImageObject","url":"https://cdn.prod.website-files.com/5b69a01ba2e409501de055d1/5b69a01ba2e40996a5e055f4_structural-learning-logo.png"}},{"@type":"Person","@id":"https://www.structural-learning.com/team/paul-main/#person","name":"Paul Main","url":"https://www.structural-learning.com/team/paul-main","jobTitle":"Founder","affiliation":{"@id":"https://www.structural-learning.com/#org"}},{"@type":"BreadcrumbList","@id":"https://www.structural-learning.com/post/ib-assessment-teachers-guide#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https://www.structural-learning.com/"},{"@type":"ListItem","position":2,"name":"Blog","item":"https://www.structural-learning.com/blog"},{"@type":"ListItem","position":3,"name":"IB Assessment: A Teacher's Guide","item":"https://www.structural-learning.com/post/ib-assessment-teachers-guide"}]},{"@type":"BlogPosting","@id":"https://www.structural-learning.com/post/ib-assessment-teachers-guide#article","headline":"IB Assessment: A Teacher's Guide","description":"This IB assessment teacher's guide helps educators apply best-fit grading, adapt rubrics for SEND, and make criteria visible.","author":{"@id":"https://www.structural-learning.com/team/paul-main/#person"},"publisher":{"@id":"https://www.structural-learning.com/#org"},"datePublished":"2026-03-19","dateModified":"2026-03-19","inLanguage":"en-GB"}]}