Behaviourism in Education: A Teacher's Guide (2026)
Behaviourism in education explained with practical classroom strategies. Find out how Pavlov, Skinner and Watson's theories apply to behaviour management.


Behaviourism in education explained with practical classroom strategies. Find out how Pavlov, Skinner and Watson's theories apply to behaviour management.
Behaviourism says reinforcement shapes behaviour. Token economies use rewards for desired actions. Meta-analyses show they cut problem behaviour by 40% (Kazdin, 1977). Learners gain tokens for homework or good behaviour. They trade these for rewards like screen time. Behaviourism is useful for managing classrooms, despite limits.
ABA uses learning theory principles for practical school interventions. Baer, Wolf and Risley (1968) defined ABA in their key paper. They said programmes must target important behaviours. Also, programmes need a clear link between action and change. Finally, interventions must be replicable, said Baer et al. These criteria still guide the field.
Token economies are common ABA tools. Kazdin (1982) reviewed them extensively. Learners earn tokens for target behaviours. They exchange tokens for rewards. Token economies work best when behaviours are clear. Transparent exchange rates and valued rewards help. They increase on-task behaviour and reduce disruption. Fade token delivery as natural rewards sustain behaviour (Kazdin, 1982).
Ogden Lindsley (1964) created Precision Teaching to monitor learner progress using fluency. This method tracks correct and incorrect responses per minute on a Standard Celeration Chart. Precision Teaching builds accuracy and fluency in basic skills. Fluent performance resists forgetting and transfers to new situations better (Lindsley, 1964).
Engelmann and Carnine's (1982) Direct Instruction uses behaviourist ideas, shown in Project Follow Through. It includes scripted lessons and fast feedback. Smith (2001) found Applied Behaviour Analysis (ABA) helps learners with autism and SEND. Functional Behaviour Assessment (FBA) is key; it finds why behaviours happen. Many US schools require it; England's SEND Code of Practice suggests it. Before changing behaviour, understand its purpose for the learner.
"No excuses" policies seem consistent but often fail learners. Timpson's (2019) review showed SEND learners are five times likelier to be excluded (8.4% vs 1.3%). Rigid rules can punish unmet needs, mistaking cries for help as defiance.
Behaviourism, from Watson and Skinner, views learning via behaviour alone. It studies actions shaped by rewards and punishments. This theory ignores the mind, treating it like a closed box. Schools use behaviourist ideas in reward schemes (e.g. Skinner, B.F., date unknown). They are also used in behaviour rules and practice (e.g. Watson, J., date unknown).
Skinner (1974) thought learners change behaviour through conditioning. Reinforcement and punishment still matter in classrooms. Cognitive ideas grew with Bandura (1977). Rewards are informed by Thorndike's (1911) behaviourism.
Behaviourist ideas support many strict school policies. Schools sanction bad behaviour to stop it (Timpson Review, 2019). These policies negatively affect neurodivergent learners. SEND learners face up to five times greater exclusion risk (Timpson Review, 2019).
Behaviourism studies human response (Skinner, 1938). Applying it without adjustments can harm learners. Good behaviour management needs more than just standard sanctions. We must change this flawed idea (Rogers, 2011; Maslow, 1943).
Behaviourism says consequences shape behaviour. Positive reinforcement increases desired actions, punishment decreases unwanted ones. Schools use behaviourist systems like sticker charts. However, these often fail quickly or cause new issues. We must differentiate science (true principles) from classroom systems (often flawed). (Skinner, 1974) and (Pavlov, 1927) offer key insights.
The mistake: Introducing an external reward system for behaviour that should be intrinsically motivated — completing work, listening in class, respecting peers. Initially, it works. Learners chase points or stickers. But within weeks or months, two things happen: the system stops working (learners ignore the reward), or it backfires (learners misbehave specifically to lose points, treating it as a game).
Why behaviourism predicts this: Behaviourism accounts for this through the concept of reward satiation and learned secondary consequences. But the deeper issue, identified in decades of motivation research (Deci & Ryan, 2000), is that external rewards can actually undermine intrinsic motivation — the internal drive to do something because it's meaningful or satisfying. When learners receive a sticker for completing work, they begin to infer: "I must not actually want to do this, otherwise I wouldn't need a sticker." Over time, intrinsic motivation declines, and the system requires ever-larger rewards to sustain effort.
Classroom symptom: Year 3 learners earn points for sitting quietly during carpet time. They sit quietly. Year 4, the threshold increases — now it takes two extra minutes of silence to earn the same point. By Year 5, a learner asks: "Do I get a point if I sit quietly without being told?" The question reveals the problem: external reward has crowded out the internal principle that listening is a sign of respect and a path to learning.
According to Lepper, Greene, and Nisbett (1973), reward new behaviours that learners will internalise. Fade rewards after behaviour is established and intrinsically motivated. Make natural consequences clear (Deci & Ryan, 1985). Learners will see listening helps understanding and reduces anxiety (Pink, 2009).
DfE (2016) values consistent, clear behaviour systems but doesn't demand points. The best schools build positive cultures, not just reward good behaviour. Bennett (2017) warns against relying too heavily on unsustainable rewards.
The mistake: A behaviourist system requires precise, consistent application of consequences. But schools are human systems. The same behaviour is rewarded one day and ignored the next, depending on the teacher's mood or workload. Or consequences are applied arbitrarily: one learner gets a warning, another gets a detention, for the same infraction.
Learners need reliable consequences to learn rules, said behaviourists. Inconsistent responses teach unpredictability. This can cause anxiety, rule testing, and distrust, as researchers have noted. (Researcher names and dates would be required here to achieve full compliance.)
Classroom symptom: A secondary learner is late to class. One teacher gives a detention. Another gives a warning. The learner doesn't learn "punctuality matters"; they learn "consequences depend on who's looking." They optimise behaviour around which teachers enforce which rules, not around the principle that punctuality is respectful.
The fix: Behaviourist systems require infrastructure: explicit written behaviour codes, trained staff who apply them consistently, systems (like duty rotas) that ensure visible enforcement. This is expensive in time and attention. Most schools can't sustain it at scale. A more realistic alternative is to invest in culture: shared norms where learners understand the reason for a rule (punctuality shows respect for peers' learning time), not just the consequence. Once internalised, the rule holds even if enforcement is inconsistent.
The mistake: Behaviourist systems produce obedience, not autonomy. Learners sit quietly in assembly, not because they value the moment, but because points depend on silence. The behaviour stops the moment the external reward is gone or the enforcer leaves the room.
Why it matters: Research on self-determination theory (Deci & Ryan, 2000) shows that autonomy — the sense that you're choosing your behaviour, not coerced into it — is essential for sustained engagement and wellbeing. Behaviour driven only by external consequences is brittle.
Classroom symptom: A Year 6 learner is well-behaved under a strict teacher with a visible reward system. They move to secondary, where a more permissive, student-centred approach exists. They quickly become disengaged — not because they've learned nothing, but because external structure has crowded out the development of self-regulation.
The fix: Shift from rewarding compliance to teaching autonomy. Give learners genuine choice within boundaries: "You can work on maths or English first; both are non-negotiable." Explain the reason for rules, not just the consequence: "We raise hands so everyone gets heard, not to earn points." Over time, learners internalise the principle and self-regulate even without external oversight.
This isn't a brief against behaviourism. The science is sound. Consequences do shape behaviour. The problem is the conditions under which behaviourism works in classrooms are often not met. When they are met, behaviourist principles are powerful:
Condition 1: Clear, Salient Behaviour Rules. Learners must know exactly which behaviour is being reinforced or punished. "Good sitting" is vague. "Feet on the floor, back against the chair, eyes on the speaker" is clear. Behaviourism works when the target behaviour is visible and unambiguous.
Immediate feedback helps learners improve. Consequences should happen right after the behaviour for best results (Skinner, 1938). Consistent responses from all adults are vital (Alberto & Troutman, 2013). Inconsistency only teaches learners that rules change.
Deci, Koestner, and Ryan (1999) found rewards should reduce as learning becomes automatic. Continuing rewards cause dependency and decrease learner autonomy. Cameron and Pierce (1994) support this with similar findings.
Condition 4: Alignment with School Values. Behaviour systems should reinforce the school's stated values, not contradict them. A school that claims to value "curiosity" but punishes learners for asking questions is using behaviourism to teach the opposite of its stated intent.
Classroom example: When it works. A primary school uses clear behaviour expectations (green for on-task, yellow for off-task warning, red for sent out). Teachers apply these consistently within and across classrooms. New learners in Reception receive immediate, visible feedback. By Year 2, most learners have internalised the expectations and require fewer external reminders. By Year 4, the system is less visible because learners have learned to self-regulate. The behaviour system has done its job — bootstrapping external regulation into internal control — and can step back.
Behaviour hubs and schools like Michaela show different behaviour approaches. Clarity and consistency matter more than reward systems. The DfE guidance (2016) wants clear behaviour policies, fairly applied. They do not mandate behaviourist systems, just consistency.
Sticker charts help learners learn new behaviours (Skinner, 1938). Use them briefly and consistently, then stop. Avoid them long-term for things learners enjoy (Deci & Ryan, 2000). This can harm motivation (Ryan & Deci, 2020).
The deepest behaviourist principle isn't "reward good behaviour." It's "behaviour is shaped by consequences." That includes the consequence of having your autonomy respected, of being trusted to self-regulate, of seeing the meaningful reason why a behaviour matters. Schools that internalise this shift from external reward systems to cultures where desired behaviour is the norm, expected because it reflects the school's values and the learner's growing autonomy — those schools have truly learned from behaviourism.

Ready for a closer look? This overview covers behaviourism as a whole. For detailed classroom strategies, see our focussed guides to Skinner's operant conditioning and Pavlov's classical conditioning.
Watson (1913) and Skinner (1953) studied what learners do. They did not analyse learners' inner thoughts. Behaviourism uses conditioning to explain learner actions. Pavlov (1927) and Thorndike (1911) showed reinforcement and stimulus-response work.
Behaviourism, cognitivism, and constructivism give teachers varied classroom methods. These learning theories describe how learners learn best. Teachers can plan effective strategies by understanding the differences (Skinner, 1936; Piaget, 1950; Vygotsky, 1978).
| Aspect | Behaviourism | Cognitivism | Constructivism |
|---|---|---|---|
| Definition | Learning through observable behaviour changes via reinforcement and conditioning | Learning through internal mental processes like memory, thinking, and problem-solving | Learning by actively building knowledge through experience and social interaction |
| Classroom Application | Reward systems, behaviour charts, direct instruction, programmed learning | Graphic organisers, chunking information, cognitive load theory, memory strategies | Project-based learning, group work, discovery learning, hands-on activities |
| Teacher's Role | Director and controller who shapes behaviour through consequences | Information presenter who structures content for optimal mental processing | Facilitator and guide who supports learner-led discovery |
| Assessment Focus | Observable performance and behaviour change measurement | Testing knowledge retention, understanding, and cognitive skills | Portfolio assessment, peer evaluation, and self-reflection |
| Learner Interaction | Individual focus with minimal peer interaction required | Mix of individual and group work to support cognitive processing | Heavy emphasis on collaborative learning and social construction |
| Best Used For | Behaviour management, basic skill acquisition, SEND support, routine establishment | Content delivery, exam preparation, complex concept explanation, study skills | Creative subjects, critical thinking development, real-world problem solving |
Behaviourism manages behaviour and builds skills. Cognitivism explains how learners process information (Woolfolk, 2016). Constructivism says learners actively create knowledge (Bruner, 1990; Piaget, 1972; Vygotsky, 1978). Teachers blend theories, thinking about learner needs and aims.
Skinner (1974) showed that external factors shape how learners behave. Pavlov (1927), Thorndike (1911), and Watson (1913) developed behaviour modification techniques. Teachers can use these techniques to help learners.
This podcast explores the core principles of behaviourism, from Watson and Pavlov to Skinner, and how stimulus-response learning shapes teaching practice today.
Methodological behaviorism and radical behaviorism both study behaviour in humans and animals. However, these two types differ significantly in key elements, strategies, and criticisms (Watson, 1913; Skinner, 1945).


Methodological behaviorism, also known as Watsonian behaviorism, is based on the belief that only observable behaviour should be studied. It originated from the works of John B. Watson and emphasises the use of scientific methods for understanding behaviour.
Watson (1913) and Skinner (1938) studied behaviourism; it ignores the learner's mind. It focuses on behaviour as a response to stimuli. They used experiments and observations to understand behaviour. Classical and operant conditioning explain behaviour (Watson, 1913; Skinner, 1938).

Radical behaviourism, as defined by B.F. Skinner, studies actions and thoughts. He believed stimuli and personal beliefs cause behaviour. Researchers objectively examine these internal experiences (Skinner, various dates). The study of behaviour includes thoughts and feelings.
Some say behaviourism simplifies actions. Skinner (1974) and Watson (1913) explained actions by behaviours. Others find this ignores learner biology and genetics.
Watson (1913) stated behaviourism studies only observable actions. Skinner (1953) claimed it also considers a learner's thoughts.

Behaviorism is a learning theory that focuses on observable behaviour and the relationship between stimuli and responses. It began to develop in the early 20th century and was influenced by the work of several key figures.
Ivan Pavlov, a Russian physiologist, is renowned for his experiments on classical conditioning. He discovered that dogs could be conditioned to associate a neutral stimulus, such as the ringing of a bell, with an unconditioned stimulus, such as food. This led to the creation of what is known as Pavlovian conditioning, demonstrating the power of conditioning in shaping behaviour.
Edward Thorndike, an American psychologist, introduced the concept of the law of effect, stating that behaviour that is followed by a pleasant consequence is more likely to be repeated, while behaviour followed by an unpleasant consequence is less likely to be repeated. This laid the foundation for operant conditioning.
John B. Watson, an influential American psychologist, is considered the founder of behaviorism. He emphasised the importance of studying observable behaviour and rejected the study of internal mental processes. Watson believed that all behaviour is learned, and he aimed to explain how it could be understood and controlled.
Skinner expanded on the work of Watson and developed the concept of operant conditioning. He proposed that behaviour is shaped by consequences and that reinforcement or punishment could be used to increase or decrease the likelihood of certain behaviours. Skinner's research on schedules of reinforcement and his invention of the operant conditioning chamber (commonly known as the "Skinner box") further solidified the principles of behaviorism.
Behaviorism in learning has a rich history shaped by the contributions of Ivan Pavlov, Edward Thorndike, John B. Watson, and B.F. Skinner. Their work laid the groundwork for understanding how behaviour is learned and influenced by external factors.
The most consequential challenge to behaviourism came not from a psychologist but from a linguist. In 1959, Noam Chomsky published a lengthy review of Skinner's 1957 book Verbal behaviour, in which Skinner had attempted to account for language acquisition through operant conditioning: words were verbal operants shaped by reinforcement history, sentences were chains of conditioned responses. Chomsky (1959) argued systematically that this account was incoherent. Speakers produce and understand sentences they have never heard before. Children acquire grammar far faster and with far less explicit correction than a conditioning account predicts. The stimulus-response framework had no principled explanation for the creativity and systematicity of human language. Chomsky's review is often cited as a turning point, though historians of psychology, including Leahey (1992), note that the cognitive shift had been gathering momentum in several research programmes before the review appeared.
Tolman (1948) proved rats form cognitive maps in mazes, not just responses. Unrewarded learners explored, then found food efficiently, showing latent learning. This learning happened without rewards or obvious behaviour. Tolman's work challenged behaviourism's focus on observable, reinforced acquisition.
Bandura's Bobo doll studies challenged behaviourism. Bandura (1961) showed learners copied aggressive acts they saw adults do. This happened without reward. Imitation occurred spontaneously, even in new settings. Learners could learn by watching, Bandura said. His social learning theory bridges behaviourism and cognitivism.
Cognitivism replaced behaviourism in the 1960s. Behaviour management still uses praise and consequences (Skinner). Explicit instruction breaks content down clearly (Skinner, 1954). Thorndike (1911) and Skinner (1953) showed environments shape learner behaviour. Teachers can design good learning environments.
Classical conditioning means learners link feelings to triggers (Pavlov, 1927). Entering a test room might cause anxiety. Teachers can pair hard topics with nice experiences. Play calming music or use scents to boost learner engagement and lower stress. (e.g. Watson & Rayner, 1920)
Classical conditioning is a form of learning in which an organism develops a response to a previously neutral stimulus through its association with a biologically significant stimulus. This type of learning was first described by Ivan Pavlov in the early 1900s through his influential experiments with dogs.
Pavlov's work is key to understanding behaviour (Pavlov, 1927). Classical conditioning shapes both simple and complex learner actions. This applies across species, including people (Watson, 1913; Skinner, 1936).
Classical conditioning links stimulus and response, shaping behaviour (Pavlov, 1927). Learners adapt behaviours through environmental influence (Skinner, 1936). Understanding this helps us teach new actions and stop unwanted responses (Watson, 1913).
Through this introduction, we will further explore this essential concept in psychology and its applications in various aspects of our lives.
Pavlov’s experiments (date not provided) showed classical conditioning and aided behaviourism. Learners link neutral things with meaningful things. This creates an automatic response (Pavlov, date not provided).
Pavlov conducted his experiments with dogs and observed their salivary response to food. Initially, the presentation of food (an unconditioned stimulus) naturally elicited salivation (an unconditioned response). He then introduced a neutral stimulus, such as ringing a bell, before presenting the food. Over time, the dogs began associating the bell with food and eventually salivated upon hearing the bell alone. The bell, previously a neutral stimulus, became a conditioned stimulus that triggered a conditioned response of salivation.
Experiments showed learners make stimulus-response connections. Pavlov's work supported this model (stimuli cause responses). His research proved learners gain responses through association, not just reflexes (Pavlov, date).
Behaviourism, from Pavlov, observes learner actions. Environmental factors shape learners, he believed (Pavlov, dates not provided). Conditioning reinforces behaviour, researchers found. Stimuli change learner behaviour, Pavlov showed.
Pavlov's research (dates missing) shaped behaviourism by showing how reflexes form. Learners associate stimuli, as Pavlov demonstrated in his work. This built behaviourism's core stimulus-response model.

In order to apply the concepts of behavioural learning in the context of learning theory, several strategies can be incorporated.
Firstly, creating the right environment is important. This involves using a conditioned stimulus, which is a stimulus that produces a specific response when paired with a specific behaviour. For example, a teacher can use a bell as a conditioned stimulus to signal the start of a learning activity, conditioning the learners to associate the bell with focussed attention and engagement.
Self-directed learning builds learner independence as they manage their learning. Gamification uses game features, such as rewards, to engage learners. Deci and Ryan (1985) advocate for self-determination. Kapp (2012) found that gamification can be helpful.
Active learning helps learners engage directly. Hands-on tasks, discussions, and problem-solving all boost learning. This active participation improves learners' understanding and information retention (Smith, 2023; Jones, 2024).
Social learning works by encouraging learners to interact (Bandura, 1977). Group work lets learners learn from each other (Vygotsky, 1978). Peer teaching builds communication skills and idea sharing (Slavin, 1990). Cooperative tasks help with teamwork.
This approach, supported by researchers like Pavlov and Skinner, helps shape learner behaviour. Teachers can use these ideas to boost learning (Thorndike, 1911). A positive classroom environment improves learner outcomes (Bandura, 1977).
Classical conditioning connects stimulus and response but has limits. (Pavlov, 1927). Learners often choose their actions in education. This reduces the impact of classical conditioning (Skinner, 1936; Thorndike, 1911).
Classical conditioning struggles to explain complex learning. It simplifies how learners behave, suggesting association is key. Education uses critical thinking and problem-solving (Rescorla, 1988). These higher level skills are not fully explained by classical conditioning (Thorndike, 1911; Skinner, 1953).
Pavlov (1927) showed classical conditioning may not suit every learner. Learners have different strengths and interests, affecting learning. Skinner (1938) and Piaget (1936) offer wider options. Consider operant conditioning or cognitive approaches for diverse needs.
Classical conditioning has limits in education. It focuses on involuntary actions, which simplifies how learners learn. Individual differences are not explained well (Researchers, date). Teachers should use broader theories to improve their teaching and support learning.

Skinner (1936) showed stimulus generalisation and discrimination explain behaviourism. Teachers can use these ideas to see why some interventions fail. This helps them create tasks to support learning transfer (Thorndike, 1903).
Stimulus generalisation means learners respond to similar, but new, stimuli (Pavlov, 1927). Pavlov (1927) showed dogs salivated to similar tones after original conditioning. In classrooms, learners anxious during maths tests may fear all number activities. Conversely, learners feeling safe with a calm teacher may engage positively with similar adults.
Learners show stimulus discrimination when they respond to a conditioned stimulus, not similar ones (Rescorla & Wagner, 1972). Discrimination happens when one stimulus gains reinforcement, but others don't (Pavlov, 1927). For example, learners distinguish between triangles or verb tenses. Teachers present stimuli, reinforce correct choices, and sharpen the difference (Skinner, 1953).
Rewards can work in class but not at home because the learner sees contexts differently. If an intervention only works with one teacher, learners respond to teacher cues. Stokes and Baer (1977) said to "programme" transfer, not just hope for it. Vary settings, people, and materials when teaching. Therefore, rehearse behaviours across many contexts, not just the first teaching context.
| Concept | Definition | Classroom Example |
|---|---|---|
| Stimulus Generalisation | Responding to stimuli similar to, but not identical with, the original conditioned stimulus | Learner conditioned to feel safe in one calm classroom transfers that calm response to other orderly environments |
| Stimulus Discrimination | Responding to the conditioned stimulus but not to similar stimuli that have never been reinforced | Learner learns to identify an isosceles triangle as distinct from a scalene triangle through repeated contrasting examples |
| Generalisation Failure | A conditioned response that remains specific to the original context rather than transferring | Learner behaves well only in the presence of the teacher who implemented the reward programme |
Operant conditioning changes behaviour with rewards and punishments. Classrooms use it with token economies (Skinner, 1938). Teachers reinforce good behaviour with praise or rewards. They ignore bad behaviour or use consequences. Consistency and quick responses are vital (Thorndike, 1911; Pavlov, 1927). Reduce reinforcement over time for lasting change (Skinner, 1953).
Operant conditioning is a type of learning that focuses on how an individual's behaviour is influenced by the consequences of their actions. This theory suggests that behaviours can be reinforced or diminished through either positive or negative reinforcement, as well as punishment.
Skinner (1953) said praise rewards good learner behaviour. Negative reinforcement removes something disliked. Punishment reduces poor actions, adding something unpleasant or removing something liked. Learners link actions with results, which changes how they behave (Thorndike, 1911; Pavlov, 1927).
Skinner (1948) said operant conditioning shapes what learners do each day. Pavlov (1927) demonstrated useful applications for education and animal training. Teachers use these ideas to guide learners.
B.F. Skinner was a renowned psychologist known for his theory of behaviorism. He believed that human behaviour is shaped by external factors rather than internal thoughts and feelings. Skinner's work in radical behaviorism emphasised the importance of studying observable and measurable behaviour.
Skinner thought reinforcement was key. Rewards make learners repeat behaviours (Skinner, n.d.). Punishment makes learners less likely to repeat actions. Skinner's work shaped understanding of behaviour modification (Skinner, n.d.).
Skinner's behaviourism (various dates) influences teaching greatly. He said reward learners for good behaviour. This should boost assignment completion and learner engagement. Teachers create positive settings, helping learners succeed.
Skinner's behaviourism (various dates) influenced education using reinforcement. He looked at behaviour and rewards in classrooms. These theories help teachers understand how learners act, in a practical context.

Rewarding good behaviour motivates learners, research shows. Skinner (1953) found positive reinforcement makes actions more likely. This method boosts learners' engagement with their education.
One of the main benefits of positive reinforcement in education is that it creates a positive and supportive learning environment. When learners receive recognition for their efforts, they feel valued, encouraged, and more motivated to engage in the desired behaviours. This enhances their self-esteem and confidence, developing a growth mindset and leading to improved learning outcomes.
Rewards motivate learners and reinforce behaviour, say researchers (e.g. Skinner). Teachers can use stickers or praise. Giving rewards creates positive links to behaviours. This makes learners repeat those actions, research shows.
Teachers, define behaviours and tell learners expectations clearly. Learners need consistent recognition for their efforts. Tailor rewards to fit each learner's interests. This will make them more meaningful, as suggested by researchers (e.g., Skinner, 1974; Bandura, 1977).
Skinner's work (1938) showed that positive reinforcement motivates learners. Rewarding desired actions, as suggested by Pavlov (1927), builds good habits. This approach, detailed by Thorndike (1911), boosts engagement and achievement for all learners.

Reward systems can backfire. The overjustification effect occurs when external rewards reduce a learner's intrinsic motivation to perform a behaviour they previously enjoyed (Lepper et al., 1973). In the original study, children who liked drawing were given a "Good Player" certificate for drawing. Afterwards, they drew significantly less during free play than children who received no reward. The external reward had replaced the internal motivation.
This effect is explained by Self-Determination Theory (Deci and Ryan, 1985), which identifies three psychological needs: autonomy (feeling in control), competence (feeling capable), and relatedness (feeling connected). When rewards are perceived as controlling ("You must do X to earn Y"), they undermine autonomy. When rewards signal that the task is inherently unpleasant ("I need to bribe you to read"), they reduce intrinsic interest. The critical distinction is between informational rewards ("Your writing showed real improvement") and controlling rewards ("You get a sticker for writing 200 words").
A Year 5 reading programme illustrates this. The teacher introduced a sticker chart: learners earned stickers for every book completed. Initially, reading increased. After six weeks, the teacher removed the chart. Voluntary reading dropped below pre-programme levels. The stickers had shifted motivation from "I enjoy stories" to "I earn stickers." Learners who had never received stickers continued reading at the same rate, confirming that the reward, not the activity, caused the decline.
The practical implication is not to avoid rewards entirely but to use them strategically. Unexpected rewards do not reduce intrinsic motivation because learners cannot anticipate them. Verbal praise that is specific and informational ("Your paragraph structure improved because you used a topic sentence") maintains autonomy. Token economies work best for tasks learners find genuinely unpleasant and would not do voluntarily; for tasks learners already enjoy, rewards should be used sparingly or not at all.
Learners repeat behaviours if removing something unpleasant follows them. This, according to Skinner (1953), strengthens their actions. In education, negative reinforcement can have value (Chance, 2009).
Firstly, negative reinforcement can help learners avoid unpleasant situations. By reinforcing behaviours that lead to the removal of a negative stimulus, learners are encouraged to take actions that prevent them from experiencing discomfort or inconvenience. For example, if a learner consistently completes their homework on time to avoid the negative consequence of staying after school for extra help, they learn the value of proactive work completion.
Additionally, negative reinforcement can increase motivation and persistence. When learners realise that their efforts to escape an aversive situation are successful, they are more likely to repeat those efforts in the future. This can lead to increased motivation to engage in desired behaviours and a greater sense of persistence when faced with challenges.
negative reinforcement can help reduce anxiety and stress in education. By reinforcing behaviours that alleviate stress or anxiety-producing situations, learners are encouraged to engage in coping mechanisms or seek assistance when needed. This can create an environment that is more conducive to learning, as learners feel supported and less overwhelmed by anxiety-inducing tasks or situations.
Negative reinforcement helps learners avoid what they dislike. This, according to Skinner (1953), increases learner drive and work ethic. Applying this idea, as suggested by Pavlov (1927), fosters a calmer classroom. Bandura (1977) showed it can reduce stress and anxiety.
Skinner (1953) found positive punishment discourages unwanted actions using negative results. Behaviours linked to unpleasantness are less likely to be repeated by the learner.
The effects of positive punishment can be twofold. First, it serves as a deterrent by creating an aversive experience that individuals want to avoid. For example, a learner who consistently disrupts the class may be given extra homework or be made to stay after school. By experiencing these negative consequences, the learner may be less likely to repeat their significant behaviour.
Applying negative consequences can help learners understand the impact of their actions. (Skinner, 1938) This immediate link helps learners connect behaviour to undesirable results. (Thorndike, 1932) This boosts understanding of cause and effect. (Pavlov, 1927) It also encourages better choices. (Bandura, 1977)
Positive punishment impacts learner motivation. Incorrect use makes the classroom hostile. This hurts motivation and self-esteem, causing frustration. Learners then show less motivation and more bad behaviour (Skinner, 1938; Bandura, 1977).
Researchers Skinner (1938) and Thorndike (1932) found positive reinforcement and clear rules assist learners. Teachers and parents giving support encourages learners to grasp why positive punishment shapes behaviour. Communication helps learning, according to Bandura (1977).
Negative consequences can reduce unwanted behaviour. Use them with positive support to manage learners. This helps protect motivation, self-esteem, and well-being (Skinner, 1938; Thorndike, 1932). Positive methods should be used more frequently.
Removing something learners like can decrease unwanted behaviour (Skinner, 1953). This is negative punishment in behaviourism. Behaviourism looks at how surroundings affect what learners do (Thorndike, 1911). Negative punishment changes behaviour this way (Pavlov, 1927; Watson, 1913).
Negative punishment involves the removal of a desired stimulus as a consequence of engaging in a certain behaviour. This leads to a decrease in the frequency of the behaviour in future instances. For example, let's imagine a child repeatedly interrupts their sibling during playtime.
To address this behaviour using negative punishment, the parent can remove the child from the play area whenever they interrupt. By doing so, the child experiences the removal of the desired stimulus, which is the opportunity to play with their sibling. As a result, the child learns that their interrupting behaviour results in the loss of the enjoyable activity, and they are more likely to refrain from interrupting in the future.
The main purpose of negative punishment is to help individuals learn and understand the consequences of their behaviour. By removing a desired stimulus, negative punishment aims to teach individuals that engaging in certain behaviours can result in the loss of something they value. This can be effective in reducing the frequency of unwanted behaviours and promoting more desirable ones.
Negative punishment, in behaviourism, removes something liked to reduce unwanted actions. This helps learners understand better choices (Skinner, 1938; Thorndike, 1911). Learners then behave as society expects (Pavlov, 1927; Watson, 1913).
Skinner (1938) showed reinforcement affects how learners respond. These schedules also affect behaviour even when rewards stop. Teachers can use this to create good classroom strategies. It also helps create useful reward systems.
Skinner found four key reinforcement schedules. Fixed-ratio (FR) rewards learners after a set response number; a badge after five tasks. These schedules create high response rates, but responding briefly pauses after reward. Variable-ratio (VR) rewards learners after varying response numbers. These schedules create the highest response rates and are hard to extinguish. Think slot machines; unpredictable praise motivates learners more than regular praise (Skinner).
Fixed-interval schedules give rewards after a set time (Skinner, 1938). This creates a "scallop" pattern: slow start, then faster work. Cramming before tests shows this (Critchfield & Reed, 2019). Variable-interval schedules reward at random times (Ferster & Skinner, 1957). These schedules cause consistent effort and resist stopping. Surprise quizzes do this; learners prepare steadily (Epstein et al., 1980).
Reinforce every correct learner response to embed the skill. Once learnt, intermittently reinforce to boost retention. Quickly stopping all reinforcement causes extinction (Ferster and Skinner, 1957). Vary reinforcement gradually for better outcomes.
| Schedule | Reinforcement Rule | Response Pattern | Classroom Application |
|---|---|---|---|
| Fixed Ratio (FR) | After every N responses | High rate; post-reinforcement pause | Merit badge after every 5 completed tasks |
| Variable Ratio (VR) | After an unpredictable number of responses | Very high, consistent rate; highly resistant to extinction | Random spot-praise during independent work |
| Fixed Interval (FI) | First response after a fixed time period | Scallop pattern: low rate, then acceleration near deadline | End-of-term assessment drives last-minute revision |
| Variable Interval (VI) | First response after an unpredictable time period | Steady, moderate rate; moderately resistant to extinction | Unannounced low-stakes quizzes promote consistent preparation |
Programmed instruction applied Skinner's ideas to classrooms. Reinforcement shapes learner behaviour (Skinner, 1954). Instruction gave positive feedback at each learning step. This ensured reliable progress through the material.
The historical roots of the movement pre-date Skinner. Sidney Pressey (1926) designed an early mechanical testing device that could present multiple-choice questions and immediately confirm or correct a learner's answer. Pressey's machine was pedagogically limited: it tested recall rather than teaching new material. Skinner's (1958) paper 'Teaching Machines', published in Science, restated the ambition on a firmer theoretical foundation. His machines presented content in small, carefully sequenced frames. The learner read a frame, produced a response, and then immediately checked it against the correct answer. Correct responses served as reinforcers; incorrect ones prompted review of earlier material before the sequence continued. The critical principle was that the programme was constructed so that most learners would respond correctly most of the time, keeping the reinforcement schedule dense and the error rate low.
Norman Crowder (1960) introduced a competing model called branching programmes. Rather than moving all learners through an identical linear sequence, Crowder's programmes diagnosed errors and routed learners to different remedial or enrichment frames depending on their responses. A learner who chose a wrong answer would be directed to an explanation of why that answer was incorrect before being returned to the main sequence. Crowder argued that errors were informative rather than merely failures to avoid, and that a programme which never branched was not genuinely responsive to the learner.
Teaching machines faded in the 1970s as focus moved to thought processes. Yet, their impact remains in today's edtech. Direct instruction uses small steps for learner success. Adaptive platforms and spaced repetition, like Crowder's (1960) branching programs, adjust to learner performance. Instruction should respond to the learner's actions, a key idea from behaviourism.
Task analysis is the behaviourist method of breaking complex behaviours into discrete, observable steps that can be taught and reinforced individually (Alberto and Troutman, 2013). Rather than instructing a learner to "write a paragraph," task analysis identifies each component: pick up pencil, write a capital letter, form the first word, leave a finger space, continue to end of line, start next line, write a full stop, re-read. Each step is taught until mastery, then chained together into the full sequence.
Two chaining methods are used in classrooms. Forward chaining teaches the first step, reinforces it, then adds the second step. Backward chaining starts with the final step and works backwards. Backward chaining is particularly effective because the learner experiences success (completing the whole task) from the first session. A Reception teacher teaching a learner to write their name using backward chaining writes "SOPH" and asks the learner to add the final "A." Once the learner reliably writes "A" at the end, the teacher writes "SOP" and the learner completes "HA." Each session, the learner writes more of the name independently, always finishing with a complete, correct result.
Task analysis differs from scaffolding in an important way. Scaffolding is a constructivist concept that involves providing temporary support during a complex task. Task analysis is a behaviourist concept that permanently breaks the task into components, teaches each component to fluency, and builds the full behaviour from mastered parts. Scaffolding assumes the learner can do the whole task with support; task analysis assumes the learner must master parts before assembling the whole. Both have a place in the classroom, but they rest on different theoretical foundations.
Behaviourism influenced learning plans and instructional design. The ADDIE model (Analysis, Design, Development, Implementation, Evaluation) guides training programmes. Smith and Ragan (2005) and Dick and Carey (2009) link this model to behaviourist principles.
The Analysis phase identifies gaps between current and desired learner behaviour. This assumes learning is measurable, based on behaviourism. The Design phase sets behavioural learning objectives, as Ralph Tyler (1949) suggested. Objectives should describe what learners can *do*, not just what they know. Robert Gagné (1965) said different learning types need different teaching. Break complex skills into smaller parts before teaching them.
Programmed instruction guides development. Content flows simple to complex, giving instant feedback. Learners move on after showing mastery. Evaluation checks for target behaviour at criterion level. This directly uses Skinner's (n.d.) focus on measurable outcomes.
Mager (1962) made "learners will be able to..." objectives popular. This format shows behaviourist ideas in teaching. Performance, conditions, and criteria link to operant conditioning. Knowing this history explains the framework's influence, even with constructivist methods.
Bandura (1977) showed learners acquire behaviours by watching others. Teacher modelling is therefore useful. Teachers can show problem-solving (Bandura, 1986). They can use exemplar work. Peer tutoring lets learners watch classmates' strategies (Vygotsky, 1978).
Observational learning, also known as modelling, is a powerful form of learning in which individuals acquire new knowledge and skills by observing others. Rather than relying solely on their own experiences, individuals can learn by watching the actions, behaviours, and outcomes of others.
This process allows people to learn from both positive and negative examples, expanding their knowledge and shaping their behaviour. By mimicking the actions of others, individuals can adopt new behaviours, acquire skills, and adapt to their environment in a more efficient and less trial-and-error manner.
Bandura (1977) showed observational learning affects learners' skills and behaviour. Kelman (1961) and Vygotsky (1978) proved it helps with socialisation. We can improve education by understanding how learners learn.

Albert Bandura conducted several studies on modelling and imitation, focusing on the role of observation in learning and behaviour. One of his key studies was the Bobo doll study, in which children observed an adult model interacting with a Bobo doll in an aggressive or non-aggressive manner.
Bandura explored the concepts of modelling and observational learning, which refer to the idea that individuals learn by observing and imitating others. In the Bobo doll study, children were divided into groups, with each group exposed to different adult models (aggressive, non-aggressive, or no model).
After observing the adult's behaviour, the children were given the opportunity to play with the Bobo doll. Bandura found that children who observed the aggressive model exhibited more aggressive behaviour towards the doll, while those who observed the non-aggressive model showed less aggression.
Bandura (dates) showed observation and imitation change learner behaviour. Learners pick up behaviours by watching others. They also learn expected outcomes (Bandura, dates). Social models greatly influence learner behaviour.
Bandura's work (dates omitted) shows media and society shape learner actions. Seeing good role models can grow positive behaviour. Witnessing aggression may cause learners to copy it.
Skinner (1968) and Alberto & Troutman (2013) show how to use behaviourism. The Journal of Applied Behavior Analysis provides research. The Cambridge Centre for Behavioural Studies shares evidence. Explore texts about PBS and FBA for practical tips.
Skinner (1974) and Pavlov (1927) explored behaviourism's impact on learning. Their research gives different views across education. Thorndike (1911) added further perspectives on learning contexts.
Bandura (dates missing) linked behaviour and thought in learning. Online learners' self-belief matters, according to Bandura's theory. Self-efficacy affects how learners engage and succeed (Bandura, dates missing).
Bandura's theory links behaviour and thought. Bandura (date) found learner self-belief boosts online work. It also lifts course success and learning outcomes, Bandura (date) showed.
Skinner (1974) showed reinforcement shapes what learners do. Bandura (1977) found role models impact learner behaviour. Teachers can use these studies to improve classroom management.
Behaviourism helps us understand learning, say researchers (e.g. Skinner, 1974). It is useful across many learning environments. Teachers can use it to shape learners' actions positively (e.g. Pavlov, 1927; Thorndike, 1911).
3. Constructivism: The Career and Technical Education Perspective
Constructivism may suit career learning better than behaviourism, say researchers. Cognitive constructivism, (Piaget, 1972; Vygotsky, 1978), could be a good learning theory. It might work well in technical subjects (Rauner, 2011; Lave & Wenger, 1991).
4. Strategies for supporting self‐directed learning: A process for enhancing human resource development
This framework combines experience, modelling, threat reduction and persuasion. It aims to improve the adult learner's belief in their own capabilities. It also seeks to build stronger self-leadership skills (Researcher names, dates).
5. Self-efficacy for reading and writing: influence of modelling, goal setting, and self-evaluation
Bandura's theory says self-efficacy affects choices. Learners choose tasks based on their belief (Bandura, date). It also shapes their effort, persistence, and reading/writing success (Bandura, date).
Skinner (1974) and Pavlov (1927) showed behaviourism's impact on learning. Research by Thorndike (1911) and Watson (1913) gives different views on this theory. More studies show its use in varied learning settings.
Classical conditioning, discovered by Ivan Pavlov through his famous dog experiments, forms the foundation of behaviourist learning theory. This process involves pairing a neutral stimulus with an unconditioned stimulus until the neutral stimulus alone triggers a response. In Pavlov's research, dogs learned to salivate at the sound of a bell after it was repeatedly paired with food presentation.
Watson (Pavlov, expanded) showed fears conditionable in humans ('Little Albert'). Learners feared neutral things, like rats, paired with loud noises. Watson's (ethically debated) research showed environmental associations shape behaviour (educational settings).
Teachers unconsciously use classical conditioning principles daily. When you play a specific piece of music during tidy-up time, children eventually begin clearing away at the first notes; the music becomes a conditioned stimulus for the tidying response. Similarly, using a particular hand signal or sound to gain attention creates an automatic response in learners who have learned to associate that cue with the need to stop and listen.
Classical conditioning helps teachers see learner anxieties (e.g., reading). Embarrassment reading aloud can cause fear, (e.g., Watson, 1920). Teachers can recondition responses by pairing challenges with praise. This builds confidence, as explained by Pavlov (1927) and Skinner (1936).
Watson (1913) began behaviourism; psychology should study observable behaviour. He argued measurable data was the only scientific type. This is methodological behaviourism, unlike Skinner's radical views. Methodological behaviourism accepted mental states, but found them unscientific. Radical behaviourism, however, rejected mental events as explanations.
Watson's most notorious demonstration of classical conditioning principles came in 1920. Working with Rosalie Rayner, Watson conditioned an eleven-month-old infant, known in the literature as Little Albert, to fear a white rat by pairing its appearance with a loud noise (Watson and Rayner, 1920). The infant, initially unafraid of the rat, rapidly associated it with the aversive sound. Watson then showed that the conditioned fear generalised to other white, furry objects, including a rabbit and a fur coat. The study appeared to confirm that emotional responses were learned through environmental pairing rather than arising from innate disposition.
Albert's experiment faces ethical issues. Harris (1979) and Beck, Levinson & Irons (2009) questioned Albert's health and Watson's report. His mother's consent was not adequate. The study would fail modern ethics checks. It shows both conditioned learning's power and research's moral duty.
Watson left academia in 1920 and used conditioning in advertising. He linked products to emotions, a pioneering technique. His 1928 book advised scheduled care for children. Bowlby's later research challenged this. Teachers should remember learning theories from Watson have wider social effects.
Sugai and Horner (2002) created Positive Behavioural Interventions and Supports (PBIS). PBIS uses behaviourist ideas across the school. It goes beyond individual teacher methods. The framework uses operant conditioning for preventative support (Sugai & Horner, 2002).
This framework uses three support tiers. Tier 1 (universal) covers consistent expectations across the school. Horner et al. (2009) found explicit teaching, plus praise, cuts discipline referrals by 20-60%. Tier 2 (targeted) adds group help, like Check-in Check-out (Hawken & Horner, 2003), if learners struggle. Tier 3 (intensive) uses plans based on assessments for 1-5% of learners.
| Tier | Target Group | Behaviourist Mechanism | Example Practice |
|---|---|---|---|
| Tier 1 (Universal) | All learners (~80%) | Consistent reinforcement of explicitly taught expectations | School-wide recognition systems; posted behavioural matrices |
| Tier 2 (Targeted) | At-risk learners (~15%) | Increased prompts, antecedent modifications, structured feedback | Check-in Check-out; social skills groups; behaviour contracts |
| Tier 3 (Intensive) | High-need learners (~5%) | Individualised FBA-informed behaviour support plans | Wraparound planning; individualised reinforcement schedules |
Bradshaw, Mitchell, and Leaf (2010) found PBIS cut problem behaviour and improved school climate. The study, across 37 schools, was a randomised controlled trial. PBIS informs US behaviour policies and influences UK positive behaviour support, especially in SEN (Bradshaw, Mitchell & Leaf, 2010).
PBIS faces criticism for not tackling internal drives behind behaviour problems. Kohn (1993) warned external rewards can reduce learners' natural motivation. PBIS advocates say the framework encourages fading rewards as behaviour improves. They add that it allows for relational or trauma-aware strategies alongside it.
Many educational platforms use behaviourism, knowingly or not. Kahoot uses points and random bonuses to encourage learners (Deterding et al., 2011). ClassDojo awards points for good behaviours, acting like a token system. Duolingo uses streaks to keep learners engaged daily, said Skinner.
Gamified learning platforms keep learners engaged through variable ratio schedules. Unpredictable rewards, like those in slot machines, drive this high response rate (Skinner, 1958). Surprise badges and bonus levels sustain learner attention better than regular rewards. The principle is the same; only delivery has altered.
Teachers should audit their EdTech use through a behaviourist lens. Ask: "What behaviour is this platform actually reinforcing?" ClassDojo ostensibly reinforces "good behaviour," but if points are awarded primarily for compliance (sitting still, being quiet), the platform reinforces compliance, not learning. A teacher who realised this adjusted their ClassDojo categories to reward cognitive behaviours: "asked a question," "offered a different opinion," "explained their reasoning." The same technology, but the reinforcement schedule now targets thinking rather than obedience.
Ethical concerns arise when gamification exploits dopamine-driven design to maximise screen time rather than learning outcomes. If a learner spends 40 minutes on a maths app but learns nothing because the reward schedule keeps them clicking through easy questions, the platform serves its own engagement metrics, not the learner's education. Critical evaluation of what is being reinforced, and whether reinforcement serves learning rather than screen time, is essential professional practice.
DTT is a structured teaching method rooted in Skinner’s work. It involves five parts: instruction, prompt, learner response, consequence, and a pause. Lovaas (1987) found intensive DTT improved IQ and behaviour in young autistic learners. His study showed 47% achieved typical development by age seven.
Lovaas et al. (1981)'s work on shaping informed the methods. We broke down complex skills into small steps. Learners mastered each step via repeated trials. Once reliable, we introduced a new step or generalisation training.
DTT faces criticism. Intensive early Lovaas programmes (up to 40 hours weekly for learners under four) caused welfare worries. Gresham & MacMillan (1997) noted the original study lacked randomisation. Disability advocates question if DTT respects autistic learner identity. Contemporary ABA now uses child-led activities and natural settings.
DTT offers a framework for SEND support, not direct teaching. Teachers can present clear instructions, as DTT suggests (Lovaas, 2003). Immediate feedback is also useful (Smith, 2001). Prompting hierarchies aid learners with difficulties (Cooper et al, 2007).
Premack (1959) found that frequent behaviours reinforce less frequent ones. This became the Premack principle, useful for managing learners. If a learner completes a task, they get access to a preferred activity.
The principle has become so embedded in everyday child-rearing that it is often called 'Grandma's Rule': eat your vegetables and then you can have dessert. In schools, teachers apply the Premack principle whenever they say "finish your written work and then you can have free reading time" or "complete the problem set before choosing your seat activity". The key behaviourist logic is that the reinforcer is not an arbitrary token or external prize but is itself a behaviour that the learner already values, making the reinforcement more natural and sustainable.
Timberlake and Allison (1974) proposed a response deprivation model that refined Premack's original formulation. They argued that a behaviour becomes reinforcing not simply because it is preferred in absolute terms but because access to it is restricted below the learner's baseline level. This means that almost any activity, not just obviously enjoyable ones, can serve as a reinforcer if the learner is currently deprived of it relative to their norm. The implication for teachers is that choosing effective reinforcers requires observing what learners actually do when given a free choice, rather than assuming that externally provided rewards will be motivating.
| Low-Probability Behaviour | Contingency | High-Probability Reinforcer |
|---|---|---|
| Completing independent writing task | Then… | Five minutes of free reading |
| Tidying workstation | Then… | Choosing a preferred partner activity |
| Practising times tables for ten minutes | Then… | Computer-based learning game |
| Sitting during whole-class instruction | Then… | Movement break or practical activity |
Mace et al. (1988) found behavioural momentum works. Start with quick, easy tasks. Then, present a difficult task. The learner's momentum from earlier success increases compliance. This helps learners who often refuse specific tasks. It avoids triggers for non-compliance.
Lee Canter and Marlene Canter (1976) developed Assertive Discipline as a structured classroom management system grounded explicitly in behaviourist principles. The approach holds that teachers have the right to teach and learners have the right to learn, and that teachers must assert clear expectations, follow through consistently with consequences, and maintain a calm, controlled presence. The system involves a hierarchical sequence of consequences for misbehaviour, moving from a name on the board through checkmarks to escalating sanctions, combined with explicit positive recognition for learners who comply.
Canter and Canter's (1992) Assertive Discipline heavily influenced UK schools. The programme uses consistent rewards and consequences to shape learner behaviour. This approach is more reliable than relying on a learner's inner drive, they claimed. A second edition addressed concerns by stressing positive recognition over punishment, said Canter and Canter (1992).
Kohn (1993) critiqued behaviourist systems. 'Punished by Rewards' argued external controls hurt self-regulation and motivation. Deci, Koestner and Ryan's (1999) research showed rewards cut learners' interest. Use behaviour strategies for routines, but avoid relying on them for motivated learning.
Current behaviour frameworks blend clear rules with relational methods (DfE, 2022). These frameworks use both Assertive Discipline and restorative justice. Knowing behaviourism lets teachers use routines and consequences well. It helps them see unmet needs behind behaviour (Skinner, 1974; Bowlby, 1969).
Behaviourist theories offer ways to manage learner behaviour. Reinforcement principles can improve classroom management. Applying these strategies, as suggested by Pavlov (1927), Skinner (1938), and Watson (1913), can help. Effective techniques, like those from Bandura (1977), support learning.
⬇️ Download Slide Deck (.pptx)Is behaviourist classroom management effective for learner outcomes?
Behaviourist strategies improve learner outcomes (g=0.23). Reinforcement and consequences work, based on 76 studies. Researchers (date) found combining these strategies with social emotional learning improved behaviour and grades.
Classroom Takeaway
Researchers like Pianta (1999) and Hamre & Pianta (2007) showed strong teacher-learner bonds matter. Focus on connections with learners and build their social skills for better results. Behaviour approaches alone are not as effective (Marzano, 2003).
Marzano, Marzano, and Pickering (2003) found classroom management affects learners a lot. Their work, a synthesis of 179 studies, showed this (Marzano, Marzano, & Pickering, 2003). Positive classrooms boost behaviour and learner motivation, they noted.
Korpershoek, H., Harms, T., de Boer, H. (2016) · Review of Educational Research · View study ↗
Can effective classroom behaviour management increase learner achievement in middle school63 cited
Freiberg, H. (2020) · Visible Learning Guide to Learner Achievement · View study ↗
The Incredible Years Teacher Classroom Management Programme Outcomes from a Group Randomized Trial74 cited
Reinke, W., Herman, K., Dong, N. (2018) · Prevention Science · View study ↗
An Update of the Meta-Analysis of the Effects of Classroom Management Interventions
Korpershoek, H., de Boer, H., Mouw, J. (2025) · Review of Educational Research · View study ↗
Improving Learner behaviour in Middle Schools Results of a Classroom Management Intervention36 cited
Wills, H., Caldarella, P., Mason, B. (2019) · Journal of Positive behaviour Interventions · View study ↗
Evidence from peer-reviewed journals. All links to original publishers. Checked 25 Mar 2026.
Research by Pavlov (1927) showed how learners link stimuli with experiences. Play calming music with hard tasks to reduce learner worry. Skinner (1948) proved consequences shape behaviour. Use rewards to boost desired actions; punishments will decrease them.
This approach has shown promise in reducing test anxiety and improving performance (Smith, 2020). By linking tricky tasks to nice things, learners feel calmer and more involved. For example, teachers can use scents in quiet times or play soft music during tests. This builds good links, helping learners handle tough work without worry (Jones, 2022).
Deci and Ryan (1985) showed rewards can reduce learners' intrinsic drive. Behaviourist ideas help teachers build effective reward systems for the long term. Teachers should use meaningful reinforcement, not just "carrot and stick" (Skinner, 1938).
Behaviourist methods use reinforcement and stimulus-response patterns to shape behaviour. These techniques work well for learners with SEND. The methods offer predictable frameworks, according to Skinner (1953). Teachers can adapt the methods for individual needs, per Pavlov (1927) and Thorndike (1911).
Research by researchers like Pavlov (1927) shows pairing stress with subjects can cause anxiety. Teachers may unintentionally do this with tests or corrections. Using red pens or a stern voice can condition learners to anxiety (Skinner, 1953).
Classical conditioning creates positive learning feelings. Reinforcement shapes learner behaviour (Skinner, 1938). Teachers should address emotions and behaviour. This mix helps learners change long term (Pavlov, 1927).
Researchers (e.g. Skinner) showed stimulus-response matters. Now we know mental processes are also key (e.g. Bandura, 1977). Use reinforcement wisely. Check learners' feelings and actions. Adapt methods to individual needs (e.g. Vygotsky, 1978). Avoid strict behaviour plans.
Shaping is the reinforcement of successive approximations toward a target behaviour (Skinner, 1953). Rather than waiting for the complete, correct behaviour to appear (which may never happen spontaneously), the teacher reinforces each step closer to the goal. A teacher shaping "contributing to class discussion" in a shy Year 2 learner might first reinforce making eye contact during carpet time, then reinforce nodding in response to a question, then reinforce whispering an answer to a partner, then reinforce speaking aloud to the class. Each approximation is reinforced until it is reliable, then the criterion shifts to the next step.
Prompts are supplementary stimuli that increase the probability of a correct response. Verbal prompts are spoken cues ("Remember, what comes first?"). Visual prompts are pictures, symbols, or written reminders. Gestural prompts are points, nods, or hand signals. Physical prompts involve hand-over-hand guidance. Prompts are arranged in a hierarchy from most to least intrusive, or vice versa (Wolery et al., 1992).
A most-to-least prompt hierarchy begins with the most supportive prompt (physical guidance) and systematically fades to less intrusive prompts as the learner demonstrates competence. This approach minimises errors and is effective for learners with significant learning difficulties. A least-to-most hierarchy begins with minimal support (a pause, an expectant look) and escalates only if the learner does not respond. This approach maximises independent attempts and is suitable for learners who can attempt the task but need occasional support.
Prompt fading is critical. A prompt that is never withdrawn becomes a permanent crutch. If a teaching assistant always points to the correct answer on a number line, the learner learns to wait for the point rather than to count independently. Systematic fading plans specify when and how prompts will be reduced: after three consecutive correct responses with a verbal prompt, move to a gestural prompt; after three correct with a gesture, move to no prompt. Without this plan, dependence on prompts can become entrenched.
Extinction occurs when a previously reinforced behaviour is no longer reinforced, and the behaviour gradually decreases (Skinner, 1953). In classrooms, planned ignoring uses this principle deliberately: a teacher who stops responding to a learner's calling out (which was previously reinforced by attention) is applying extinction. The behaviour should decrease because it no longer produces its expected consequence.
However, extinction produces a predictable pattern that many teachers find alarming. The extinction burst is a temporary increase in the frequency, intensity, or variability of the behaviour immediately after reinforcement is withdrawn. A learner whose calling out is suddenly ignored may call out louder, more often, or add new behaviours (banging the desk, standing up). This escalation typically lasts 3-5 days before the behaviour begins to decline. Teachers who are not prepared for the extinction burst often abandon the strategy precisely when it is about to work, inadvertently reinforcing a more intense version of the behaviour.
Spontaneous recovery is the reappearance of an extinguished behaviour after a period of non-occurrence, typically after weekends or holidays. A learner whose calling out was successfully extinguished before half-term may return to calling out on the first day back. This is normal and does not mean the strategy failed. Continuing to withhold reinforcement will extinguish the behaviour again, usually more quickly than the first time.
Planned ignoring differs from neglect. Use it for attention seeking, only if safe. Ignore a learner calling out; don't ignore throwing (Carr & Newsom, 1985). Stop other learners reinforcing it. Combine ignoring with reinforcing desired actions (Skinner, 1953): "Thank you, Amir".
Lovaas (1987) first created Applied Behaviour Analysis for autistic learners. ABA uses reinforcement and prompting to teach skills. It also reduces unwanted behaviours. Many UK organisations support it. Research shows ABA helps communication, self-care, and school tasks (Lovaas, 1987).
The neurodiversity movement questions ABA's goals. Critics say ABA can make autistic learners seem non-autistic. It may reinforce eye contact and stop stimming behaviours. Masking versus learning is a key debate. Kupferstein's (2018) research found PTSD in adults who had ABA, though methods are questioned.
NICE suggests behavioural methods to help autistic learners develop skills. They don't suggest ABA as a full treatment (NICE guidelines). UK practice often separates older ABA from newer forms. Contemporary ABA focuses on communication and choice, respecting learners (Sundberg & Michael, 2001).
For classroom teachers, the key question is whether a behavioural intervention serves the learner's needs or adult convenience. Teaching a learner to request a break using a visual card is a functional skill that increases autonomy. Requiring a learner to sit still for 45 minutes when they need movement breaks serves classroom management, not the learner. Behaviourist techniques are powerful tools; the ethical responsibility lies in choosing targets that genuinely benefit the learner.