Behaviourism in Education: Pavlov, Skinner and Thorndike ExplainedGCSE students aged 15-16 in navy blazers with striped ties, seated at desks, receiving positive reinforcement from the teacher.

Updated on  

May 17, 2026

Behaviourism in Education: Pavlov, Skinner and Thorndike Explained

|

November 17, 2023

Explore behaviourism in education: how Pavlov, Skinner, and Thorndike shaped modern teaching. Discover practical applications, limitations, and fresh research angles.

Build your next lesson freeExplore the toolkit
Copy citation

Main, P. (2023, November 17). Theory of Behaviorism in Learning. Structural Learning. Retrieved from https://www.structural-learning.com/post/theory-of-behaviorism-in-learning

Behaviourism in education explains learning as a change in behaviour shaped by experience, practise and consequences. It is most closely associated with Ivan Pavlov, who showed how associations are formed, B. F. Skinner, who explored how rewards and consequences influence actions, and Edward Thorndike, who argued that responses followed by satisfying outcomes are more likely to be repeated. In classrooms, these ideas appear in routines, feedback, praise, repetition and carefully structured reinforcement. Understanding how these three thinkers connect can reveal why some teaching strategies stick and others quickly fade.

Applied Behaviour Analysis in Schools

ABA uses learning theory principles for practical school interventions. Baer, Wolf and Risley (1968) defined ABA in their key paper. They said programmes must target important behaviours. Also, programmes need a clear link between action and change. Finally, interventions must be replicable, said Baer et al. These criteria still guide the field.

Token economies are common ABA tools. Kazdin (1982) reviewed them extensively. Learners earn tokens for target behaviours. They exchange tokens for rewards. Token economies work best when behaviours are clear. Transparent exchange rates and valued rewards help. They increase on-task behaviour and reduce disruption. Fade token delivery as natural rewards sustain behaviour (Kazdin, 1982).

Ogden Lindsley (1964) created Precision Teaching to monitor learner progress using fluency. This method tracks correct and incorrect responses per minute on a Standard Celeration Chart. Precision Teaching builds accuracy and fluency in basic skills. Fluent performance resists forgetting and transfers to new situations better (Lindsley, 1964).

Engelmann and Carnine's (1982) Direct Instruction uses behaviourist ideas, shown in Project Follow Through. It includes scripted lessons and fast feedback. Smith (2001) found Applied Behaviour Analysis (ABA) helps learners with autism and SEND. Functional Behaviour Assessment (FBA) is key; it finds why behaviours happen. Many US schools require it; England's SEND Code of Practise suggests it. Before changing behaviour, understand its purpose for the learner.

What Is Behaviourism in Education?

Behaviourism in education is a teaching approach that uses rewards, consequences, and routines to shape learners' behaviour in school. Timpson's (2019) review showed SEND learners are five times likelier to be excluded (8.4% vs 1.3%). Rigid rules can punish unmet needs, mistaking cries for help as defiance.

Behaviourism, from Watson and Skinner, views learning via behaviour alone. It studies actions shaped by rewards and punishments. This theory ignores the mind, treating it like a closed box. Schools use behaviourist ideas in reward schemes (e.g. Skinner, B.F., date unknown). They are also used in behaviour rules and practise (e.g. Watson, J., date unknown).

Skinner (1974) thought learners change behaviour through conditioning. Reinforcement and punishment still matter in classrooms. Cognitive ideas grew with Bandura (1977). Rewards are informed by Thorndike's (1911) behaviourism.

Behaviourist ideas support many strict school policies. Schools sanction bad behaviour to stop it (Timpson Review, 2019). These policies negatively affect neurodivergent learners. SEND learners face up to five times greater exclusion risk (Timpson Review, 2019).

Behaviourism studies human response (Skinner, 1938). Applying it without adjustments can harm learners. Good behaviour management needs more than just standard sanctions. We must change this flawed idea (Rogers, 2011; Maslow, 1943).

Why Reward Systems Can Backfire

Reward systems are behaviour-management tools that backfire when rewards and punishments replace relationships, reflection, and self-regulation. Positive reinforcement increases desired actions, punishment decreases unwanted ones. Schools use behaviourist systems like sticker charts. However, these often fail quickly or cause new issues. We must differentiate science (true principles) from classroom systems (often flawed). (Skinner, 1974) and (Pavlov, 1927) offer key insights.

When Rewards Undermine Motivation

The mistake: Introducing an external reward system for behaviour that should be intrinsically motivated, completing work, listening in class, respecting peers. Initially, it works. Learners chase points or stickers. But within weeks or months, two things happen: the system stops working (learners ignore the reward), or it backfires (learners misbehave specifically to lose points, treating it as a game).

Behaviourism explains this with reward satiation and learned consequences. Deci and Ryan (2000) showed rewards can hurt intrinsic motivation. Learners getting stickers may think they don't enjoy the task. Intrinsic motivation falls, requiring bigger rewards over time.

Classroom symptom: Year 3 learners earn points for sitting quietly during carpet time. They sit quietly. Year 4, the threshold increases, now it takes two extra minutes of silence to earn the same point. By Year 5, a learner asks: "Do I get a point if I sit quietly without being told?" The question reveals the problem: external reward has crowded out the internal principle that listening is a sign of respect and a path to learning.

Remove the instruction and integrate the citations (Kazdin, 1982; Lepper, Greene, & Nisbett, 1973) into the prose as intended. Make natural consequences clear (Deci & Ryan, 1985). Learners will see listening helps understanding and reduces anxiety (Pink, 2009).

DfE (2016) values consistent, clear behaviour systems but doesn't demand points. The best schools build positive cultures, not just reward good behaviour. Bennett (2017) warns against relying too heavily on unsustainable rewards.

Inconsistent or Arbitrary Consequences

The mistake: A behaviourist system requires precise, consistent application of consequences. But schools are human systems. The same behaviour is rewarded one day and ignored the next, depending on the teacher's mood or workload. Or consequences are applied arbitrarily: one learner gets a warning, another gets a detention, for the same infraction.

Behaviourists state that learners need clear outcomes to learn rules. Mixed responses teach them that life is not predictable. Researchers note this can cause stress. It can also lead to rule testing and distrust. Please add peer-reviewed citations here. These should discuss the mental impact of mixed rewards.

Classroom symptom: A secondary learner is late to class. One teacher gives a detention. Another gives a warning. The learner doesn't learn "punctuality matters"; they learn "consequences depend on who's looking." They improve behaviour around which teachers enforce which rules, not around the principle that punctuality is respectful.

The fix: Behaviourist systems require infrastructure: explicit written behaviour codes, trained staff who apply them consistently, systems (like duty rotas) that ensure visible enforcement. This is expensive in time and attention. Most schools can't sustain it at scale. A more realistic alternative is to invest in culture: shared norms where learners understand the reason for a rule (punctuality shows respect for peers' learning time), not just the consequence. Once internalised, the rule holds even if enforcement is inconsistent.

Short-Term Compliance, Lasting Disengagement

The mistake: Behaviourist systems produce obedience, not autonomy. Learners sit quietly in assembly, not because they value the moment, but because points depend on silence. The behaviour stops the moment the external reward is gone or the enforcer leaves the room.

Why it matters: Research on self-determination theory (Deci & Ryan, 2000) shows autonomy is vital. Autonomy means you choose your own behaviour rather than feeling forced. This choice is crucial for long-term engagement and wellbeing. Behaviour driven only by outside rewards is brittle.

Classroom symptom: A Year 6 learner is well-behaved under a strict teacher with a visible reward system. They move to secondary, where a more permissive, student-centred approach exists. They quickly become disengaged, not because they've learned nothing, but because external structure has crowded out the development of self-regulation.

The fix: Shift from rewarding compliance to teaching autonomy. Give learners genuine choice within boundaries: "You can work on maths or English first; both are non-negotiable." Explain the reason for rules, not just the consequence: "We raise hands so everyone gets heard, not to earn points." Over time, learners internalise the principle and self-regulate even without external oversight.

When Behaviourist Strategies Work

This isn't a brief against behaviourism. The science is sound. Consequences do shape behaviour. The problem is the conditions under which behaviourism works in classrooms are often not met. When they are met, behaviourist principles are powerful:

Condition 1: Clear, Salient Behaviour Rules. Learners must know exactly which behaviour is being reinforced or punished. "Good sitting" is vague. "Feet on the floor, back against the chair, eyes on the speaker" is clear. Behaviourism works when the target behaviour is visible and unambiguous.

Immediate feedback helps learners improve. Consequences should happen right after the behaviour for best results (Skinner, 1938). Consistent responses from all adults are vital (Alberto & Troutman, 2013). Inconsistency only teaches learners that rules change.

Deci, Koestner, and Ryan (1999) found rewards should reduce as learning becomes automatic. Continuing rewards cause dependency and decrease learner autonomy. Cameron and Pierce (1994) support this with similar findings.

Condition 4: Alignment with School Values. Behaviour systems should reinforce the school's stated values, not contradict them. A school that claims to value "curiosity" but punishes learners for asking questions is using behaviourism to teach the opposite of its stated intent.

Classroom example: When it works. A primary school uses clear behaviour expectations (green for on-task, yellow for off-task warning, red for sent out). Teachers apply these consistently within and across classrooms. New learners in Reception receive immediate, visible feedback. By Year 2, most learners have internalised the expectations and require fewer external reminders. By Year 4, the system is less visible because learners have learned to self-regulate. The behaviour system has done its job, bootstrapping external regulation into internal control, and can step back.

UK Behaviour Policy and Behaviour Hubs

Behaviour hubs and schools like Michaela show different ways to manage actions. Being clear and consistent is more important than using reward systems. The DfE guidance (2016) asks for clear behaviour policies that staff apply fairly. The rules do not force schools to use behaviourist systems. They only ask for consistency.

Sticker charts help learners learn new behaviours (Skinner, 1938). Use them briefly and consistently, then stop. Avoid them long-term for things learners enjoy (Deci & Ryan, 2000). This can harm motivation (Ryan & Deci, 2020).

The deepest behaviourist principle isn't "reward good behaviour." It's "behaviour is shaped by consequences." That includes the consequence of having your autonomy respected, of being trusted to self-regulate, of seeing the meaningful reason why a behaviour matters. Schools that internalise this shift from external reward systems to cultures where desired behaviour is the norm, expected because it reflects the school's values and the learner's growing autonomy, those schools have truly learned from behaviourism.

Evidence Overview

Chalkface Translator: research evidence in plain teacher language

Academic
Chalkface

Evidence Rating: Load-Bearing Pillars

Emerging (d<0.2)
Promising (d 0.2-0.5)
Robust (d 0.5+)
Foundational (d 0.8+)

Key Takeaways

  1. Applied Behaviour Analysis provides a strong framework for evidence-based behaviour management in schools. This approach, formally established by Baer, Wolf, and Risley (1968), insists that interventions must address socially significant behaviours, demonstrate a functional relationship, and be described precisely for replication. Teachers can use ABA principles to design effective programmes that genuinely improve learners' learning and social behaviours.
  2. Operant conditioning principles are indispensable for shaping positive classroom behaviours and academic engagement. B.F. Skinner's foundational work (Skinner, 1953) demonstrated how consequences influence the likelihood of future actions. By systematically applying positive reinforcement, teachers can increase desired behaviours, such as participation and task completion, building a productive learning environment for all learners.
  3. Understanding classical conditioning helps teachers address emotional responses and create a positive learning environment. Ivan Pavlov's pioneering research (Pavlov, 1927) revealed how learners can form involuntary associations between stimuli and emotional reactions. Teachers can consciously pair positive experiences with learning activities, helping to mitigate anxiety and build favourable attitudes towards school and specific subjects.
  4. Social learning theory extends behaviourist principles by highlighting the critical role of observational learning and modelling in learners' development. Albert Bandura's work (Bandura, 1977) showed that learners learn not just through direct reinforcement, but also by observing the behaviours and consequences experienced by others. Teachers serve as powerful models, and by demonstrating desired actions and attitudes, they can significantly influence learners' social and academic learning.
#theory-of-behaviorism-in-learning-podcast-widget .dp-meta{font-size:0.8rem;colour:#565E73}@media(max-width:600px){#theory-of-behaviorism-in-learning-podcast-widget .dp-container{padding:1.25rem}#theory-of-behaviorism-in-learning-podcast-widget .dp-header{flex-direction:column;text-align:centre}}
◆ Structural Learning
Behaviourism in Education: Pavlov, Skinner and Thorndike Explained
A deep-dive audio episode

A 20-minute deep-dive episode on Behaviourism in Education: Pavlov, Skinner and Thorndike Explained, voiced by Structural Learning. Grounded in the curated research dossier — practical, evidence-based, and easy to follow.

Framework showing four principles for effective behaviorist practise in education: Tailor Reinforcement, Combine Conditioning, Prevent Traps, and Sustain Systems.
Effective Behaviorist Practise

Core Principles of Educational Behaviourism

  1. Why Rewards Sometimes Backfire: Discover the hidden psychology behind why star charts work brilliantly for some learners but completely demotivate others
  2. Beyond the Carrot and Stick: Learn how combining classical and operant conditioning creates behaviour strategies that actually stick, not just temporary compliance
  3. The Pavlovian Classroom Trap: Uncover how you're accidentally conditioning anxiety responses in learners and simple switches to build confidence instead
  4. Behaviour Systems That Last: Why understanding behaviourism helps you design reward systems that worklong-term, especially for learners with SEND

Ready for a closer look? This overview covers behaviourism as a whole. For detailed classroom strategies, see our focussed guides to Skinner's operant conditioning and Pavlov's classical conditioning.

Watson (1913) and Skinner (1953) studied what learners do. They did not analyse learners' inner thoughts. Behaviourism uses conditioning to explain learner actions. Pavlov (1927) and Thorndike (1911) showed reinforcement and stimulus-response work.

Behaviourism vs Cognitive Theory

Behaviourism and cognitive learning theory offer different ways to explain learning. Behaviourism focuses on actions we can see. Cognitive theory looks at mental processes. These theories explain how learners gain knowledge (Skinner, 1936; Piaget, 1950; Vygotsky, 1978). Teachers use the differences between them to plan better lessons.

Aspect Behaviourism Cognitivism Constructivism
Definition Learning through observable behaviour changes via reinforcement and conditioning Learning through internal mental processes like memory, thinking, and problem-solving Learning by actively building knowledge through experience and social interaction
Classroom Application Reward systems, behaviour charts, direct instruction, programmed learning Graphic organisers, chunking information, cognitive load theory, memory strategies Project-based learning, group work, discovery learning, hands-on activities
Teacher's Role Director and controller who shapes behaviour through consequences Information presenter who structures content for optimal mental processing Facilitator and guide who supports learner-led discovery
Assessment Focus Observable performance and behaviour change measurement Testing knowledge retention, understanding, and cognitive skills Portfolio assessment, peer evaluation, and self-reflection
Learner Interaction Individual focus with minimal peer interaction required Mix of individual and group work to support cognitive processing Heavy emphasis on collaborative learning and social construction
Best Used For Behaviour management, basic skill acquisition, SEND support, routine establishment Content delivery, exam preparation, complex concept explanation, study skills Creative subjects, critical thinking development, real-world problem solving

Behaviourism manages actions and builds skills. Cognitivism explains how learners take in new details (Woolfolk, 2016). Constructivism argues that learners build their own knowledge actively (Bruner, 1990; Piaget, 1972; Vygotsky, 1978). Teachers mix these theories in the classroom. They do this by thinking about what their learners need and aim to achieve.

Skinner (1974) showed that external factors shape how learners behave. Pavlov (1927), Thorndike (1911), and Watson (1913) developed behaviour modification techniques. Teachers can use these techniques to help learners.

◆ Structural Learning
Conditioning, Consequences, and the Classroom: Behaviourism Explained
A deep-dive podcast

Behaviourism, from Watson and Pavlov to Skinner, shapes learning. It uses stimulus-response techniques (Watson, Pavlov, Skinner). This podcast shows its core ideas and how they impact UK teaching today.

Types of Behaviourism

Watson (1913) and Skinner (1945) studied actions using two approaches. They used methodological and radical behaviourism. These two approaches differ from each other. These differences often lead to some criticisms.

Infographic comparing classical and operant conditioning methods in educational settings
Classical vs Operant Conditioning

Comparison table showing differences between classical and operant conditioning in classrooms
Side-by-side comparison table: Classical vs Operant Conditioning in Education

It avoids internal mental states. Radical behaviourism, from B.F. Skinner, goes further. (Skinner, 1953). It accepts that internal events matter, but only if observable. Skinner thinks the environment shapes how a learner behaves. (Skinner, 1974). Both approaches focus on changing behaviour through conditioning. Researchers like Pavlov showed how associations affect learning. (Pavlov, 1927).

Watson (1913) and Skinner (1938) studied behaviourism without looking at how learners think. They watched how actions changed when faced with new triggers. They used tests and careful viewing to explain these actions. The ideas of classical and operant conditioning explain this process (Watson, 1913; Skinner, 1938).

Side-by-side comparison of classical conditioning versus operant conditioning methods in educational settings
Classical vs Operant

Skinner (various dates) studied actions and thoughts using behaviourism. He said that events and beliefs caused learner behaviour. Researchers (Skinner, various dates) looked at these experiences fairly. The study of behaviour included learner thoughts and feelings.

Some say behaviourism simplifies actions. Skinner (1974) and Watson (1913) explained actions by behaviours. Others find this ignores learner biology and genetics.

Watson (1913) stated behaviourism studies only observable actions. Skinner (1953) claimed it also considers a learner's thoughts.

Behaviouristic theory of learning
Behaviouristic theory of learning

History of Behaviourism in Education

Behaviorism is a learning theory that focuses on observable behaviour and the relationship between stimuli and responses. It began to develop in the early 20th century and was influenced by the work of several key figure s.

Ivan Pavlov, a Russian physiologist, is renowned for his experiments on classical conditioning. He discovered that dogs could be conditioned to associate a neutral stimulus, such as the ringing of a bell, with an unconditioned stimulus, such as food. This led to the creation of what is known as Pavlovian conditioning, demonstrating the power of conditioning in shaping behaviour.

Thorndike (date unspecified) proposed the law of effect. Learners repeat actions with pleasant results. Unpleasant results make learners less likely to repeat actions. This idea formed the basis for operant conditioning.

John B. Watson, an influential American psychologist, is considered the founder of behaviorism. He emphasised the importance of studying observable behaviour and rejected the study of internal mental processes. Watson believed that all behaviour is learned, and he aimed to explain how it could be understood and controlled.

Skinner built on Watson's work. He created operant conditioning. Results shape how a learner acts. Rewards or punishments change how likely a behaviour is. Skinner used schedules and a 'Skinner box' (Skinner, dates unknown). These fixed the core ideas of behaviourism.

Behaviorism in learning has a rich history shaped by the contributions of Ivan Pavlov, Edward Thorndike, John B. Watson, and B.F. Skinner. Their work laid the groundwork for understanding how behaviour is learned and influenced by external factors.

Chomsky's Critique and the Cognitive Revolution

The biggest challenge to behaviourism came from a linguist, not a psychologist. In 1959, Noam Chomsky wrote a long review of Skinner's 1957 book, Verbal Behaviour. In this book, Skinner tried to explain how we learn language through operant conditioning. He said past rewards shape our words. He also claimed sentences were just chains of learned reactions. Chomsky (1959) argued strongly that this idea made no sense. He noted that speakers produce and understand sentences...

Tolman (1948) proved rats form cognitive maps in mazes, not just responses. Unrewarded learners explored, then found food efficiently, showing latent learning. This learning happened without rewards or obvious behaviour. Tolman's work challenged behaviourism's focus on observable, reinforced acquisition.

Bandura's Bobo doll studies challenged behaviourism. Bandura (1961) showed learners copied aggressive acts they saw adults do. This happened without reward. Imitation occurred spontaneously, even in new settings. Learners could learn by watching, Bandura said. His social learning theory bridges behaviourism and cognitivism.

Cognitivism came after behaviourism in the 1960s. Skinner's behaviour management uses praise and consequences. Explicit instruction breaks down content clearly (Skinner, 1954). Thorndike (1911) and Skinner (1953) showed environments shape learner behaviour. Teachers can design good learning environments for learners.

Pavlov and Classical Conditioning

Classical conditioning is a learning process in which classroom cues become linked with emotional and behavioural responses. Entering a test room might cause anxiety. Teachers can pair hard topics with nice experiences. Play calming music or use scents to boost learner engagement and lower stress. (e.g. Watson & Rayner, 1920)

Classical conditioning is a basic form of learning. A learner links a neutral trigger with a meaningful one. Over time, the learner reacts to the once-neutral trigger. Ivan Pavlov first described this learning in the early 1900s. He discovered it through his famous experiments with dogs.

Pavlov's work is key to understanding behaviour (Pavlov, 1927). Classical conditioning shapes both simple and complex learner actions. This applies across species, including people (Watson, 1913; Skinner, 1936).

Pavlov (1927) showed classical conditioning links stimulus and response, shaping learner behaviour. Skinner (1936) found learners change behaviours through environmental factors. Watson (1913)'s insights help teachers build new actions and stop unwanted ones.

Through this introduction, we will further explore this essential concept in psychology and its applications in various aspects of our lives.

 

Pavlov's Classical Conditioning Experiments Explained

Pavlov’s experiments (date not provided) showed classical conditioning and aided behaviourism. Learners link neutral things with meaningful things. This creates an automatic response (Pavlov, date not provided).

Pavlov (date unknown) saw dogs salivate at food. Food (unconditioned stimulus) made them salivate (unconditioned response). He rang a bell before food. Soon, the dogs linked the bell with food. The bell (conditioned stimulus) then caused salivation (conditioned response).

Experiments showed learners make stimulus-response connections. Pavlov's work supported this model (stimuli cause responses). His research proved learners gain responses through association, not just reflexes (Pavlov, date).

Pavlov's work looks at how learners act. He showed that the environment shapes learners (Pavlov, dates not provided). Studies prove that conditioning strengthens habits. Pavlov showed that outside triggers change how learners behave.

Pavlov's research (dates missing) shaped behaviourism by showing how reflexes form. Learners associate stimuli, as Pavlov demonstrated in his work. This built behaviourism's core stimulus-response model.

Behaviorism
Behaviorism

Classical Conditioning in the Classroom

Skinner (1957) and Pavlov (1927) showed us behavioural learning ideas. Teachers can use these ideas in the classroom. Consider positive reinforcement techniques, like praise (Thorndike, 1911). Shaping behaviour works by rewarding successive approximations (Skinner, 1938).

Firstly, creating the right environment is important. This involves using a conditioned stimulus, which is a stimulus that produces a specific response when paired with a specific behaviour. For example, a teacher can use a bell as a conditioned stimulus to signal the start of a learning activity, conditioning the learners to associate the bell with focussed attention and engagement.

Self-directed learning builds learner independence as they manage their learning. Gamification uses game features, such as rewards, to engage learners. Deci and Ryan (1985) advocate for self-determination. Kapp (2012) found that gamification can be helpful.

Active learning engages learners directly with content. Tasks, discussions, and problem-solving improve learning. Learners understand concepts and retain knowledge better through active participation.

Social learning works by encouraging learners to interact (Bandura, 1977). Group work lets learners learn from each other (Vygotsky, 1978). Peer teaching builds communication skills and idea sharing (Slavin, 1990). Cooperative tasks help with teamwork.

This approach, supported by researchers like Pavlov and Skinner, helps shape learner behaviour. Teachers can use these ideas to boost learning (Thorndike, 1911). A positive classroom environment improves learner outcomes (Bandura, 1977).

Limitations of Classical Conditioning

Classical conditioning connects stimulus and response but has limits. (Pavlov, 1927). Learners often choose their actions in education. This reduces the impact of classical conditioning (Skinner, 1936; Thorndike, 1911).

Rescorla (1988) showed learners associate things, but this simplifies behaviour. Education needs more than simple association. Learners need critical thinking and problem-solving skills. Conditioning alone is not a full explanation (Thorndike, 1911; Skinner, 1953).

Pavlov (1927) showed classical conditioning may not suit every learner. Learners have different strengths and interests, affecting learning. Skinner (1938) and Piaget (1936) offer wider options. Consider operant conditioning or cognitive approaches for diverse needs.

Classical conditioning has limits in education. It focuses on involuntary actions, which simplifies how learners learn. Individual differences are not explained well (Researchers, date). Teachers should use broader theories to improve their teaching and support learning.

Behaviorism Key Figures
Behaviorism Key Figures

Stimulus Generalisation and Discrimination in the Classroom

Skinner (1936) showed stimulus generalisation and discrimination explain behaviourism. Teachers can use these ideas to see why some interventions fail. This helps them create tasks to support learning transfer (Thorndike, 1903).

Learners respond to similar stimuli (Pavlov, 1927). Pavlov's (1927) dogs salivated to similar tones. Learners anxious in maths tests might fear all number work. Learners feeling safe with a calm teacher engage positively with similar adults.

Learners show stimulus discrimination when they respond to a conditioned stimulus, not similar ones (Rescorla & Wagner, 1972). Discrimination happens when one stimulus gains reinforcement, but others don't (Pavlov, 1927). For example, learners distinguish between triangles or verb tenses. Teachers present stimuli, reinforce correct choices, and sharpen the difference (Skinner, 1953).

Rewards can work in class but not at home because the learner sees contexts differently. If an intervention only works with one teacher, learners respond to teacher cues. Stokes and Baer (1977) said to "programme" transfer, not just hope for it. Vary settings, people, and materials when teaching. Therefore, rehearse behaviours across many contexts, not just the first teaching context.

Concept Definition Classroom Example
Stimulus Generalisation Responding to stimuli similar to, but not identical with, the original conditioned stimulus Learner conditioned to feel safe in one calm classroom transfers that calm response to other orderly environments
Stimulus Discrimination Responding to the conditioned stimulus but not to similar stimuli that have never been reinforced Learner learns to identify an isosceles triangle as distinct from a scalene triangle through repeated contrasting examples
Generalisation Failure A conditioned response that remains specific to the original context rather than transferring Learner behaves well only in the presence of the teacher who implemented the reward programme

Skinner and Operant Conditioning

Operant conditioning is a behaviourist process that changes classroom behaviour through reinforcement, consequences, and consistent adult responses. Classrooms use it with token economies (Skinner, 1938). Teachers reinforce good behaviour with praise or rewards. They ignore bad behaviour or use consequences. Consistency and quick responses are vital (Thorndike, 1911; Pavlov, 1927). Reduce reinforcement over time for lasting change (Skinner, 1953).

Operant conditioning is a type of learning that focuses on how an individual's behaviour is influenced by the consequences of their actions. This theory suggests that behaviours can be reinforced or diminished through either positive or negative reinforcement, as well as punishment.

Skinner (1953) said praise rewards good learner behaviour. Negative reinforcement removes something disliked. Punishment reduces poor actions, adding something unpleasant or removing something liked. Learners link actions with results, which changes how they behave (Thorndike, 1911; Pavlov, 1927).

Skinner (1948) said operant conditioning shapes what learners do each day. Pavlov (1927) demonstrated useful applications for education and animal training. Teachers use these ideas to guide learners.

 

Skinner's Operant Conditioning Theory

Skinner (date not provided) believed behaviour comes from external factors. His behaviourism theory says we learn through observation (Skinner, date not provided). Skinner (date not provided) focused on studying measurable behaviour in learners.

Skinner said reinforcement is key. Rewards encourage learners to repeat actions. Punishment makes learners less likely to repeat behaviours. Skinner's work influenced how we change behaviour.

Skinner's behaviourism (various dates) still shapes teaching. He argued that teachers must reward learners for positive actions. This approach should increase task completion and engagement. Teachers use this to build supportive classrooms. This helps all learners to achieve their goals (Skinner).

Skinner's behaviourism (various dates) shaped education through the use of reinforcement. He studied how learners act and receive rewards in classrooms. These theories help teachers understand learner actions in a practical way.

Behaviourist theory of a learning process
Behaviourist theory of a learning process

Positive Reinforcement

Skinner (1953) found that rewards motivate learners. They also increase good behaviour. His work shows that positive rewards make learners take part more often. This method boosts learner focus and involvement in education.

One of the main benefits of positive reinforcement in education is that it creates a positive and supportive learning environment. When learners receive recognition for their efforts, they feel valued, encouraged, and more motivated to engage in the desired behaviours. This enhances their self-esteem and confidence, developing a growth mindset and leading to improved learning outcomes.

Rewards motivate learners and reinforce behaviour, say researchers (e.g. Skinner). Teachers can use stickers or praise. Giving rewards creates positive links to behaviours. This makes learners repeat those actions, research shows.

Teachers, define behaviours and tell learners expectations clearly. Learners need consistent recognition for their efforts. Tailor rewards to fit each learner's interests. This will make them more meaningful, as suggested by researchers (e.g., Skinner, 1974; Bandura, 1977).

Skinner's work (1938) showed that positive reinforcement motivates learners. Rewarding desired actions, as suggested by Pavlov (1927), builds good habits. This approach, detailed by Thorndike (1911), boosts engagement and achievement for all learners.

behavioural theory of learning
behavioural theory of learning

The Overjustification Effect: When Rewards Undermine Motivation

Reward systems can backfire. The overjustification effect occurs when external rewards reduce a learner's intrinsic motivation to perform a behaviour they previously enjoyed (Lepper et al., 1973). In the original study, children who liked drawing were given a "Good Player" certificate for drawing. Afterwards, they drew significantly less during free play than children who received no reward. The external reward had replaced the internal motivation.

This effect is explained by Self-Determination Theory (Deci and Ryan, 1985), which identifies three psychological needs: autonomy (feeling in control), competence (feeling capable), and relatedness (feeling connected). When rewards are perceived as controlling ("You must do X to earn Y"), they undermine autonomy. When rewards signal that the task is inherently unpleasant ("I need to bribe you to read"), they reduce intrinsic interest. The critical distinction is between informational rewards ("Your writing showed real improvement") and controlling rewards ("You get a sticker for writing 200 words").

A Year 5 reading programme illustrates this. The teacher introduced a sticker chart: learners earned stickers for every book completed. Initially, reading increased. After six weeks, the teacher removed the chart. Voluntary reading dropped below pre-programme levels. The stickers had shifted motivation from "I enjoy stories" to "I earn stickers." Learners who had never received stickers continued reading at the same rate, confirming that the reward, not the activity, caused the decline.

Rewards are useful if you use them carefully. Surprise rewards do not lower learner drive as they are unexpected. Praise that is specific helps learners feel in control (Deci et al., 1999). Use tokens for tasks learners dislike, not those enjoyed (Ryan & Deci, 2000; Lepper et al., 1973).

Negative Reinforcement

Learners repeat behaviours if removing something unpleasant follows them. This, according to Skinner (1953), strengthens their actions. In education, negative reinforcement can have value (Chance, 2009).

Negative reinforcement helps learners avoid unpleasant situations. Actions that remove a negative outcome grow stronger (Skinner, 1953). Learners prevent discomfort by acting early. For example, doing homework on time avoids after-school sessions (Pavlov, 1927).

Additionally, negative reinforcement can increase motivation and persistence. When learners realise that their efforts to escape an aversive situation are successful, they are more likely to repeat those efforts in the future. This can lead to increased motivation to engage in desired behaviours and a greater sense of persistence when faced with challenges.

Negative reinforcement may lower learner anxiety in education. Reinforce coping behaviours, so learners seek help (Skinner, 1938). This support helps learners feel less stressed, improving the learning environment (Pavlov, 1927; Watson, 1913).

Negative reinforcement helps learners avoid what they dislike. This, according to Skinner (1953), increases learner drive and work ethic. Applying this idea, as suggested by Pavlov (1927), builds a calmer classroom. Bandura (1977) showed it can reduce stress and anxiety.

 

Positive Punishment

Skinner (1953) found positive punishment discourages unwanted actions using negative results. Behaviours linked to unpleasantness are less likely to be repeated by the learner.

The effects of positive punishment can be twofold. First, it serves as a deterrent by creating an aversive experience that individuals want to avoid. For example, a learner who consistently disrupts the class may be given extra homework or be made to stay after school. By experiencing these negative consequences, the learner may be less likely to repeat their significant behaviour.

Applying negative consequences can help learners understand the impact of their actions. (Skinner, 1938) This immediate link helps learners connect behaviour to undesirable results. (Thorndike, 1932) This boosts understanding of cause and effect. (Pavlov, 1927) It also encourages better choices. (Bandura, 1977)

Positive punishment can reduce learner motivation. Applying it poorly creates a negative classroom climate. This damages motivation and self-esteem, causing frustration. Learners show less drive and more misbehaviour (Skinner, 1938; Bandura, 1977).

Skinner (1938) and Thorndike (1932) showed learners benefit from positive reinforcement and rules. Support from teachers and parents helps learners understand positive punishment's effect. Bandura (1977) stated communication aids learning.

Negative consequences can reduce unwanted behaviour. Use them with positive support to manage learners. This helps protect motivation, self-esteem, and well-being (Skinner, 1938; Thorndike, 1932). Positive methods should be used more frequently.

 

Negative Punishment

Taking away something learners like can stop bad behaviour (Skinner, 1953). Behaviourists call this negative punishment. Behaviourism studies how settings change what learners do (Thorndike, 1911). Negative punishment alters actions in this exact way (Pavlov, 1927; Watson, 1913).

Negative punishment involves the removal of a desired stimulus as a consequence of engaging in a certain behaviour. This leads to a decrease in the frequency of the behaviour in future instances. For example, let's imagine a child repeatedly interrupts their sibling during playtime.

To address this behaviour using negative punishment, the parent can remove the child from the play area whenever they interrupt. By doing so, the child experiences the removal of the desired stimulus, which is the opportunity to play with their sibling. As a result, the child learns that their interrupting behaviour results in the loss of the enjoyable activity, and they are more likely to refrain from interrupting in the future.

Negative punishment teaches learners about behaviour consequences. It removes something valued, showing learners actions have results. This can reduce unwanted actions and increase positive ones (Skinner, 1953; Thorndike, 1932).

Negative punishment, in behaviourism, removes something liked to reduce unwanted actions. This helps learners understand better choices (Skinner, 1938; Thorndike, 1911). Learners then behave as society expects (Pavlov, 1927; Watson, 1913).

 

Schedules of Reinforcement: Fixed and Variable

Skinner (1938) showed reinforcement affects how learners respond. These schedules also affect behaviour even when rewards stop. Teachers can use this to create good classroom strategies. It also helps create useful reward systems.

Skinner found four key reinforcement schedules. Fixed-ratio (FR) rewards learners after a set response number; a badge after five tasks. These schedules create high response rates, but responding briefly pauses after reward. Variable-ratio (VR) rewards learners after varying response numbers. These schedules create the highest response rates and are hard to extinguish. Think slot machines; unpredictable praise motivates learners more than regular praise (Skinner).

Fixed-interval schedules give rewards after a set time (Skinner, 1938). This creates a "scallop" pattern: slow start, then faster work. Cramming before tests shows this (Critchfield & Reed, 2019). Variable-interval schedules reward at random times (Ferster & Skinner, 1957). These schedules cause consistent effort and resist stopping. Surprise quizzes do this; learners prepare steadily (Epstein et al., 1980).

Reinforce every correct learner response to embed the skill. Once learnt, intermittently reinforce to boost retention. Quickly stopping all reinforcement causes extinction (Ferster and Skinner, 1957). Vary reinforcement gradually for better outcomes.

Schedule Reinforcement Rule Response Pattern Classroom Application
Fixed Ratio (FR) After every N responses High rate; post-reinforcement pause Merit badge after every 5 completed tasks
Variable Ratio (VR) After an unpredictable number of responses Very high, consistent rate; highly resistant to extinction Random spot-praise during independent work
Fixed Interval (FI) First response after a fixed time period Scallop pattern: low rate, then acceleration near deadline End-of-term assessment drives last-minute revision
Variable Interval (VI) First response after an unpredictable time period Steady, moderate rate; moderately resistant to extinction Unannounced low-stakes quizzes promote consistent preparation

Programmed Instruction and the Teaching Machine Tradition

Programmed instruction applied Skinner's ideas to classrooms. Reinforcement shapes learner behaviour (Skinner, 1954). Instruction gave positive feedback at each learning step. This ensured reliable progress through the material.

The historical roots of the movement pre-date Skinner. Sidney Pressey (1926) designed an early mechanical testing device that could present multiple-choice questions and immediately confirm or correct a learner's answer. Pressey's machine was pedagogically limited: it tested recall rather than teaching new material. Skinner's (1958) paper 'Teaching Machines', published in Science, restated the ambition on a firmer theoretical foundation. His machines presented content in small, carefully sequenced frames. The learner read a frame, produced a response, and then immediately checked it against the correct answer. Correct responses served as reinforcers; incorrect ones prompted review of earlier material before the sequence continued. The critical principle was that the programme was constructed so that most learners would respond correctly most of the time, keeping the reinforcement schedule dense and the error rate low.

Norman Crowder (1960) introduced a competing model called branching programmes. Rather than moving all learners through an identical linear sequence, Crowder's programmes diagnosed errors and routed learners to different remedial or enrichment frames depending on their responses. A learner who chose a wrong answer would be directed to an explanation of why that answer was incorrect before being returned to the main sequence. Crowder argued that errors were informative rather than merely failures to avoid, and that a programme which never branched was not genuinely responsive to the learner.

Teaching machines faded in the 1970s as focus moved to thought processes. Yet, their impact remains in today's edtech. Direct instruction uses small steps for learner success. Adaptive platforms and spaced repetition, like Crowder's (1960) branching programmes, adjust to learner performance. Instruction should respond to the learner's actions, a key idea from behaviourism.

Task Analysis and Chaining in the Classroom

Task analysis is the behaviourist method of breaking complex behaviours into discrete, observable steps that can be taught and reinforced individually (Alberto and Troutman, 2013). Rather than instructing a learner to "write a paragraph," task analysis identifies each component: pick up pencil, write a capital letter, form the first word, leave a finger space, continue to end of line, start next line, write a full stop, re-read. Each step is taught until mastery, then chained together into the full sequence.

Two chaining methods are used in classrooms. Forwards chaining teaches the first step, reinforces it, then adds the second step. Backwards chaining starts with the final step and works backwards. Backwards chaining is particularly effective because the learner experiences success (completing the whole task) from the first session. A Reception teacher teaching a learner to write their name using backwards chaining writes "SOPH" and asks the learner to add the final "A." Once the learner reliably writes "A" at the end, the teacher writes "SOP" and the learner completes "HA." Each session, the learner writes more of the name independently, always finishing with a complete, correct result.

Task analysis differs from scaffolding in an important way. Scaffolding is a constructivist concept that involves providing temporary support during a complex task. Task analysis is a behaviourist concept that permanently breaks the task into components, teaches each component to fluency, and builds the full behaviour from mastered parts. Scaffolding assumes the learner can do the whole task with support; task analysis assumes the learner must master parts before assembling the whole. Both have a place in the classroom, but they rest on different theoretical foundations.

ADDIE and Behaviourist Instructional Design

Behaviourism shaped learning plans and teaching approaches. The ADDIE model guides training programmes (Smith & Ragan, 2005; Dick & Carey, 2009). This model links to behaviourist ideas.

The Analysis phase identifies gaps between current and desired learner behaviour. This assumes learning is measurable, based on behaviourism. The Design phase sets behavioural learning objectives, as Ralph Tyler (1949) suggested. Objectives should describe what learners can *do*, not just what they know. Robert Gagné (1965) said different learning types need different teaching. Break complex skills into smaller parts before teaching them.

Programmed instruction guides development. Content flows simple to complex, giving instant feedback. Learners move on after showing mastery. Evaluation checks for target behaviour at criterion level. This directly uses Skinner's (n.d.) focus on measurable outcomes.

Mager (1962) made teaching goals popular. These goals start with 'learners will be able to...'. This format shows behaviourist ideas for teaching. Performance, conditions, and standards link to operant conditioning. This history explains why the framework still matters today. It lasts even with constructivist teaching methods.

How Social Learning Theory Extends Behaviourism

Social learning theory describes how learners develop behaviours, strategies, and attitudes by observing, copying, and responding to others. Teacher modelling is therefore useful. Teachers can show problem-solving (Bandura, 1986). They can use exemplar work. Peer tutoring lets learners watch classmates' strategies (Vygotsky, 1978).

Observational learning, also known as modelling, is a powerful form of learning in which individuals acquire new knowledge and skills by observing others. Rather than relying solely on their own experiences, individuals can learn by watching the actions, behaviours, and outcomes of others.

Social Learning Theory (Bandura, 1977) shows learners learn by observing others. Mimicking actions lets learners gain skills quicker. This helps them adapt to surroundings (Bandura, 1977). Learners also learn from successes and mistakes (Bandura, 1977).

Bandura (1977) showed observational learning affects learners' skills and behaviour. Kelman (1961) and Vygotsky (1978) proved it helps with socialisation. We can improve education by understanding how learners learn.

Behavioural learning theory
Behavioural learning theory

Bandura's Modelling and Observational Learning Theory

Albert Bandura ran many studies on modelling and copying. He focused on how watching others affects learning and actions. His Bobo doll study is a key example. In this study, children watched an adult play with a Bobo doll. The adult acted in either an aggressive or calm way.

Bandura (various dates) showed learners copy others. Learners watch and then imitate, called modelling. The Bobo doll study divided learners into groups. Some saw adults act aggressively, others non-aggressively. Some learners had no adult role model.

Bandura's 1961 Bobo doll experiment showed kids imitated adult actions. Learners who saw aggressive adults were more aggressive (Bandura, 1961). Learners exposed to calm adults showed less aggression (Bandura, 1961).

Bandura (dates) showed learners change behaviour by watching and copying others. Learners pick up behaviours when observing people. They also learn likely results (Bandura, dates). Social models greatly affect learner behaviour.

Bandura's work (dates omitted) shows media and society shape learner actions. Seeing good role models can grow positive behaviour. Witnessing aggression may cause learners to copy it.

Question 1 of 12
According to Baer, Wolf, and Risley (1968), which criterion must a behaviour-change programme meet to be considered 'analytical'?
AIt must demonstrate a functional relationship between the intervention and the change.
BIt must address behaviours that are significant to society.
CIt must be described with enough precision for another practitioner to replicate it.
DIt must focus primarily on internal cognitive maps and mental states.

How Teachers Apply Behaviourism

Teachers use behaviourism to shape class habits and routines. They do this through rewards and proven strategies. The Journal of Applied Behaviour Analysis offers useful research. The Cambridge Centre for Behavioural Studies shares clear evidence. You can also read texts on PBS and FBA for useful tips.

Skinner (1974) and Pavlov (1927) explored behaviourism's impact on learning. Their research gives different views across education. Thorndike (1911) added further perspectives on learning contexts.

Bandura (dates missing) linked thinking to how learners behave. He showed self-belief is important for learners online. Bandura (dates missing) found self-efficacy impacts how engaged and successful learners are.

Bandura's theory links behaviour and thought. Bandura (date) found learner self-belief boosts online work. It also lifts course success and learning outcomes, Bandura (date) showed.

Skinner (1974) showed reinforcement shapes what learners do. Bandura (1977) found role models impact learner behaviour. Teachers can use these studies to improve classroom management.

Behaviourism helps us understand learning, say researchers (e.g. Skinner, 1974). It is useful across many learning environments. Teachers can use it to shape learners' actions positively (e.g. Pavlov, 1927; Thorndike, 1911).

3. Constructivism: The Career and Technical Education Perspective

Constructivism may suit career learning better than behaviourism, say researchers. Cognitive constructivism, (Piaget, 1972; Vygotsky, 1978), could be a good learning theory. It might work well in technical subjects (Rauner, 2011; Lave & Wenger, 1991).

4. Strategies to support self-directed learning. This is a process to improve human resource development.

This framework combines experience, modelling, threat reduction and persuasion. It aims to improve the adult learner's belief in their own capabilities. It also seeks to build stronger self-leadership skills (Researcher names, dates).

5. This section covers self-efficacy for reading and writing. It looks at the impact of modelling. It also covers goal setting and self-evaluation.

Bandura's theory says self-efficacy affects choices. Learners choose tasks based on their belief (Bandura, date). It also shapes their effort, persistence, and reading/writing success (Bandura, date).

Skinner (1974) and Pavlov (1927) showed behaviourism's impact on learning. Research by Thorndike (1911) and Watson (1913) gives different views on this theory. More studies show its use in varied learning settings.

Using Conditioning in the Classroom

Conditioning in the classroom is the process of linking cues and responses so learners expect predictable outcomes. This process involves pairing a neutral stimulus with an unconditioned stimulus until the neutral stimulus alone triggers a response. In Pavlov's research, dogs learned to salivate at the sound of a bell after it was repeatedly paired with food presentation.

Watson built on the work of Pavlov. He showed fear conditioning in a child known as Little Albert. The child learned to link rats with loud noises. People debate the ethics of Watson's research. However, it suggested that our surroundings shape our behaviour in places like schools.

Teachers unconsciously use classical conditioning principles daily. When you play a specific piece of music during tidy-up time, children eventually begin clearing away at the first notes; the music becomes a conditioned stimulus for the tidying response. Similarly, using a particular hand signal or sound to gain attention creates an automatic response in learners who have learned to associate that cue with the need to stop and listen.

Classical conditioning helps teachers see learner anxieties (e.g., reading). Embarrassment reading aloud can cause fear, (e.g., Watson, 1920). Teachers can recondition responses by pairing challenges with praise. This builds confidence, as explained by Pavlov (1927) and Skinner (1936).

Written by the Structural Learning Research Team

Reviewed by Paul Main. He is the Founder and Educational Consultant at Structural Learning.

Watson's Little Albert Experiment and Methodological Behaviourism

Watson (1913) started the behaviourist movement. He said that psychology must only study actions we can see. He believed that only measured facts count as scientific proof. People call this approach methodological behaviourism. This is different from the views of Skinner. Methodological behaviourism accepted mental states but called them unscientific. In contrast, radical behaviourism completely rejected mental events as true causes.

Watson's most notorious demonstration of classical conditioning principles came in 1920. Working with Rosalie Rayner, Watson conditioned an eleven-month-old infant, known in the literature as Little Albert, to fear a white rat by pairing its appearance with a loud noise (Watson and Rayner, 1920). The infant, initially unafraid of the rat, rapidly associated it with the aversive sound. Watson then showed that the conditioned fear generalised to other white, furry objects, including a rabbit and a fur coat. The study appeared to confirm that emotional responses were learned through environmental pairing rather than arising from innate disposition.

Albert's experiment faces ethical issues. Harris (1979) and Beck, Levinson & Irons (2009) questioned Albert's health and Watson's report. His mother's consent was not adequate. The study would fail modern ethics checks. It shows both conditioned learning's power and research's moral duty.

Watson left academia in 1920 and used conditioning in advertising. He linked products to emotions, a pioneering technique. His 1928 book advised scheduled care for children. Bowlby's later research challenged this. Teachers should remember learning theories from Watson have wider social effects.

Positive Behavioural Interventions and Supports: A Whole-School Framework

Sugai and Horner (2002) created Positive Behavioural Interventions and Supports (PBIS). PBIS uses behaviourist ideas across the whole school. The framework offers early support through operant conditioning (Sugai & Horner, 2002). PBIS is better than strategies used by a single teacher.

This framework uses three support tiers. Tier 1 (universal) covers consistent expectations across the school. Horner et al. (2009) found explicit teaching, plus praise, cuts discipline referrals by 20-60%. Tier 2 (targeted) adds group help, like Check-in Check-out (Hawken & Horner, 2003), if learners struggle. Tier 3 (intensive) uses plans based on assessments for 1-5% of learners.

Tier Target Group Behaviourist Mechanism Example Practise
Tier 1 (Universal) All learners (~80%) Consistent reinforcement of explicitly taught expectations School-wide recognition systems; posted behavioural matrices
Tier 2 (Targeted) At-risk learners (~15%) Increased prompts, antecedent modifications, structured feedback Check-in Check-out; social skills groups; behaviour contracts
Tier 3 (Intensive) High-need learners (~5%) Individualised FBA-informed behaviour support plans Wraparound planning; individualised reinforcement schedules

Bradshaw, Mitchell, and Leaf (2010) found PBIS cut problem behaviour and improved school climate. The study, across 37 schools, was a randomised controlled trial. PBIS informs US behaviour policies and influences UK positive behaviour support, especially in SEN (Bradshaw, Mitchell & Leaf, 2010).

PBIS faces criticism for not tackling internal drives behind behaviour problems. Kohn (1993) warned external rewards can reduce learners' natural motivation. PBIS advocates say the framework encourages fading rewards as behaviour improves. They add that it allows for relational or trauma-aware strategies alongside it.

Gamification as Operant Conditioning

Many educational platforms use behaviourism, knowingly or not. Kahoot uses points and random bonuses to encourage learners (Deterding et al., 2011). ClassDojo awards points for good behaviours, acting like a token system. Duolingo uses streaks to keep learners engaged daily, said Skinner.

Gamified learning platforms keep learners engaged through variable ratio schedules. Unpredictable rewards, like those in slot machines, drive this high response rate (Skinner, 1958). Surprise badges and bonus levels sustain learner attention better than regular rewards. The principle is the same; only delivery has altered.

Teachers should review their EdTech tools using a behaviourist view. They should ask what behaviour the platform truly rewards. ClassDojo claims to reward good behaviour. However, it often gives points just for following rules, like sitting still. This means the platform rewards compliance rather than actual learning. One teacher noticed this issue. They changed their ClassDojo settings to reward thinking skills instead. They gave points when a learner asked a question or shared a different view. They also rewarded learners who explained their choices. The same technology...

Ethical concerns arise when gamification exploits dopamine-driven design to maximise screen time rather than learning outcomes. If a learner spends 40 minutes on a maths app but learns nothing because the reward schedule keeps them clicking through easy questions, the platform serves its own engagement metrics, not the learner's education. Critical evaluation of what is being reinforced, and whether reinforcement serves learning rather than screen time, is essential professional practice.

Discrete Trial Training in the Classroom

DTT is a structured teaching method rooted in Skinner’s work. It involves five parts: instruction, prompt, learner response, consequence, and a pause. Lovaas (1987) found intensive DTT improved IQ and behaviour in young autistic learners. His study showed 47% achieved typical development by age seven.

Lovaas et al. (1981)'s work on shaping informed the methods. We broke down complex skills into small steps. Learners mastered each step via repeated trials. Once reliable, we introduced a new step or generalisation training.

Discrete Trial Training faces criticism. Intensive early Lovaas programmes caused welfare worries. These involved up to 40 hours weekly for young learners. Gresham & MacMillan (1997) noted the original study lacked randomisation. Disability advocates question if this respects autistic learner identity. Modern Applied Behaviour Analysis now uses child-led activities. It also takes place in natural settings.

DTT offers a framework for SEND support, not direct teaching. Teachers can present clear instructions, as DTT suggests (Lovaas, 2003). Immediate feedback is also useful. Prompting hierarchies aid learners with difficulties.

The Premack Principle: Using Preferred Activities as Reinforcers

Premack (1959) found that frequent behaviours reinforce less frequent ones. This became the Premack principle, useful for managing learners. If a learner completes a task, they get access to a preferred activity.

The principle has become so embedded in everyday child-rearing that it is often called 'Grandma's Rule': eat your vegetables and then you can have dessert. In schools, teachers apply the Premack principle whenever they say "finish your written work and then you can have free reading time" or "complete the problem set before choosing your seat activity". The key behaviourist logic is that the reinforcer is not an arbitrary token or external prize but is itself a behaviour that the learner already values, making the reinforcement more natural and sustainable.

Timberlake and Allison (1974) said access restrictions make behaviours reinforcing. They built upon Premack's work. If a learner's access drops below usual, almost anything works as a reward. Teachers should watch what learners choose, not just offer standard rewards, they suggested.

Low-Probability Behaviour Contingency High-Probability Reinforcer
Completing independent writing task Then… Five minutes of free reading
Tidying workstation Then… Choosing a preferred partner activity
Practising times tables for ten minutes Then… Computer-based learning game
Sitting during whole-class instruction Then… Movement break or practical activity

Mace et al. (1988) found behavioural momentum works. Start with quick, easy tasks. Then, present a difficult task. The learner's momentum from earlier success increases compliance. This helps learners who often refuse specific tasks. It avoids triggers for non-compliance.

Assertive Discipline in Classroom Management

Canter and Canter (1976) created Assertive Discipline using behaviourist ideas for classroom management. Teachers should clearly state expectations. Consistently apply consequences and stay calm, they said. Their system uses rising sanctions for misbehaviour, alongside praise for learners who behave.

Canter and Canter's (1992) Assertive Discipline heavily influenced UK schools. The programme uses consistent rewards and consequences to shape learner behaviour. This approach is more reliable than relying on a learner's inner drive, they claimed. A second edition addressed concerns by stressing positive recognition over punishment, said Canter and Canter (1992).

Kohn (1993) critiqued behaviourist systems. 'Punished by Rewards' argued external controls hurt self-regulation and motivation. Deci, Koestner and Ryan's (1999) research showed rewards cut learners' interest. Use behaviour strategies for routines, but avoid relying on them for motivated learning.

Current behaviour frameworks blend clear rules with relational methods (DfE, 2022). These frameworks use both Assertive Discipline and restorative justice. Knowing behaviourism lets teachers use routines and consequences well. It helps them see unmet needs behind behaviour (Skinner, 1974; Bowlby, 1969).

Topic Slides Free Classroom Presentation (.pptx)
Download
What Does the Evidence Say?

Is behaviourist classroom management effective for learner outcomes?

Behaviourist methods improve learner results (g=0.23). Seventy-six studies show that rewards and clear outcomes work well. Researchers (date) found that social and emotional learning helps even more. It boosts both behaviour and school grades.

Consensus Metre N = 5
17
3
● Yes 85% ● No 15% Strong Consensus

Classroom Takeaway

Researchers like Pianta (1999) and Hamre & Pianta (2007) showed strong teacher-learner bonds matter. Focus on connections with learners and build their social skills for better results. Behaviour approaches alone are not as effective (Marzano, 2003).

View 5 key studies

Marzano, Marzano, and Pickering (2003) found classroom management affects learners a lot. Their work, a synthesis of 179 studies, showed this (Marzano, Marzano, & Pickering, 2003). Positive classrooms boost behaviour and learner motivation, they noted.

Korpershoek, H., Harms, T., de Boer, H. (2016) · Review of Educational Research · View study ↗

Can good classroom behaviour management boost learner success in middle school? (63 citations)

Freiberg, H. (2020) · Visible Learning Guide to Learner Achievement · View study ↗

The Incredible Years Teacher Classroom Management Programme Outcomes from a Group Randomized Trial 74 cited

Reinke, W., Herman, K., Dong, N. (2018) · Prevention Science · View study ↗

This is an updated meta-analysis. It looks at the effects of classroom management interventions.

Korpershoek, H., de Boer, H., Mouw, J. (2025) · Review of Educational Research · View study ↗

This paper is about improving learner behaviour in middle schools. It shows the results of a classroom management intervention. It has been cited 36 times.

Wills, H., Caldarella, P., Mason, B. (2019) · Journal of Positive behaviour Interventions · View study ↗

Evidence from peer-reviewed journals. All links to original publishers. Checked 25 Mar 2026.

Digital Behaviourism in EdTech

Digital behaviourism uses EdTech systems to shape learner behaviour. It relies on automated rewards, sanctions and progress signals. Platforms like ClassCharts and Sparx create digital token economies. They turn targets into points and streaks. Coloured status bars shape behaviour through automated tracking. This digital behaviour management feels highly efficient. The instant feedback loops are constant, visible and easy to scale.

The attraction is obvious, but teachers should keep the mechanism clear. Gamification can raise motivation and task completion, yet the overall effects are usually small and uneven across settings, not a magic fix (Sailer and Homner, 2020; Dichev and Dicheva, 2017). The EEF makes the same point in plainer terms: technology itself does not improve learning unless the pedagogy is sound, and homework works best when it is tied to class teaching, offers useful feedback and is not used as a punishment (EEF, 2019; EEF, 2021).

In practice, this means the teacher still matters more than the platform. A Year 8 maths teacher might say, “Open Sparx, check yesterday’s errors, then show me one correction in your book,” using the system for diagnosis rather than public compliance. One learner produces a corrected method and moves on; another sees a red overdue mark, thinks “I’m behind again,” and rushes for completion instead of understanding, which is where algorithmic reinforcement can start to distort the learning task.

This explains why current debate focuses on neurodiversity, anxiety and fairness. The Department for Education offers clear guidance. Behaviour systems should predict triggers and make reasonable adjustments. Schools must look for the root causes of behaviour. This matters deeply when SEND shapes the data (DfE, 2024). Home pressure adds to this challenge. Indeed, 23% of UK parents say maths homework makes them anxious (National Numeracy, 2024). Another 20% report arguments about it. The practical lesson is simple.

Strengths and Limitations of Behaviourism

Behaviourism is a practical approach for building routines, but its narrow focus on observable actions limits its explanatory power. Its strength is practical power for building routines. Its limit is a narrow focus on actions we can see. Thorndike's law of effect and Skinner's work on reinforcement explain a lot. They show why immediate praise, quick correction, and repeated practise improve routines. These routines include entering the room quietly, handing in homework, or recalling number facts. That clarity matters a lot for busy classrooms. It makes expectations clear and keeps consequences consistent.

It is especially effective when the goal is accuracy, fluency or classroom order. A teacher teaching phonics, times tables or handwriting can use short practise bursts, quick feedback and cumulative review to strengthen correct responses, much like Precision Teaching and Direct Instruction emphasise. Token systems can also help some learners stay on task, but only when the target behaviour is specific, the reward is understood and the system is faded once the routine becomes established.

The main criticism is that behaviourism can become too narrow if it treats learning as only what can be seen. Children do not just respond to rewards, they think, interpret and attach meaning to what they are asked to do. Critics such as Bandura showed that modelling and observation matter, while Deci, Koestner and Ryan's review (1999) warned that heavy use of external rewards can weaken intrinsic motivation for tasks learners already find interesting.

In practice, behaviourism works best as one part of a wider teaching approach, not the whole approach. It can secure attention, practise and predictable routines, but it is less suited to explaining creativity, deep understanding or why a learner can perform a step yet still hold a misconception. A sensible classroom use is to teach a routine explicitly, reinforce it consistently, then move learners towards self-regulation through questioning, modelling and purposeful feedback.

Relational and Trauma-Informed Alternatives

Relational and trauma-informed methods view behaviour through relationships and context. They focus on the basic needs of the learner. A relational approach keeps routines, outcomes and high standards. However, it asks staff to look at the root causes of actions. Current guidance in England tells schools to look at mental health and SEND after incidents. They must also look at safeguarding and other factors. It notes this national standard aligns with

For busy teachers, trauma-informed practice is not therapy and it is not an excuse for poor conduct. It means noticing when emotional dysregulation is driving the moment, so reasoning and sanctions alone will not work, and using co-regulation first so the learner can return to thinking. Reviews of school-based trauma-informed work place relationship-building, safety and belonging at the centre, although they also show that implementation needs training and consistency (Wilson-Ching and Berger, 2024).

In practice, a Year 8 learner who shouts, kicks a chair and refuses the starter may not need a louder warning first. The teacher moves closer, lowers their voice, reduces the audience, offers a now-next instruction and says, "You're safe. We'll get calm, then we'll do the first two questions together." A neuro-affirming response also checks whether noise, transition, sensory load or unclear language triggered the behaviour, especially for neurodivergent learners, before deciding what support or consequence is proportionate.

Once the learner is regulated, the work shifts to repair and accountability. A short restorative justice conversation asks what happened, who was affected and what needs putting right; the learner might then complete the task, apologise to a classmate and write a brief repair note for the next lesson. Reviews of restorative practice report better relationships and less exclusionary discipline, although the evidence base is still uneven (Lodi et al., 2021). This wider shift also fits current national work in Scotland and Wales on trauma-informed schools and workforces (Education Scotland, 2024; Welsh Government, 2026).

Thorndike's Law of Effect

Thorndike's Law of Effect describes how behaviours followed by satisfying consequences become more likely to be repeated. In his 1898 puzzle box experiments, and later in Animal Intelligence (1911), he found that responses followed by satisfying consequences were more likely to be repeated. For teachers, this means learning is not shaped by practise alone, but by what happens straight after a learner attempts a task.

In classroom terms, Thorndike's theory explains why specific praise is more effective than general approval. If a learner justifies an answer in history and the teacher responds, "You used evidence from the source to support your point", that precise feedback increases the chance of the behaviour happening again. The same principle supports quick correction in phonics, times tables, spelling, or handwriting, where success needs to be noticed immediately and followed by another opportunity to practise it correctly.

The Law of Effect also helps teachers think carefully about unintended rewards. If calling out gains attention, delays difficult work, or entertains classmates, that behaviour may become more frequent even when adults disapprove of it. A stronger approach is to reward the alternative behaviour you want, such as hands-up contributions, focused independent work, or orderly transitions, and to make those expectations consistent across the school day.

Thorndike's ideas sit underneath much of Skinner's later work, but they still matter because they keep teachers focused on the link between behaviour and consequence. Practical strategies include exit tickets with immediate feedback, a simple token system for sustained effort, and short retrieval practice tasks followed by recognition of accurate responses. The important point is that the reward should be timely, clearly tied to one behaviour, and gradually reduced as the routine becomes secure.

Benefits and Drawbacks of Behaviourism

The benefits and drawbacks of behaviourism are its practical strength in shaping behaviour and its weakness in explaining deeper learning. When expectations are clear, practise is repeated and feedback is immediate, learners are more likely to repeat the right response, which reflects Thorndike's law of effect and Skinner's work on reinforcement. This is especially useful when teachers are building routines, accuracy and confidence in core skills.

In practice, this can be highly effective for everyday teaching. A teacher might rehearse how learners enter the room, begin a retrieval task and move into group work, praising the exact behaviour they want to see. Short phonics review, number facts practise and low-stakes quizzes also fit this model well, because quick correction helps errors from becoming habits.

However, behaviourism has clear limits. It can produce compliance without deep understanding, so a learner may complete the task for a sticker or point but still struggle to explain the idea behind it. Research on motivation has also suggested that when external rewards are overused, especially for activities learners already enjoy, engagement can become tied to the reward rather than the learning itself.

It is also less helpful for some tasks. These include tasks needing discussion, creativity, or independent judgement. Writing about a poem needs more than repetition and reward. Debating a historical source or planning a science test also needs more. In most classrooms, the strongest approach is selective use. Teachers can use behaviourist strategies for routines, fluency, and behaviour support. They can then fade rewards over time. Finally, they can combine them with modelling, questioning, and reflection.

Beyond Compliance: The Trauma-Informed Shift

The trauma-informed shift is an approach that moves behaviour practice beyond simple compliance towards safety, belonging and self-regulation. A zero tolerance approach has limited evidence behind it in English schools, while recent EEF guidance argues for proactive behaviour work that teaches self-regulation and belonging (EEF, 2019; EEF, 2024). Post-pandemic, many learners are not simply refusing to comply, they are struggling to stay regulated enough to learn.

Trauma-informed practice changes how we respond. Staff do not just ask which rule was broken. They also ask what is driving the behaviour. They look for ways to calm the learner while keeping high standards. A relational policy still uses boundaries and outcomes. However, it starts with calm adult support and clear repair. This fits current Ofsted expectations. Ofsted states that behaviour relies on positive relationships, rules and routines (Ofsted, 2025).

In practice, that might sound like this: a Year 8 learner drops their pen, swears and refuses to start after break. Instead of issuing a loud sanction across the room, the teacher moves closer, lowers their voice and says, “You look overloaded. Sit on the side table for one minute, then we will do question 1 together.” The learner is not let off, but they are helped back from dysregulation. They settle, complete the first two retrieval questions, and later take part in a short restorative practice conversation about what happened, who was affected and how to reset the next lesson.

For sceptical teachers, the key point is simple. Trauma-informed practise is not about excusing poor behaviour, and restorative practice is not about endless talk. It is about recognising that consequences land better after regulation than during dysregulation, and that learners are more likely to re-engage with learning and attendance when school feels predictable, respectful and safe (Emerson, 2022; DfE, 2024).

◆ Structural Learning
Behaviourism in Education: Pavlov, Skinner and Thorndike Explained: Quick-Check Quiz
10-question self-test
Q1 of 10
0%

Frequently Asked Questions

Classical vs Operant Conditioning

Research by Pavlov (1927) showed how learners link stimuli with experiences. Play calming music with hard tasks to reduce learner worry. Skinner (1948) proved consequences shape behaviour. Use rewards to boost desired actions; punishments will decrease them.

Using Classical Conditioning to Reduce Anxiety

This approach has shown promise in reducing test anxiety and improving performance. By linking tricky tasks to nice things, learners feel calmer and more involved. For example, teachers can use scents in quiet times or play soft music during tests. This builds good links, helping learners handle tough work without worry.

Why Reward Systems Backfire for Some Students

Deci and Ryan (1985) showed rewards can reduce learners' intrinsic drive. Behaviourist ideas help teachers build effective reward systems for the long term. Teachers should use meaningful reinforcement, not just "carrot and stick" (Skinner, 1938).

Behaviourism for SEND Classrooms

Behaviourist methods use reinforcement and stimulus-response patterns to shape behaviour. These techniques work well for learners with SEND. The methods offer predictable frameworks, according to Skinner (1953). Teachers can adapt the methods for individual needs, per Pavlov (1927) and Thorndike (1911).

Common Teacher Mistakes in Conditioning

Research by researchers like Pavlov (1927) shows pairing stress with subjects can cause anxiety. Teachers may unintentionally do this with tests or corrections. Using red pens or a stern voice can condition learners to anxiety (Skinner, 1953).

Combining Classical and Operant Conditioning

Classical conditioning creates positive learning feelings. Reinforcement shapes learner behaviour (Skinner, 1938). Teachers should address emotions and behaviour. This mix helps learners change long term (Pavlov, 1927).

Using Behaviourism Without Oversimplifying Motivation

Researchers (e.g. Skinner) showed stimulus-response matters. Now we know mental processes are also key (e.g. Bandura, 1977). Use reinforcement wisely. Check learners' feelings and actions. Adapt methods to individual needs (e.g. Vygotsky, 1978). Avoid strict behaviour plans.

Shaping, Cueing, and Prompt Hierarchies

Shaping is the reinforcement of successive approximations towards a target behaviour (Skinner, 1953). Rather than waiting for the complete, correct behaviour to appear (which may never happen spontaneously), the teacher reinforces each step closer to the goal. A teacher shaping "contributing to class discussion" in a shy Year 2 learner might first reinforce making eye contact during carpet time, then reinforce nodding in response to a question, then reinforce whispering an answer to a partner, then reinforce speaking aloud to the class. Each approximation is reinforced until it is reliable, then the criterion shifts to the next step.

Prompts are supplementary stimuli that increase the probability of a correct response. Verbal prompts are spoken cues ("Remember, what comes first?"). Visual prompts are pictures, symbols, or written reminders. Gestural prompts are points, nods, or hand signals. Physical prompts involve hand-over-hand guidance. Prompts are arranged in a hierarchy from most to least intrusive, or vice versa (Wolery et al., 1992).

A most-to-least prompt hierarchy begins with the most supportive prompt (physical guidance) and systematically fades to less intrusive prompts as the learner demonstrates competence. This approach minimises errors and is effective for learners with significant learning difficulties. A least-to-most hierarchy begins with minimal support (a pause, an expectant look) and escalates only if the learner does not respond. This approach maximises independent attempts and is suitable for learners who can attempt the task but need occasional support.

Prompt fading is critical. A prompt that is never withdrawn becomes a permanent crutch. If a teaching assistant always points to the correct answer on a number line, the learner learns to wait for the point rather than to count independently. Systematic fading plans specify when and how prompts will be reduced: after three consecutive correct responses with a verbal prompt, move to a gestural prompt; after three correct with a gesture, move to no prompt. Without this plan, dependence on prompts can become entrenched.

Extinction, Spontaneous Recovery, and Planned Ignoring

Skinner (1953) said extinction happens when reinforcement stops, decreasing behaviour. Teachers use planned ignoring, stopping attention for behaviours. This means a learner's calling out decreases, as it gets no reaction.

Extinction creates a pattern teachers may find worrying. An extinction burst means behaviour gets worse when reinforcement stops. A learner might call out louder or add new actions (banging) when ignored. This usually lasts 3-5 days before the behaviour decreases. Teachers who expect this are less likely to accidentally reinforce bigger problems.

Spontaneous recovery is the reappearance of an extinguished behaviour after a period of non-occurrence, typically after weekends or holidays. A learner whose calling out was successfully extinguished before half-term may return to calling out on the first day back. This is normal and does not mean the strategy failed. Continuing to withhold reinforcement will extinguish the behaviour again, usually more quickly than the first time.

Planned ignoring differs from neglect. Use it for attention seeking, only if safe. Ignore a learner calling out; don't ignore throwing (Carr & Newsom, 1985). Stop other learners reinforcing it. Combine ignoring with reinforcing desired actions (Skinner, 1953): "Thank you, Amir".

Behaviourism and Neurodiversity: The ABA Debate

Lovaas (1987) first created Applied Behaviour Analysis for autistic learners. ABA uses reinforcement and prompting to teach skills. It also reduces unwanted behaviours. Many UK organisations support it. Research shows ABA helps communication, self-care, and school tasks (Lovaas, 1987).

The neurodiversity movement questions ABA's goals. Critics say ABA can make autistic learners seem non-autistic. It may reinforce eye contact and stop stimming behaviours. Masking versus learning is a key debate. Kupferstein's (2018) research found PTSD in adults who had ABA, though methods are questioned.

NICE suggests behavioural methods to help autistic learners develop skills. They don't suggest ABA as a full treatment (NICE guidelines). UK practise often separates older ABA from newer forms. Contemporary ABA focuses on communication and choice, respecting learners (Sundberg & Michael, 2001).

For classroom teachers, the key question is whether a behavioural intervention serves the learner's needs or adult convenience. Teaching a learner to request a break using a visual card is a functional skill that increases autonomy. Requiring a learner to sit still for 45 minutes when they need movement breaks serves classroom management, not the learner. Behaviourist techniques are powerful tools; the ethical responsibility lies in choosing targets that genuinely benefit the learner.

Free Resource Pack

How Operant Conditioning Supports Behaviourism

Key resources to help you understand and use Skinner's ideas in your teaching.

Behaviourism & Operant Conditioning , 4 resources
CPD Visual Quick Reference Guide Classroom Strategy Behaviour Management Checklist Operant Conditioning Skinner Behaviourism

Download your free bundle

Fill in your details below and we'll send the resource pack straight to your inbox.

Quick survey (helps us create better resources)

Think about your own teaching. How confident are you at spotting positive reinforcement? Can you easily spot negative reinforcement and punishment?

Not confident
Slightly confident
Moderately confident
Confident
Very confident

Do your colleagues use operant conditioning often? Does it help them manage their classrooms well?

Never
Rarely
Sometimes
Often
Consistently

Think about how you manage behaviour now. Does it match Skinner's operant conditioning?

Not at all effective
Slightly effective
Moderately effective
Effective
Very effective

Your resource pack is ready

We've also sent a copy to your email. Check your inbox.

Cognitive Science Platform

Making Thinking Visible Beyond Behaviourism

Open a free account today. Help organise the thinking of your learners with proven graphic organisers. These tools lower cognitive load. They also help to build mental schemas in an active way.

Create Free Account No credit card required

Edward Thorndike's early work on learning, particularly his Law of Effect, is foundational to behaviourism. Another key principle he identified was the Law of Exercise. This law states that the more frequently a stimulus-response connection is used, the stronger that connection becomes (Thorndike, 1911). Regular repetition of a behaviour in response to a specific cue reinforces the neural pathways associated with that action.

In the classroom, teachers apply the Law of Exercise when they provide opportunities for repeated practice. For instance, a maths teacher might ask pupils to solve ten similar quadratic equations after demonstrating the method. Each correct solution strengthens the bond between seeing a quadratic equation and applying the correct solution steps, making the process more automatic. Pupils are actively engaging in the desired behaviour, solidifying their understanding through repetition.

Conversely, Thorndike also proposed the Law of Disuse. This principle suggests that connections between a stimulus and a response weaken if they are not used over a period of time (Thorndike, 1911). Just as practice strengthens bonds, a lack of practice causes them to fade, making it harder to recall or perform the learned behaviour. This explains why pupils forget information or skills they do not regularly revisit.

Consider a science lesson where pupils learn about the water cycle. If the teacher does not revisit this topic or provide opportunities for retrieval practice for several weeks, pupils' recall of the stages and processes will diminish. A pupil might initially explain evaporation correctly, but without subsequent practise, they may struggle to articulate it later, demonstrating the weakening effect of disuse. Teachers must actively plan for retention to counteract this natural decay.

Understanding both the Law of Exercise and the Law of Disuse highlights the critical role of deliberate practice and spaced repetition in effective teaching. Regular, distributed practice helps to consolidate learning, preventing the erosion of knowledge over time (Dunlosky et al., 2013). Teachers can implement short, frequent retrieval tasks, such as low-stakes quizzes or 'do now' activities, to reactivate and strengthen learned connections. This approach ensures that pupils continually engage with prior knowledge, making it more robust and accessible.

Furthermore, the quality of practise matters. Active recall, where pupils retrieve information from memory rather than simply re-reading it, is particularly effective at strengthening these connections (Roediger & Karpicke, 2006). A history teacher might ask pupils to write down everything they remember about the causes of World War I without looking at their notes. This active effort to retrieve information reinforces the learning far more than passively reviewing a textbook. By strategically applying these laws, educators can design instruction that promotes lasting learning.

Edward Thorndike's foundational work in educational psychology introduced Connectionism, an overarching theoretical framework that posits learning as the formation of neural bonds or "connections" between stimuli and responses. This was the first systematic account suggesting that learning is a direct result of these associations strengthening or weakening over time. Thorndike's insights laid the groundwork for much of what we understand about habit formation and skill acquisition in educational settings.

Central to Connectionism is the Law of Effect, which states that responses followed by satisfying consequences are more likely to be repeated, while those followed by annoying consequences are less likely. For example, if a pupil correctly solves a maths problem and receives specific praise from the teacher, the connection between the problem type and the successful solution is strengthened. Conversely, if an incorrect answer consistently leads to frustration or negative feedback, the associated response is less likely to recur (Thorndike, 1911).

The Law of Exercise, another key component of Connectionism, emphasises the role of repetition. It comprises two parts: the Law of Use and the Law of Disuse. The Law of Use suggests that the more frequently a stimulus-response connection is practised, the stronger it becomes. For instance, repeatedly practising multiplication tables reinforces the connections between the numbers and their products. The Law of Disuse indicates that connections weaken if they are not regularly used, highlighting the importance of spaced retrieval practice to prevent forgetting.

Thorndike also proposed the Law of Readiness, which explains that connections are more easily formed when the learner is prepared to act. If a pupil is ready to learn a new concept, perhaps because prerequisite knowledge is secure or motivation is high, the learning process is more efficient and effective. A teacher might ensure readiness by reviewing prior learning or setting a clear purpose for the lesson, making pupils receptive to new connections.

In the classroom, Connectionism informs many common practices. When teachers provide immediate feedback, they are applying the Law of Effect, ensuring that correct responses are reinforced swiftly. Structured drills and repeated practice exercises, such as rehearsing vocabulary or grammar rules, directly utilise the Law of Exercise to strengthen desired connections. Furthermore, ensuring pupils have the necessary foundational knowledge before introducing complex topics aligns with the Law of Readiness, preparing them to form new, robust associations.

While behaviourism highlights the power of environmental conditioning, some learning processes are constrained by an organism's biological predispositions. This concept, known as biological preparedness, suggests that certain associations are learned more easily or quickly than others due to evolutionary history. Organisms are not blank slates; their genetic makeup influences what they are ready to learn and how.

A prominent example of biological preparedness is taste aversion, where an organism learns to avoid a food after a single negative experience, even if the illness occurs hours later. This rapid, robust learning stands in contrast to typical classical conditioning, which usually requires repeated pairings and close temporal contiguity between the conditioned stimulus and unconditioned stimulus.

John Garcia's research with rats demonstrated this phenomenon. Garcia and Koelling (1966) exposed rats to "bright noisy" water (a novel taste, light, and sound) and then subjected them to either radiation (causing nausea) or electric shock. They found that rats quickly developed an aversion to the taste of the water when it was followed by nausea, but not to the light or sound. Conversely, rats developed an aversion to the bright noisy water when followed by shock, but not to the taste.

This research indicated that rats were biologically prepared to associate taste with illness and visual/auditory cues with pain. The delay between consuming the food and experiencing sickness, which could be several hours, would typically prevent classical conditioning from occurring effectively. However, the evolutionary advantage of quickly learning to avoid poisonous foods overrides the standard rules of contiguity.

For educators, understanding biological preparedness means recognising that not all learning is equally malleable through reinforcement or punishment. Some behaviours or associations might be inherently more difficult to establish or extinguish because they either align with or contradict an organism's natural predispositions. This perspective adds a crucial nuance to a purely environmental view of learning.

In a classroom, a teacher might observe that while a child quickly learns to associate a particular food with feeling unwell after a single incident, it may take many more repetitions and consistent reinforcement to teach a complex abstract concept. This is not a direct taste aversion, but it illustrates how some learning connections are formed with remarkable speed and resilience, while others require sustained effort. Teachers must consider these inherent differences in learning readiness when designing instruction and behavioural interventions.

Edward Thorndike's early work on animal learning laid foundational principles for behaviourism, particularly through his experiments with Thorndike's Puzzle Box. His research focused on how animals learn through trial and error, leading to his articulation of the Law of Effect (Thorndike, 1898). This law states that responses followed by satisfying consequences are more likely to be repeated, while those followed by annoying consequences are less likely to occur.

The physical apparatus of Thorndike's Puzzle Box was typically a simple wooden crate, approximately 20 inches long, 15 inches wide, and 12 inches high. It featured a door that could be opened by a simple mechanism, such as pulling a loop of string, pressing a lever, or stepping on a pedal. These mechanisms were deliberately straightforward, allowing for a clear connection between a specific action and the outcome of escape.

In his experiments, Thorndike would place a hungry cat inside the puzzle box, with a piece of food visible just outside the door. Initially, the cat would engage in a variety of random behaviours: scratching at the walls, biting the bars, pacing, and attempting to squeeze through openings. Eventually, through accidental movements, the cat would stumble upon the correct mechanism, causing the door to open and allowing it to escape and access the food reward.

Thorndike meticulously recorded the time it took for each cat to escape across multiple trials. He observed that in successive trials, the cats gradually reduced the amount of time spent on ineffective behaviours and became quicker at performing the action that led to escape. This demonstrated a gradual strengthening of the connection between the stimulus (being in the box) and the successful response (activating the mechanism), rather than a sudden insight (Thorndike, 1898).

The implications of Thorndike's Puzzle Box experiments for education are significant. They highlight the importance of practise and the role of satisfying outcomes in strengthening desired behaviours and knowledge. For instance, when a pupil is learning a new mathematical procedure, they might initially try various approaches, some incorrect, until they find one that yields the correct answer. A teacher might say, "Try that method again, you were close," providing an opportunity for repeated attempts. The successful completion of the problem, perhaps followed by positive feedback like "Excellent, you've mastered that step," acts as a satisfying consequence, increasing the likelihood that the pupil will use that correct procedure in the future. This trial-and-error process, guided by feedback and successful outcomes, mirrors the learning observed in Thorndike's experiments.

Beyond basic classical conditioning, learning can extend through a process known as higher-order conditioning, sometimes called second-order conditioning. This occurs when a previously established conditioned stimulus (CS) is paired with a new neutral stimulus (NS). The new neutral stimulus then acquires the ability to elicit the conditioned response (CR) itself, without ever having been directly associated with the original unconditioned stimulus (UCS).

For example, if a bell (CS1) has been conditioned to produce salivation (CR) after being paired with food (UCS), higher-order conditioning might then occur. If a light (NS) is consistently presented immediately before the bell (CS1), the light can eventually become a second-order conditioned stimulus (CS2). The dog would then salivate at the sight of the light alone, even though the light was never directly paired with the food (Pavlov, 1927).

In the classroom, this principle explains how subtle cues become associated with specific learning experiences. Consider a teacher who consistently provides specific, positive verbal feedback, such as "Excellent reasoning in your explanation," which acts as a primary conditioned stimulus (CS1) linked to feelings of accomplishment (CR). This feedback was initially paired with the inherent satisfaction of understanding a concept (UCS).

If the teacher consistently uses a particular coloured pen to mark such excellent work, the coloured pen acts as a new neutral stimulus (NS). Through repeated pairing of the coloured pen with the positive verbal feedback (CS1), the pen can become a higher-order conditioned stimulus (CS2). Pupils may then experience a positive emotional response or increased motivation simply upon seeing their work marked with that specific coloured pen, even before reading the detailed feedback.

This demonstrates how associations build upon one another, creating layers of learned responses. Teachers should recognise that seemingly minor environmental cues or routines can become powerful secondary conditioners, influencing pupil behaviour and affect. Understanding higher-order conditioning helps explain how complex responses develop from simpler, indirect associations.

B. F. Skinner advanced behaviourism by focusing on operant conditioning, which explains how voluntary behaviours are strengthened or weakened by their consequences. Unlike Pavlov's classical conditioning, which deals with involuntary responses, operant conditioning examines how an organism operates on its environment. Skinner argued that learning occurs when a behaviour is followed by a reinforcer or a punisher, making the behaviour more or less likely to recur (Skinner, 1938).

Skinner's primary research tool was the Skinner Box, also known as an Operant Conditioning Chamber. This enclosed apparatus allowed for precise control over an animal's environment and the consequences of its actions. Typically, it contained a lever or a button that an animal, such as a rat or a pigeon, could manipulate. Inside, there was also a mechanism to deliver a reinforcer, like a food pellet or a drop of water, and sometimes a light or speaker to signal specific conditions.

The chamber was designed to automatically record the frequency of the target behaviour, such as lever presses. When the animal performed the desired action, the food dispenser would activate, providing a positive reinforcer. Conversely, the box could also be set up to deliver an aversive stimulus, like a mild electric shock through a grid floor, which could be terminated by the animal's action, demonstrating negative reinforcement. This systematic delivery of consequences allowed Skinner to study how behaviours are acquired and maintained through various schedules of reinforcement (Skinner, 1953).

The principles observed in the Skinner Box translate directly to classroom management and instructional design. Teachers use positive reinforcement when they praise a pupil for completing a task, saying, "Excellent work, Maya, you've clearly explained your reasoning." This increases the likelihood of Maya repeating that behaviour. Similarly, a teacher might use negative reinforcement by removing a disliked task once a pupil demonstrates a desired behaviour, such as allowing a pupil to skip a repetitive exercise after they correctly answer a challenging question.

Shaping, a technique derived from Skinner's work, involves reinforcing successive approximations of a desired behaviour. For instance, a teacher might first praise a pupil for writing a single sentence, then for a paragraph, and finally for a complete essay. Different schedules of reinforcement, like giving praise every time (continuous) or only occasionally (intermittent), also influence how quickly a behaviour is learned and how resistant it is to extinction (Ferster & Skinner, 1957). Understanding these mechanisms helps teachers design effective feedback and reward systems that encourage sustained positive behaviour and academic effort.

Even after a conditioned response appears to have disappeared through extinction, it can suddenly reappear. This phenomenon is known as spontaneous recovery. It refers to the unexpected reappearance of a previously extinguished conditioned response after a period without exposure to the conditioned stimulus. Pavlov (1927) observed this in his experiments, noting that even after dogs stopped salivating to a bell presented without food, a brief period of rest could lead to the return of salivation when the bell was rung again.

For example, a pupil might have developed a conditioned fear response to a specific classroom activity, such as reading aloud, due to a past embarrassing incident. Through consistent positive experiences and careful scaffolding, the teacher might successfully extinguish this fear, with the pupil eventually reading aloud without anxiety. However, after a long school holiday or a period away from that specific activity, the pupil might experience a sudden return of anxiety when asked to read aloud again. This brief resurgence of the old behaviour is spontaneous recovery.

Teachers must recognise that extinguishing an undesirable behaviour or a conditioned emotional response does not always mean it is permanently gone. The underlying association may persist, albeit in a weakened state. Therefore, continued vigilance and reinforcement of the desired behaviour are essential. If a previously extinguished unconventional behaviour, such as calling out, suddenly reappears, it is not necessarily a sign of regression but rather an instance of spontaneous recovery. The teacher should calmly re-apply the strategies that led to its initial extinction, such as ignoring the call-out and praising pupils who raise their hands.

Understanding spontaneous recovery helps teachers avoid frustration when old patterns re-emerge. It highlights the need for consistent application of behavioural strategies over time, particularly after breaks or changes in routine. Instead of viewing it as a failure, teachers can anticipate and prepare for these brief reappearances, reinforcing the desired responses and ensuring the long-term effectiveness of their interventions.

Edward Thorndike, a pivotal figure in behaviourism, initially focused his research on animal intelligence at the turn of the 20th century. His famous puzzle box experiments involved placing cats in enclosures from which they had to escape to receive food. Through these observations, Thorndike formulated the influential Law of Effect, which posits that responses followed by satisfying consequences are more likely to be repeated, while those followed by unpleasant consequences are less likely to occur (Thorndike, 1911).

This foundational work marked a significant historical impact, as Thorndike transitioned from animal studies to applying scientific principles to human learning and educational practice. He became a leading figure in educational psychology, advocating for empirical methods to understand how pupils learn. His emphasis on observable behaviour and the impact of consequences laid groundwork for future behaviourist theories and instructional design.

In the classroom, the Law of Effect is evident in how teachers structure learning activities and feedback. For instance, when a pupil successfully completes a challenging problem and receives specific, positive reinforcement, such as "That's excellent problem-solving, you clearly understood the steps," they are more likely to approach similar tasks with confidence. Conversely, if a pupil's incorrect answer consistently leads to frustration without guidance, they may avoid engaging with that subject.

Classical conditioning, first rigorously studied by Pavlov (1927), describes how learning occurs through association between stimuli. This process involves four fundamental components that explain how a neutral stimulus can come to evoke a specific response. The Unconditioned Stimulus (UCS) is an environmental event that naturally and automatically triggers a reaction without any prior learning; for example, the sudden, loud clang of a fire alarm.

The natural, unlearned reaction to the UCS is the Unconditioned Response (UCR). In the fire alarm scenario, pupils' immediate startle and subsequent evacuation behaviour would be the UCR. A Conditioned Stimulus (CS) is initially a neutral stimulus that, through repeated pairings with the UCS, begins to elicit a learned response.

Eventually, the CS alone triggers this learned reaction, which is then called the Conditioned Response (CR). Consider a school bell (CS) that consistently rings just before pupils are dismissed for lunch (UCS), which naturally causes feelings of hunger and anticipation (UCR). Over time, pupils may start to feel hungry and anticipate lunch (CR) simply upon hearing the bell, even if lunch is not immediately forthcoming.

Understanding these mechanics allows teachers to recognise how classroom environments can inadvertently create associations. For instance, a specific sound or visual cue consistently paired with a positive or negative experience can lead to pupils developing conditioned emotional or behavioural responses to that cue.

Thorndike's Secondary Laws (Law of Exercise and Law of Readiness)

Edward Thorndike proposed several principles of learning, extending beyond his well-known Law of Effect. His secondary laws, the Law of Exercise and the Law of Readiness, offer further insights into how connections between stimuli and responses are formed and modified (Thorndike, 1911).

The Law of Exercise

The Law of Exercise states that connections between a situation and a response are strengthened with practice and weakened with disuse. This principle comprises two sub-laws: the Law of Use and the Law of Disuse.

The Law of Use suggests that the more frequently a response is made to a particular stimulus, the stronger the connection becomes. For instance, when a teacher repeatedly asks pupils to identify prime numbers and they consistently provide correct answers, their ability to recall and apply this concept improves.

Conversely, the Law of Disuse indicates that connections weaken if they are not practised. If pupils learn about historical dates but do not revisit them for several months, their recall accuracy will likely diminish over time.

The Law of Readiness

Thorndike's Law of Readiness explains that learning is most effective when a learner is prepared to act. This readiness influences whether acting or not acting results in satisfaction or annoyance (Thorndike, 1911).

When a pupil is ready to learn a new concept, such as long division, engaging with the task brings satisfaction and facilitates learning. Conversely, if that pupil is ready to learn but is prevented from doing so, they may experience annoyance.

Furthermore, if a pupil is not ready to learn a particular skill, attempting to force the learning will likely result in annoyance and ineffective outcomes. For example, asking a Year 2 pupil to write a complex argumentative essay before they have mastered basic sentence structure will lead to frustration rather than learning.

◆ Structural Learning
Behaviourism in Education: Pavlov, Skinner and Thorndike Explained
Downloadable presentation

Downloadable Structural Learning presentation on Behaviourism in Education: Pavlov, Skinner and Thorndike Explained. Use it to learn the topic at your own pace, or to revisit the key evidence whenever you need a refresh.

Self-pacedEvidence-BasedPractical Examples
Download Slides (.pptx)

PowerPoint format. Compatible with Google Slides and LibreOffice.

Neurodiversity-Affirming Classroom Management

Behaviourist principles, such as clear expectations and consistent reinforcement, can be highly effective when adapted to support neurodivergent learners. This approach focuses on understanding individual needs and creating a predictable, supportive environment that promotes positive behaviour.

It moves beyond simply modifying behaviour to addressing the underlying reasons for certain actions, ensuring strategies are respectful and enabling.

Understanding Individual Needs

Observable behaviours often stem from underlying sensory, communication, or processing differences in neurodivergent pupils. Teachers must investigate the 'why' behind a behaviour rather than solely addressing the 'what'. For instance, fidgeting might indicate a need for sensory input, not defiance.

Individualised support plans, informed by careful observation and pupil input, are crucial. This ensures that behaviour management strategies are tailored to specific needs, moving beyond a one-size-fits-all approach.

Adapting Expectations and Routines

Clear, explicit expectations benefit all pupils, and are particularly vital for neurodivergent learners who thrive on predictability. Visual schedules, social stories, and concrete examples help clarify expected behaviours and transitions (Rosenshine, 2012).

For example, a Year 3 teacher uses a visual timetable with pictures and short text to outline the day's activities and transitions. When a pupil with ADHD struggles with moving between subjects, the teacher points to the visual schedule and uses a timer, providing a predictable structure and pre-warning.

Predictable routines reduce anxiety and provide a sense of control for many neurodivergent pupils. Teachers should pre-warn about changes well in advance and provide strategies for coping with unexpected events, such as a substitute teacher or a fire drill.

Tailoring Reinforcement and Feedback

Positive reinforcement should be specific, immediate, and genuinely motivating for the individual pupil. What is rewarding for one pupil may not be for another, so preferences must be identified and respected.

Feedback on behaviour should be descriptive and focus on the desired action, rather than just the absence of undesired behaviour (Hattie & Timperley, 2007). Offer specific, actionable strategies for improvement rather than generic reprimands.

In a Year 9 science class, a pupil with autism struggles with the noise level during group work. The teacher offers a choice: work at a quieter station with noise-cancelling headphones or contribute to a specific, defined task for a shorter period, then provides positive feedback like, "Thank you for clearly explaining your part of the experiment to your group, that helped everyone understand."

Promoting Self-Regulation

The ultimate goal is to help pupils develop internal strategies for managing their own behaviour and emotions. Teachers can explicitly teach self-regulation techniques, such as identifying triggers, using calming strategies, or requesting a break.

Provide opportunities for pupils to practise these skills with support, gradually fading external prompts as they gain confidence. This builds independence and agency, moving towards sustainable self-management.

Further Reading: Key Research Papers

These peer-reviewed studies provide the research foundation for the strategies discussed in this article:

This study revisits the behaviourist theory of learning. It outlines an Islamic education view. It has 6 citations.

K. M. Arif (2022)

This research looks at the limits of old behaviourist theories from Pavlov, Skinner and Thorndike. It focuses on learner motivation and how we view learners. It urges teachers to look past simple rewards and punishments. It highlights the need for a broader teaching approach. This approach should value a child's inner drive and beliefs.

Paul Main, Founder of Structural Learning
About the Author
Paul Main
Founder & Metacognition Researcher

Paul Main is an educator and metacognition researcher who founded Structural Learning in 2002. With a psychology degree from the University of Sunderland and 22+ years helping schools embed thinking skills, he bridges the gap between educational research and classroom practice. Fellow of the RSA and Chartered College of Teaching, with 128+ Google Scholar citations.

More →

Psychology

Back to Blog