Operant Conditioning

AKA

Instrumental Conditioning
Instrumental Learning
Reinforcement Theory
Type II Conditioning
Type R Conditioning

Focus

Affecting voluntary behaviors by manipulating environmental conditions

Principal Metaphors

Drawn from and/or aligned with principles from physics (Newtonian mechanics):
  • Knowledge is … repertoire of behaviors
  • Knowing is … behaving (triggered by stimuli)
  • Learner is … an organism
  • Learning is … changes in behavior (linking stimuli to responses) due to changes in the environment
  • Teaching is … conditioning (structures of reward and punishment that prompt changes in behavior)

Originated

1930s

Synopsis

Operant Conditioning and Classical Conditioning are two branches of Behaviorisms that are more properly understood as advice on teaching than as theories of learning. Both are concerned with manipulating and managing behaviors controlled by environmental stimuli, although focusing on different categories of behavior and employing different methods. Operant Conditioning is concerned with non-reflexive, voluntary behaviors that are under the learners’ control (an "operant behaviour" is one that produces some sort of effect), and it deals with rewarding or punishing target behaviors to increase or decrease their manifestation. Core constructs of Operant Conditioning include:
  • Discriminative Stimulus – in Operant Conditioning, the trigger – that is, the stimulus to which the organism responds
  • Respondent Behavior – in Operant Conditioning, the organism's behavior (that's triggered by a Discriminative Stimulus and followed by a Reinforcing Stimulus)
  • Reinforcing Stimulus – in Operant Conditioning, the feedback – that is, the stimulus the organism receives after a behavior. Types include:
    • Positive Reinforcement (Positive Reinforcer) – a Reinforcing Stimulus (i.e., something presented after a specific behavior is elicited) that increases the likelihood that behavior will be repeated
    • Negative Reinforcement (Negative Reinforcer) – a Reinforcing Stimulus (i.e., something presented after a specific behavior is elicited) decreases the likelihood that behavior will be repeated
  • Reinforcement Schedule – Reinforcing Stimuli can be administered according to different schedules, to different effects. Types include:
    • Continuous Reinforcement – every desired response is reinforced
    • Intermittent Reinforcement (Partial Reinforcement) – some, but not all desired responses are reinforced – purported to better reflect real life whereby desired behaviors are not reinforced every time
  • Reward – either introducing Positive Reinforcement or withdrawing Negative Reinforcement after a specific behavior, for the purpose of increasing the likelihood that behavior will be repeated
  • Punishment –  either withdrawing Positive Reinforcement or introducing Negative Reinforcement after a specific behavior, for the purpose of decreasing the likelihood that behavior will be repeated.
  • Three-Term Contingency (ABCs of Behavior) – the sequenced association of Antecedent stimulus, the Behavior, and the Consequence in Operant Conditioning. (Compare Two-Term Contingency of Classical Conditioning.)
The following table provides a truncated list of strategies and intentions to control behavior. Precursors to Operant Conditioning included:
  • Trial-and-Error Learning Theory(Edward Lee Thorndike, 1890s) – a mode of learning through which a range of actions is tested until a satisfying outcome occurs. Trial-and-Error Learning Theory was an important precursor to Operant Conditioning for its realization that learning is promoted by positive consequences, which Thorndike formally articulated at the Law of Effect (see Behaviorisms). Associated constructs include:
    • All-or-Nothing Learning – a conjecture that, in a single event, learning is either completed or does not happen at all (contrast: Incremental Learning; see above)
    • Incremental Learning – learning that unfolds in small steps over time (contrast: All-or-Nothing Learning; see above)
    • Puzzle Box – a highly structured learning environment in which an organism must come across an action for a positive outcome – which was used to demonstrate that the time taken to manifest such actions tended to increase with the number of trials
  • Law of Effect (Edward Lee Thorndike, 1890s) – Actions associated with a satisfying result are more likely to be repeated under the same circumstances, and actions associated with an uncomfortable effect are less likely to be repeated.
  • Discrimination Learning (Carl Lashley, 1920s) – within Behaviorisms, learning to respond differently to different triggers
Constructs and discourses associated with Operant Conditioning include:
  • Avoidance Conditioning – learning how to elude or prevent an unwanted outcome
  • Chaining (Sequential Learning) breaking down a skill or competency into elemental parts, and then using those elements to define a teaching/training sequencing. Chaining is usually associated with Behaviorisms, but it is also taken up by Instructivism and similar discourses. In the context of Behaviorisms, it refers specifically to a sequence of behaviors in which the outcome of one behavior becomes the stimulus for the next.
  • Errorless Learning (Charles Ferster, 1950s) – within Behaviorisms, a learning procedure that is sequenced and paced in a manner that minimizes the likelihood of undesired responses. It is based on the premise that errors impede learning (and, so, a learner’s errors are seen as a function of the teacher’s inadequate analysis of behavior)
  • Escape Conditioning – learning how to terminate a persistent, unpleasant stimulus
  • Human Contingency Learning (Jan De Houwer, 1990s) – a probability-based, experimental approach to studying human learning that embraces foundational principles from both Classical Conditioning and Operant Conditioning, but that presses into topics of human cognitive processes. Its methods are focused on determining the extent to which (i.e., probability that) specific stimuli are related to specific responses.
  • Learned Generalization (Secondary Generalization) (Roger Shepard, 1980s) – when an organism responds the same way to different stimuli with similar properties
  • Omission Training – removing positive reinforcement when unwanted behavior occurs
  • Operant Chamber (Skinner Box) – a highly structured learning environment in which an organism is rewarded for a specific response, enabling full control of the conditions of learning and great accuracy of counts and measures
  • Probability Learning (William Glasser, 1970s) – an attempt to make sense of “choice” within a frame of Behaviorisms (i.e., ignoring mental processes), Probability Learning posits that behavioral choices are made according to the probabilities of their outcomes
  • Shaping (Successive Approximation) –a process of teaching complex behaviors by rewarding spontaneous actions that initially approximate desired behaviors and, as things progress, that come closer and closer to desired behaviors
  • Stimulus Discrimination (Clark Hull, 1940s) – learning to respond in a specific way to a Discriminative Stimulus that is very similar to a Discriminative Stimulus already associated with a learned response
  • Stimulus Generalization (Clark Hull, 1940s) – associating a learned response with a Discriminative Stimulus that is different from the Discriminative Stimulus originally associated with the response
  • Time-Out – excluding the learner from a situation in order to stop the incidental reinforcement of an undesired behavior
  • Visceral Learning – the use of Operant Conditioning techniques to affect nonconscious functions, such as heart rate. (Contrast with Autonomic Learning, under Classical Conditioning.)

Commentary

See Behaviorisms for global commentaries, especially as this theory relates to human learning. Three major critiques of Operant Conditioning in particular bear emphasis: Firstly, in recent decades it has become evident that Intrinsic Motivation Discourses (i.e., structures and dynamics that are not external to target behaviors) can be much more powerful for changing and maintaining behaviors than Extrinsic Motivation Discourses. Second, behaviors that are shaped by Operant Conditioning regimes of reward and punishment can be volatile and prone to regression and other distortions – indicating that the underlying metaphor of cognition (i.e., a network of cause–effect links) is not adequate, especially for higher-order functions. Thirdly, effective application of Operant Conditioning training techniques generally requires highly controlled situations with minimum distractions and maximum consistency in the rewarding and punishing of target behaviors. Such conditions simply cannot be achieved in a classroom setting.

Authors and/or Prominent Influences

B.F. Skinner; Edward Thorndike

Status as a Theory of Learning

Operant Conditioning is not a theory of learning, but advice for manipulating and managing behavior that arises from Behaviorisms.

Status as a Theory of Teaching

Operant Conditioning is a more appropriately regarded as a theory of teaching than a theory of learning, and this fact was recognised in the mid-20th century when it rose to prominence in the field of education. With its focus on manipulating behavior, Operant Conditioning falls among mechanistic conceptions of teaching associated with Correspondence Discourses on learning.

Status as a Scientific Theory

The foci, processes, and interpretations of Operant Conditioning are clearly articulated and supported by a substantial body of uncontradicted evidence.

Subdiscourses:

  • All-or-Nothing Learning
  • Avoidance Conditioning
  • Chaining (Sequential Learning)
  • Continuous Reinforcement
  • Discrimination Learning
  • Discriminative Stimulus
  • Errorless Learning
  • Escape Conditioning
  • Human Contingency Learning
  • Incremental Learning
  • Intermittent Reinforcement (Partial Reinforcement)
  • Visceral Learning
  • Law of Effect
  • Learned Generalization (Secondary Generalization)
  • Negative Reinforcement (Negative Reinforcer)
  • Operant Chamber (Skinner Box)
  • Omission Training
  • Positive Reinforcement (Positive Reinforcer)
  • Probability Learning
  • Punishment
  • Puzzle Box
  • Reinforcing Stimulus
  • Respondent Behavior
  • Reward
  • Shaping (Successive Approximation)
  • Stimulus Discrimination
  • Stimulus Generalization
  • Three-Term Contingency (ABCs of Behavior)
  • Time-Out
  • Trial-and-Error Learning Theory

Map Location



Please cite this article as:
Davis, B., & Francis, K. (2021). “Operant Conditioning” in Discourses on Learning in Education. https://learningdiscourses.com.


⇦ Back to Map
⇦ Back to List