Reward occupies a peculiar crossroads in the depth-psychology corpus, where neurobiological precision, linguistic archaeology, and clinical pragmatics converge without fully resolving into a single theoretical voice. Wolfram Schultz anchors the dominant neuroscientific position: reward is defined operationally as any object or stimulus that is liked and sought in greater quantity, and its significance for behavior is mediated not by its absolute value but by the deviation from expectation — the prediction error — encoded by dopamine neurons. This framing transforms reward from a static incentive into a dynamic relational quantity, one that perpetually recalibrates toward escalation and is thus structurally insatiable. The clinical literature on addiction (Blum, Miller, Paulus) elaborates this instability, showing how hypodopaminergic states and hijacked reward circuitry underlie compulsive seeking. A therapeutic counter-current, represented by Garland and Taylor, proposes that mindfulness-based interventions can devalue maladaptive rewards through reinforcement learning mechanisms, restoring the experiential salience of natural pleasures. Benveniste's philological voice introduces a wholly different register: in Indo-European antiquity, reward carried juridical and eschatological weight — a recompense for merit in contest or devotion — distinguishing earthly wages from transcendent compensation. Taken together, the corpus reveals reward as simultaneously a neurochemical signal, a motivational engine, a site of pathological capture, and a category embedded in the deepest structures of human social and spiritual accounting.
In the library
21 passages
'Reward' is any object or stimulus that I like and of which I want more. 'Reward prediction error' then means the difference between the reward I get and the reward that was predicted.
Schultz provides the foundational operational definition of reward as a liked, sought quantity whose behavioral significance derives from its discrepancy from prediction.
The dopamine prediction error response may belong to a mechanism that underlies our drive for always wanting more reward. This mechanism would explain why we need ever higher rewards and are never satisfied with what we have.
Schultz argues that the neurodynamics of prediction-error updating structurally compel reward escalation, producing insatiability as a biological default.
The study of reward prediction errors touches the fundamental conditions of life.
Schultz frames reward-prediction-error research as bearing on the deepest motivational conditions of human existence, anchoring it in both neuroscience and philosophical anthropology.
These substances seem to hijack the neuronal systems that have evolved for processing natural rewards... the drug effects mimic a positive dopamine reward prediction error... and thus induce continuing strong dopamine stimulation.
Schultz explains addiction as a pathological simulation of natural reward prediction errors, producing unregulated dopamine stimulation that overwhelms evolved homeostatic limits.
This is not concerned with some advantage of an economic character, nor of a regular remuneration, nor again with a wage for an ordinary piece of work, but rather with a recompense — material or otherwise — awarded to the one who emerges victorious from a stru
Benveniste distinguishes the Indo-European root of reward from ordinary economic remuneration, locating it in a domain of contest-based or devotional merit with material and spiritual dimensions.
Benveniste, Émile, Indo European Language and Society, 1973thesis
When the wage is to be received from the Father who is in Heaven, it is laun; the word mizdo was considered inappropriate.
Benveniste demonstrates that Gothic translators lexically distinguished transcendent reward (laun) from human wages (mizdo), revealing an ancient bifurcation between sacred and profane recompense.
Benveniste, Émile, Indo European Language and Society, 1973supporting
MBIs may provide a means of reward replenishment and ultimately reverse the reward deficiency syndrome inherent in addiction — a therapeutic process plausibly important for allaying craving and deterring relapse.
Garland argues that mindfulness-based interventions can therapeutically restore deficient reward processing in addiction by enhancing savoring and positive affect.
Garland, Eric L., Mindfulness training targets neurocognitive mechanisms of addiction at the attention-appraisal-emotion interface, 2014supporting
both expected reward values and present-moment reward values can be assessed to determine the correspondence between these two constructs and if they were influenced in the same way by the use of the mindful smoking craving tool.
Taylor operationalizes reward value at two temporal scales — expected and present-moment — and demonstrates that mindfulness-based craving tools systematically reduce both in smoking behavior.
Taylor, Veronique A., App-Based Mindfulness Training Predicts Reductions in Smoking Behavior by Engaging Reinforcement Learning Mechanisms: A Preliminary Naturalistic Single-Arm Study, 2022supporting
showed a decreasing slope trajectory of the reward value of eating across uses of a mindful eating app-based craving tool, which predicted corresponding decreases in maladaptive eating behavior.
Taylor provides empirical evidence that mindfulness training systematically decreases the subjective reward value of addictive behaviors, linking reinforcement learning to clinical behavior change.
Taylor, Veronique A., App-Based Mindfulness Training Predicts Reductions in Smoking Behavior by Engaging Reinforcement Learning Mechanisms: A Preliminary Naturalistic Single-Arm Study, 2022supporting
the association of dopaminergic signaling as it relates to the differences between expected and obtained reward, i.e. the reward prediction error, is one of the most influential recent developments in understanding reward-related processes.
Paulus situates reward prediction error as the central organizing concept for contemporary understanding of reward processing, particularly in the context of drug addiction and interoception.
Paulus, Martin P., Interoception and drug addiction, 2014supporting
a high density of low reward possibilities was associated with greater insular activation... enhancement of the hedonic and incentive values of an option by both cognitive and affective processes involves the insular cortex.
Paulus implicates the insular cortex in modulating both the hedonic and incentive dimensions of reward, linking interoceptive processing to reward valuation.
Paulus, Martin P., Interoception and drug addiction, 2013supporting
The dopamine error signal could be a teaching signal that affects neuronal plasticity in brain structures that are involved in reward learning, including the striatum, frontal cortex, and amygdala.
Schultz identifies the dopamine prediction error signal as a plasticity-inducing teaching signal distributed across key structures of the reward learning network.
The study of risky rewards allows us to address two important questions for dopamine neurons, the incorporation of risk into subjective reward value, and the construction of a formal, mathematical economic utility function.
Schultz extends reward analysis into the domain of risk and subjective utility, arguing that dopamine neurons integrate probabilistic risk into the computation of reward value.
This reward response is, not surprisingly, enhanced for one's own infant; multiple fMRI studies have reported stronger reward circuit activation while new mothers viewed photos of their own babies.
Lench demonstrates that the mammalian reward circuit is selectively amplified by attachment-relevant stimuli, with oxytocin and dopamine jointly mediating approach motivation toward one's own offspring.
Lench, Heather C., The Function of Emotions: When and Why Emotions Help Us, 2018supporting
the reward cycle's phases: wanting, liking, and learning, representing a peak in consummatory pleasure and the onset of the learning phase.
Schoeller maps aesthetic chills onto the tripartite reward cycle, situating peak pleasure as both a consummatory pinnacle and the trigger for reward-learning processes.
Schoeller, Felix, The neurobiology of aesthetic chills: How bodily sensations shape emotional experiences, 2024supporting
Taylor demonstrates a tight correspondence between expected and present-moment reward valuations, confirming that reinforcement learning models accurately capture the motivational dynamics of addictive behavior.
Taylor, Veronique A., App-Based Mindfulness Training Predicts Reductions in Smoking Behavior by Engaging Reinforcement Learning Mechanisms: A Preliminary Naturalistic Single-Arm Study, 2022supporting
the slope for smoking expected reward value was significantly more negative than that of the slope for expected value of not smoking.
Taylor shows that mindfulness craving-tool use differentially deflates the reward value of smoking relative to non-smoking, demonstrating selective devaluation of the addictive action.
Taylor, Veronique A., App-Based Mindfulness Training Predicts Reductions in Smoking Behavior by Engaging Reinforcement Learning Mechanisms: A Preliminary Naturalistic Single-Arm Study, 2022supporting
It is always Agamemnon who distributes to the aristêessi and the basileûsi, to the lords and kings, their géras, their portions of honor.
Benveniste traces géras as a distributed portion of honor-reward within a hierarchical social order, illustrating the agonistic and status-constitutive functions of reward in Homeric society.
Benveniste, Émile, Indo European Language and Society, 1973supporting
Substance use disorders (SUD) are inheritable and the culprit is hypodopaminergic function regulated by reward genes.
Blum frames addiction etiologically as heritable dysregulation of reward-gene-mediated dopaminergic function, positioning reward deficiency as the primary pathological substrate.
Blum, Kenneth, Early Intervention of Intravenous KB220IV Neuroadaptagen Amino-Acid Therapy (NAAT)™ Improves Behavioral Outcomes in a Residential Addiction Treatment Program: A Pilot Study, 2012supporting
the attentional characteristics of the initial response enhance the subsequent processing of reward information. This mechanism affords overall a gain in speed and accuracy without major cost.
Schultz describes a two-component dopamine response architecture in which an initial attentional signal primes the system for efficient downstream reward information processing.
A positive reinforcer, such as food, praise, or attention, should be made contingent on the prior performance of an appropriate response... partial reinforcement leads to greater persistence of learned behavior.
James introduces operant conditioning principles, noting that contingent positive reinforcers and partial reinforcement schedules shape behavioral persistence — an early behavioral cognate of reward learning.
James, William, The Principles of Psychology, 1890aside