Behaviorist Evolutionary Psychology

We evolved to condition one another.

Mar 06, 2023

1Behaviorism and evolutionary psychology may seem most antithetical to one another in terms of their ramifications for an understanding of human nature, and the premises that they begin with. Behaviorists implicitly posit a scant number of general purpose learning mechanisms that govern all behavior regardless of the adaptive landscape of the species, whereas evolutionary psychologists posit a multitude of domain specific learning programs with varying input flexibility depending on the adaptive landscape of the particular organism or species. While evolutionary psychologists and others have approached learning from the perspective of classical and operant conditioning, the behaviorist study of human behavior can be evolutionized further. We can examine reinforcement and punishment from an evolutionary perspective and the way learning schedules are established by individuals in social interactions.

Behaviorism had its heyday throughout most of the recent history of psychology. Later, radical behaviorism was extensively criticized because of various implicit assumptions it makes about equipotentiality and its rebuking of species specific adaptations in response to environmental stimuli. Behaviorist principles are still incredibly useful when understanding behavior and learning. While it is patently obvious that some behaviors are reinforcing and punishing, most psychologists do not address this facet of social interactions. For instance, it is understood that a man who is jealously enraged when his mate makes sexual advances towards a rival may verbally or physically abuse her. This can be called abuse or intimidation, but this is rarely if ever directly labeled “punishment”: a consequence that can reduce the frequency of behavior. Explicitly using behaviorist terminology integrates behaviorism with evolutionary psychology. I will begin by discussing the nature of primary reinforcers in human beings and move on to the thesis that human beings have an evolved psychology to behaviorally condition one another.

Human species specific primary reinforcers and primary punishers

Animals have different primary reinforcers depending on the cues in their environment and which cues are most likely to be statistically associated with adaptive outcomes. Primary reinforcers are defined as unconditioned stimuli that are inherently reinforcing without classical or operant conditioning involved. Food, air, sex and sleep are examples of primary reinforcers, and these are well preserved among many organisms. From an evolutionary perspective, there must be many many more primary reinforcers than just the handful put forward by behaviorism. Because evolved psychological mechanisms solve adaptive problems with biologically prepared inputs, there are primary reinforcers that exist for the human species that do not exist for other species. We can also predict that there will be primary reinforcers associated with most psychological mechanisms that function to help motivate organisms the adaptive goals of those mechanisms. As a generalization, we can expect that any cue indicating that a relevant adaptive problem has been solved would be a primary reinforcer. Seeing a smile is an example of such a primary reinforcer. While behaviorists would argue that a smile is a secondary reinforcer, because it has been classically conditioned by being paired with food or another primary reinforcer, an evolutionary learning perspective would differ. Experimental evidence shows that infants that are 2-7 months old can be conditioned to prefer some sounds over others using only smiles as reinforcement (Routh, 1969). For infants, the major adaptive problem they face is having a caretaker who is engaged in caring for him or her adequately. Therefore, a smile by the caretaker towards the infant is a cue that a relevant adaptive problem is at least partially being resolved by the behavior that elicits the smile.

Another prediction derived from evolutionary psychology is that primary reinforcers differ over the lifespan and that different primary reinforcers would have different salience depending on the adaptive problems that were recurrently faced by our ancestors during particular life stages. For example, pictures of nude sexually mature women would be a strong primary reinforcer for males after puberty, but probably would less of a reinforcer for males that had not reached puberty. Researchers have found that male rhesus macaques will “pay” (give up part of a juice reward) to view the perineum region of female macaques or the faces of high status males but will not “pay” to view the faces of low status conspecifics (Deaner, Khera, Platt, 2005)2. This study is interesting because it shows that social information in and of itself can act as reinforcement. Evolutionary psychology would predict that these same stimuli would not be reinforcing for female monkeys or monkeys who have not reached sexual maturity. Therefore, primary reinforcers also do not display equipotentiality.

Primary reinforcers do not always feel pleasurable and may even have “negative” emotional affect associated with them. Many emotions may have negative emotional valence to them, making an individual feel anxious or upset, but these emotions may still cause motivation towards an act in a certain domain or perpetuate a behavior upsetting to the actor. While these emotions may feel negative to the individual they have evolved to help motivate individuals solve certain adaptive problems. When a jealous person goes through their mate’s belongings and finds a phone number or other evidence of infidelity this may act as a reinforcer to continue investigative behavior even though the finding itself is unpleasant. In a fight, a hard hit from one’s rival may be painful but can serve to motivate and perpetuate the desire to fight and increase proximity to the source of the pain. Both of these examples are exactly the opposite of what radical behaviorism would predict. All stimuli have context and domain specific effects on the organism depending on the adaptive problem they face.

Just as there are primary reinforcers there are also primary punishers. Shocks, loud noises, pain, and other such stimuli are almost always primary punishers, aversive stimuli that do not have to be paired with other aversive stimuli to reduce the frequency of behaviors that they follow or to cause aversion to stimuli they are paired with. Just as humans have their own suite of primary reinforcers they also have their own species specific primary punishers. Exclusion, embarrassment, reputational damage, shame, and the realization of inequity in a social exchange could all be examples of social primary punishers or consequences that reduce the prevalence of the action they follow (operant conditioning) or make the individual more averse to the stimuli they are paired with (classical conditioning).

Social primary reinforcers such as cues to sexual access, affection, grooming, smiling, friendly eye contact, cues of elevation in a status hierarchy and social primary punishments like those mentioned above are important for the thesis that humans have evolved to be shapers of the behavior of other humans. Evolutionary psychologists have investigated the evolved cognitive structure of humans in regard to adaptive preferences and aversions. This may be important to understand how we implicitly understand how other humans learn in their social interactions.

The extended phenotype

In “The Extended Phenotype”, Dawkins advances the hypothesis an organism’s physiological characteristics, behavior patterns and individual differences are not the only things that make up their phenotype. Phenotype also consists of an organism’s effect on the environment and even its effects on other organisms. In a famous example, the “Bruce effect” is a pregnancy disruption that occurs when a female mouse smells an unfamiliar male mouse’s pheromones. In this case, her pregnancy is immediately terminated and she becomes sexually receptive and fertile much more quickly than if she had carried her litter to term. Dawkins argues that one does not only have to view the abortion and early receptivity of the female mouse as an adaptation on her part but can also view this as an adaptation on the part of the male mouse. Manipulation of the external environment, including the manipulation of other organisms and conspecifics is constantly under just as intense selection pressure as those phenotypic characteristics that enable the organism to adapt to its immediate environment. Humans shape their physical environment and social environment in myriad ways. By using intuitive behaviorism, including rewards and punishments, humans extend their phenotype into the behavior of other humans.

Social interaction as shaping behavior

In a seminal paper by Dawkins and Krebs, they claimed that all communication can be viewed as manipulation (Dawkins & Krebs 1978). Evolutionary psychologists have advanced different adaptive hypotheses about the functions of different kinds of social behavior. For example language can communicate vital information to kin or court mates by demonstrating quality (e.g. through storytelling). Repeated interactions can demonstrate trustworthiness.

David Buss developed the term “strategic interference” to explain anger, upset and conflict between men and women. On average, women want a long courtship to evaluate the quality of men and men want sexual access more quickly, described here in the Evolution of Desire

Men and women also clash over resources and sexual access. In the evolutionary psychology of human mating, the sexual strategy adopted by one sex can trip up and conflict with the strategy adopted by the other sex. I call these phenomena strategic interference. Consider the differences in men’s and women’s proclivities to seek casual short-term sex. Men and women typically differ in how long and how well they need to know someone before they consent to have sex….There is a fundamental conflict between these different sexual strategies: men cannot fulfill their short-term wishes without simultaneously interfering with women’s long-term goals. An insistence on immediate sex interferes with the goal of a longer courtship phase. The interference is reciprocal, since any delay also obstructs the goal of those seeking short-term sex. Whenever the strategy adopted by one sex interferes with the strategy adopted by the other sex, strategic interference and conflict ensue.

Strategic interference doesn’t just describe conflict between men and women. Every organism has an adaptive strategy. This strategy can be finding a mate, taking care of young, or drinking dew off a leaf. When another organism interferes with an adaptive strategy one response can be to use punishment to prevent behavior that interferes with your strategies in the future.

The flip side of strategic interference is strategic facilitation - let’s say you are out having a drink with some colleagues and you want your colleagues to think you are smart and virtuous. Your colleague William can tell everyone that he once saw you kick a puppy, that would be strategic interference. Anger, annoyance, upset and feelings of revenge would facilitate you punishing William so he would reconsider interfering with your strategy of looking good in front of others. If your colleague William talks about how he saw you save a puppy, that would be strategic facilitation with regard to making other people think you are virtuous. You would expect feelings of gratitude, pleasure and satisfaction that could influence you to reward William so he might be more likely to speak highly of you in the future.

From a behaviorist evolutionary perspective social behavior 1- helps us monitor how others’ actions help or hinder our adaptive goals and 2- repeated interactions can establish reinforcement and punishment schedules. To take this further, I would claim that reciprocal training is always occurring between interacting members of a social group, consciously or unconsciously. Social interaction can be seen as a way to adaptively expand your phenotype into the behavior of other people.

Above I mentioned that organisms all have primary reinforcers that feed into their evolved psychological mechanisms motivating them towards adaptive goals. Just as these primary reinforcers or primary punishers have intrinsically rewarding or punishing properties respectively, organisms interact with other organisms in the language of their primary reinforcers or primary punishers. Part of every organism’s “theory of mind” of other organisms is an innate understanding of the contingencies they can establish in order to manipulate the behavior of other organisms towards their adaptive goals.

The puzzle of inflicted costs

Evolutionary psychologists make frequent mention of individuals inflicting costs on one another without explicitly stating why individuals would do this. Although tactics of manipulation have been studied in romantic relationships (Buss, Gomes, Higgins, Lauterbach 1987) the intended effects of both conscious and unconscious manipulation have not been approached from an evolutionary behaviorist perspective.

One area in which the idea of shaping behavior could be leveraged is in the debate between homicide byproduct theorists, (Daly & Wilson, 1988) and homicide adaptation theorists (Duntley & Buss 2011). Step parents are far more likely to abuse step children than genetically related parents and being a step child is one of the major predictors of child abuse. This has been termed “the Cinderella effect” and Daly and Wilson argue that the reason step parents abuse step children is because the evolved parental psychological mechanisms that enable the generous care of children in biological parents are not fully activated in step parents, if they are activated at all. On the other hand, Buss and Duntley argue that step parents have adaptations to directly inflict costs on and kill step children. But why is it that step parent abuse towards step children is more common than homicide? Why are there are adaptations for “inflicting costs” as well as killing? Why would stepparents not more efficiently cut off the flow of resources to non-genetic children via homicide but instead engage in a protracted period of inflicting costs? From a purely resource safeguarding standpoint, if a man wants to best use parental resources to his genetic offspring the most effective strategy is for him to kill these non-genetic offspring. A well-known example of this behavior is that male lions quickly kill all the cubs in a pride when they are newly dominant (video of the behavior here). In humans, a period of inflicting costs is actually much more costly to the male; Any abuse above and beyond negligence will cost him time and energy as well as leave him a child that is potentially injured and will therefore require extended maternal time and energy and potentially more resources.

One way of reducing the incidence of costly solicitous behaviors in an unrelated child, is to extinguish the behavior. Many (but not all) behaviors undergo extinction when they are no longer followed by a positive consequence, but punishment can often work more quickly. From a behaviorist evolutionary perspective, abuse, like imposing pain, bodily injury, or humiliation is a punishment that reduces the frequency of resource solicitation or drives it to extinction . From this perspective, neglect and abuse can reduce or prevent the resource depleting demands of unrelated children, enabling them to be channeled to other adaptive opportunities.

Another puzzle psychologists have wrestled with is prosocial punishing (AKA altruistic or third party punishment). Why would anyone expend time or effort to punish an individual whose actions do not directly affect them? There have been many reasons posited for this including social displays, displays of dominance and even group selection. The simplest explanation may be that this prosocial punishing is shaping the behavior of the individual being punished in order to deter them from engaging in costly acts in the future. From this perspective one could predict that prosocial punishing is more likely to occur in domains in which one predicts they may find themselves or their kin with the individual perpetuating the undesirable behavior. To use the example above, I might punish William for engaging in malicious gossip about a friend because it may deter him from engaging in malicious gossip about me. Or, if I am a man with a daughter I may be more likely to prosocially punish a man who sexually assaults another woman in my group than if I am a man with a son. In small hunter gatherer societies, deterring an undesirable behavior in an individual may have conditioned him or her not to perpetuate that behavior in the future in ways that inflict direct costs. With a psychology that expects interactions at some point with every individual of the community prosocial punishment may be a preemptive means of preventing behaviors that will be costly in the future.

Manipulation of Mates: Inflicting costs and doling out rewards

3Of all the relationships humans have with one another romantic relationships often result in the most shared genetic fate. Because the behavior of one’s mate may have such a strong impact on one’s reproductive success, tactics of shaping behavior should be heavily employed in the context of romantic relationships. We should expect that reinforcers and punishments will be used in mating contexts from the initial stages of courtship to managing parental duties and deterring the divergence of resources.

In Buss et. al’s 1987 paper, 6 major classes of manipulation tactics were found in romantic relationships: charm, silent treatment, coercion, reasoning, regression and debasement. Each of these can be viewed from a social shaping perspective. Charm, reasoning, and debasement can be viewed as examples of positive reinforcement. The “charm” tactic included acts of love and affection, compliments, gifts and promising a reciprocal favor in exchange for a behavior in the mate. The reasoning tactic included the item “I point out all the good things that will come from [the desired behavior]” which may represent a further extension of shaping via positive reinforcement. Finally, debasement could be a tactic used if the couple is asymmetrical in mate value. If your mate communicates that you are equal or better mate value it may signal their greater commitment to you. If one mate debases or lowers themselves in the eyes of the other mate, this could play on the primary reinforcers intrinsic to motivations in the psychology of human mating. Thus, debasement rewards the mate.

Silent treatment, coercion and regression are forms of punishment. Coercion, including threats and criticism is simple punishment similar to other kinds of abuse. Silent treatment, like the neglect discussed above, represents “negative punishment” withholding positive consequences for the behavior as an adaptive tactic meant to extinguish the behavior. If someone values interacting with their mate, taking away this reinforcer is a way of simultaneously punishing the behavior and no longer supplying positive reinforcement towards that behavior. Regression, including whining, sulking, crying and pouting parasitizes the domain of child rearing in order to meet its ends. Children engage in these tactics in order to elicit parental resources via manipulating parental mechanisms attuned to real displays of these emotions in emergency situations. Because of error management, parents often err on the side of giving children what they want when crying or screaming because ignoring a desperately needed appeal is more costly reproductively than sometimes giving in to ersatz appeals. By using these same tactics, a mate may be able to elicit the behavior desired if someone is sensitive to their elicitations.

More subtle and potentially unconscious forms of shaping behavior may also be employed. Pupil dilation signals interest and love and may act as a reinforcer on the part of both romantic partners. Pupil dilation in response to statements of love and affection can reinforce the cognitions that stimulate these expressions and make them more frequent. Women use sex as a reinforcer and the withdrawal of sex as a punishment which also need not be conscious. Males may use resource displays, statements of affection and exclusivity including the derogation of other women in order to elicit sexual or other behavior desired from their female partner. A final example is that men and women may evoke jealousy directly (in a social encounter with a sexual rival of the mate) or indirectly (by speaking highly or commenting on the attractiveness of an intrasexual rival) in order to produce a punishment for unwanted behavior.

Antagonistic coevolution and responses to shaping adaptations

Because individuals evolved the ability to shape others’ behavior a number of coevolved adaptations may have arisen in response to this. As discussed above positive reinforcement may be given to an individual in return for an act or behavior. Individuals will calibrate their expectations for positive and negative consequences for their actions based on previous consequences.

In a famous study in developmental psychology, children who were coloring with markers were divided into three different reinforcement groups. One group was told they would be given a reward for coloring, another was given a unexpected reward at the end of coloring, and the third group was given no reward (Lepper, Greene & Nisbett, 1973). The reward consisted of a ceremonious presentation of a “good player” award, which simultaneously was a material and social reward. Later, they saw how long children played with markers when given free opportunity depending on if they had been rewarded for doing so in the past. They found that the children who were initially rewarded played with the markers the least while children that had been given an unexpected reward played with them the longest. Over the years this effect has come to be known as the “overjustification hypothesis” and individuals who are rewarded for actions they intrinsically enjoy reduce their frequency of these actions when given an expected extrinsic reward. An evolutionary psychology of learning could have a different rationale for this effect. If individuals have evolved in an environment where their behavior was being recurrently shaped by other humans using reinforcement then we may have an evolved psychology to “hold out” on performing rewarded actions unless this benefit is initially offered. Rewards that are given before an action is performed are more obviously given by other humans (environmental rewards rarely precede a behavior) whereas beneficial consequences that occur after an action is performed could be “payment” or a positive conditionality generated by the environment. One prediction generated by this hypothesis is that if a reward that follows a behavior is obviously generated by the environment that behavior will either stay the same or increase in frequency whereas if a positive consequence is obviously generated by a human potentially shaping one’s behavior then the behavior may be “withheld” unless the reward is seen as available or unless the individual has some evidence that they will be given a greater reward for the same action. This is another possible exception to equipotentiality, a different response to rewards given by an agent versus those given by the environment.

Other ramifications of the current thesis

Social rewards and punishment need not be detrimental to one’s fitness or otherwise “manipulative” but may help people achieve shared adaptive goals better than without them. For example, the rewards and punishments your parents provide you may serve as guideposts about how to behave appropriately in society. Being too sensitive to the rewards and punishments of others can make your behavior better at facilitating their adaptive strategies than your own. This has potential in explaining why we rebel against conditioning- and why disapproval from someone who is not aligned with us can be reinforcing.

Do individuals have an evolved psychology of shaping their own behavior towards adaptive goals with reinforcement and punishment? It is rumored that B.F. Skinner used operant conditioning techniques on himself to increase his productivity. As a reward for work completed he would take a break from working to listen to music or play ping pong, one of his favorite pastimes in graduate school. One of my best friends in college used a similar tactic with herself, smoking a cigarette each time she completed a page of a term paper. Consciousness itself may be the most comprehensive shaper of one’s own behavior. Consciousness affords us the ability to view ourselves as an outside actor thus enabling one to punish or reinforce one’s own behavior in light of the conscious plans and goals that one has for oneself. Internal emotional states act as positive and negative consequences for actions that one’s consciousness deems acceptable or unacceptable.

References

Anderson, J.R. (1998). Social stimuli and social rewards . J. Neurophysiol. 92, 3056–3068.

Buss, D.M. (2005). The murderer next door: Why the mind is designed to kill. New York: The Penguin Press.

Buss, D. M., Gomes, M., Higgins, D. S. & Lauterbach, K. (1987). Tactics of manipulation. Journal of Personality & Social Psychology, 52(6), 1219-1229.

Buss, D. M. (2016). The evolution of desire: Strategies of human mating. Hachette UK.

Dawkins, R., & Krebs, J. R. (1978). Animal signals: information or manipulation. Behavioural ecology: An evolutionary approach, 2, 282-309.k

Daly, M., & Wilson, M. (1988). Homicide. Hawthorne, NY-. Aldine de Gruyter.

Deaner, Robert O., Amit V. Khera, and Michael Platt (2005) Monkeys Pay Per View: Adaptive Valuation of Social Images by Rhesus Macaques. Current Biology, Volume 15, Issue 6, Pages 543-548

Duntley, J. D., & Buss, D. M. (2011). Homicide adaptations. Aggression and Violent Behavior, 16(5), 399-410.

Lepper, M. R., Greene, D., & Nisbett, R. E. (1973). Undermining children’s intrinsic interest with extrinsic rewards: A test of the “overjustification” hypothesis. Journal of Personality and Social Psychology, 28, 129-137.

Routh, D.K. (1969). Conditioning of vocal response differentiation in infants . Developmental Psychobiology, 1, 219–226 .

I wrote this essay for an evolutionary psychology seminar and it’s the origin of many ideas that are now being expanded in my forthcoming book, How to Train Your Boyfriend.

See Anderson, 1998 on the history of studies examining the operantly reinforcing properties of social stimuli in nonhuman primates.

My forthcoming book is about “training” in romantic relationships. You can get a sense of it here.

Discussion about this post

Yudhistira Ghifari Adlani

Mar 6, 2023

A genuinely interesting thesis. It seems that, if my reading is correct, you're trying to apply the insights of behaviorism (or to be precise, applied behavior analysis) with a dose of EP (specifically, an adaptationist program). And you want to write a book about relationship, and how we can basically shape each other's behavior with the insights from sexual selection and other parts of EP. Pretty neat.

Some paper that might help on your journey for writing that book:

1. A Comparison of the Properties of Different Reinforcers (Hogan & Roper, 1978)

2. A Natural Science of Behavior (Catania, 2013)

3. Timberlake’s behavior systems: A paradigm shift toward an ecological approach (Felipe et al., 2019)

4. Taming the boojum: Being theoretical about peculiarities of learning (Bowers, 2022)

These are just some of the papers that might help you think more clearly in regards to your thesis. If you want to check out paper that discuss technical issues with learning and behavior, you might want to check out my google sheets in the section "issues" with learning (https://docs.google.com/spreadsheets/d/112hxKPKyp7OOOd5bFfcy5FzxNFywDJqOaX2npnhh3zM/edit?usp=sharing).

Perhaps, my one criticism for this article is when you make generalizations, "Behaviorism and evolutionary psychology may seem most antithetical to one another in terms of their ramifications for an understanding of human nature, and the premises that they begin with." Because it begs the question, do they though? From reading the literature, we might need to ask which "behaviorist" and which "evolutionary psychologist".

Cause technically, Skinner IS an evolutionary psychologist, as any other behaviorist that also talks about Darwin (e.g. Kantor, Catania, Carrera, Baum). They just don't think about it in the same way as an EP would do it. Here is Skinner's take on evolution (https://doi.org/10.1017/S0140525X0002673X; which he got criticized heavily lol, but hey, that's Skinner :D). Funny enough, he once discuss his ideas with E.O. Wilson in this book (https://link.springer.com/book/10.1007/978-0-387-89462-1). Another one that he might differ from EP is that Skinner genuinely dislike hypothetical construct that can't be observed (e.g. mind). Couple days before his passing, he writes this paper to summarize his view (https://psycnet.apa.org/record/1991-08735-001). However, other behaviorist such as Howard Rachlin, he does talk about the mind, but then again with his own way (https://onlinelibrary.wiley.com/doi/10.1002/jeab.782). I think a lot (not all) of the differences between EP and behaviorism, even perhaps other system in psychology is a matter of language that they use. That's why perhaps I think your work is important because it can bridge between ideas that's never been bridged before, but you have to characterize them with their own respective language.

An example of a paper that manage to translate between system in a manner that's fair is by Meehl (https://meehl.umn.edu/sites/meehl.umn.edu/files/files/151murrayskinner.pdf). I usually share this paper if anyone want to understand behaviorism (and psychoanalysis) better, but also how a student "defend" his mentor in a manner that's charitable and accurate.

Expand full comment

Anonymous Dude

Jun 24, 2023

I was listening to your podcast with Meghan Daum, and was laughing at your description of what you need to make polyamory work (high IQ, lack of jealousy, self-control, low agreeableness...so basically be a tech bro?) and the behavior of the various Reddit communities (yeah, I don't see why you have to be a Marxist to be poly).

Here's my question for you: you basically accept that masculinity/femininity varies on a spectrum (with the biological sexes' distributions centered at very different places). Have you thought about writing a guide for masculine women? Topics might include the pluses and minuses of promiscuity, male versus female friends and the strengths and weaknesses of each, whether to have kids and how to decide, and so on. I write this because I have a rather nerdy female friend, all of whose friends are non-nerd women or nerd men, and neither of them really gets her. It might help other people in similar situations.

(There needs to be a guide for feminine men, too, but I don't know who would write it.)