The Stanford Prison Experiment: The Concept of "Evil"

In August 1971, a group of college students arrived at the basement of Stanford University’s psychology department for what they believed would be a two-week study on prison life. They had been randomly assigned roles as either guards or prisoners in a simulated prison environment. Within days, the “guards” were engaging in psychological abuse, humiliation, and cruelty toward the “prisoners,” who showed signs of severe emotional distress, learned helplessness, and psychological breakdown. The experiment became so disturbing that it was terminated after only six days instead of the planned two weeks. What began as a psychological study became one of the most famous—and controversial—demonstrations of how quickly ordinary people can engage in cruel behavior when placed in certain situations.

The Stanford Prison Experiment, conducted by psychologist Philip Zimbardo and his colleagues, has been cited for decades as evidence that evil isn’t inherent in certain individuals but rather emerges from situations that encourage or permit cruel behavior. Zimbardo argued that the experiment demonstrated how normal, psychologically healthy people could transform into abusers simply by being placed in positions of power within an institutional setting that dehumanized others. The study seemed to show that we’re all capable of terrible acts under the right circumstances, challenging the comforting belief that only fundamentally bad people do bad things.

The experiment’s influence extends far beyond academic psychology. It’s been invoked to explain real-world atrocities from the abuse at Abu Ghraib prison during the Iraq War to police brutality to corporate malfeasance. Zimbardo himself testified as an expert witness in the Abu Ghraib trials, arguing that the abusive behavior of American soldiers reflected situational factors similar to those in his prison experiment. The study has been featured in countless psychology textbooks, referenced in popular culture, and shaped how we think about the origins of evil and cruelty in human behavior.

However, in recent years, the Stanford Prison Experiment has faced intense scrutiny and criticism. Researchers have questioned Zimbardo’s methodology, revealed evidence that guards were coached to behave harshly, and challenged the experiment’s core conclusions. Some critics now argue that the study tells us less about the inherent capacity for evil in all humans and more about how authority figures can manipulate people into cruel behavior through explicit instructions and social pressure. This reassessment raises profound questions about what the experiment actually demonstrated and what it reveals—or doesn’t reveal—about human nature, evil, and our capacity for cruelty.

Table of Contents hide

1 The Experiment: What Actually Happened

2 Zimbardo’s Interpretation: The Situational Nature of Evil

3 What the Experiment Claimed to Reveal About Human Nature

4 The Criticisms: Was the Experiment Fundamentally Flawed?

5 Alternative Explanations: Social Identity and Leadership

6 Ethical Issues: Should the Study Have Been Conducted?

7 What We Can Actually Learn: Nuanced Lessons

8 FAQs About The Stanford Prison Experiment

8.1 Was the Stanford Prison Experiment scientifically valid, or has it been discredited?

8.2 Did the experiment prove that anyone can become evil in the right situation?

8.3 How does the Stanford Prison Experiment relate to real-world atrocities like the Holocaust or Abu Ghraib?

8.4 What happened to the participants after the experiment ended?

8.5 Should the Stanford Prison Experiment still be taught in psychology classes?

The Experiment: What Actually Happened

Philip Zimbardo and his research team recruited 24 male college students through newspaper advertisements offering $15 per day to participate in a psychological study of prison life. The volunteers underwent psychological testing to ensure they were emotionally stable, physically healthy, and had no history of crime or drug abuse. From this pool, participants were randomly assigned to play either guards or prisoners, with assignment determined by coin flip. This random assignment was crucial to Zimbardo’s later claims—if roles were assigned randomly, then differences in behavior must stem from the roles themselves rather than pre-existing personality differences.

The basement of Stanford’s psychology building was converted into a mock prison complete with cells, a solitary confinement closet called “the hole,” and an observation area. On a Sunday morning, actual Palo Alto police “arrested” the students assigned as prisoners at their homes, charging them with armed robbery, handcuffing them, and bringing them to the mock prison. Upon arrival, prisoners were stripped, deloused, given uniforms with numbers instead of names, and fitted with chains around their ankles. Guards wore khaki uniforms, carried wooden batons, wore reflecting sunglasses to prevent eye contact, and were given minimal instructions except that they should maintain order without using physical violence.

Within hours, the guards began asserting their authority through increasingly arbitrary rules and punishments. Prisoners were awakened for counts in the middle of the night, forced to do push-ups, and subjected to humiliating tasks. When prisoners rebelled on the second day, guards responded by spraying them with fire extinguishers, stripping them naked, removing mattresses from cells, and forcing ringleaders into solitary confinement. The guards became increasingly creative in their degradation tactics—making prisoners clean toilets with their bare hands, forcing them to simulate sexual acts, denying bathroom privileges, and generally treating them with contempt and cruelty.

The prisoners’ responses varied but generally moved toward passivity and psychological breakdown. Some became extremely distressed, crying and showing signs of acute anxiety. One prisoner had to be released on the second day due to uncontrollable crying, fits of rage, and disorganized thinking. Others became passively obedient, internalizing their prisoner role to the point of referring to themselves by their assigned numbers rather than their names. The prisoners largely accepted the guards’ authority and their own degraded status, rarely attempting collective resistance after the initial rebellion was crushed.

Zimbardo himself played the role of prison superintendent, becoming deeply absorbed in maintaining the prison rather than observing the experiment objectively. When his girlfriend, Christina Maslach (now his wife), visited on the fifth day and was horrified by what she witnessed, her strong objections finally penetrated Zimbardo’s absorbed state and led him to terminate the experiment the next day. Of the approximately fifty people who observed the experiment during its six-day run—including graduate students, colleagues, and family members—Maslach was the only one who objected strongly enough to question whether the study should continue.

Zimbardo’s Interpretation: The Situational Nature of Evil

Philip Zimbardo interpreted his experiment as demonstrating the power of situations to overwhelm individual personality and values in determining behavior. His central claim was that the cruel behavior of the guards and the passive suffering of the prisoners resulted from the roles and institutional context rather than from the individuals’ pre-existing dispositions. Since participants were randomly assigned to roles and were psychologically healthy college students, their dramatically different behaviors must have stemmed from the situational forces of the prison environment rather than from their individual characteristics.

This situational explanation of behavior challenged what Zimbardo called the “fundamental attribution error”—the tendency to attribute others’ behavior to their personality or character while underestimating the influence of situations. When we see someone behaving cruelly, we typically conclude they’re a cruel person. Zimbardo argued that this attribution is often mistaken and that most cruelty results from situational pressures that would affect anyone placed in similar circumstances. The Stanford Prison Experiment seemed to show that you don’t need to be a sadist to act sadistically—you just need to be placed in a situation that encourages, permits, or rewards sadistic behavior.

Zimbardo elaborated these ideas in his 2007 book “The Lucifer Effect: Understanding How Good People Turn Evil,” where he argued that evil is not the result of a few bad apples but rather of bad barrels that corrupt good apples. Institutional and situational forces, he argued, can transform ordinary people into perpetrators of atrocities. He identified key situational factors that facilitate such transformations: deindividuation (losing one’s sense of individual identity), diffusion of responsibility, dehumanization of victims, conformity to group norms, and gradual escalation where small transgressions make larger ones easier.

This interpretation had profound implications. If evil results primarily from situations rather than individual pathology, then any of us could become perpetrators under the wrong circumstances. This democratic view of evil—”we’re all capable of this”—challenged the comforting belief that only fundamentally bad people do terrible things. It suggested that preventing atrocities requires changing systems and situations rather than just identifying and removing bad individuals. It implied that understanding and preventing genocide, torture, or abuse requires examining social structures and institutional practices rather than focusing solely on individual psychology.

Zimbardo’s situational account also offered a kind of explanation, if not excuse, for perpetrators of abuse. If situations cause cruel behavior, then individuals bear less personal responsibility for their actions. This implication became controversial when Zimbardo testified as an expert witness defending Sergeant Ivan “Chip” Frederick, a guard involved in the Abu Ghraib prisoner abuse scandal. Zimbardo argued that the situational pressures at Abu Ghraib—similar to those in his prison experiment—explained the guards’ behavior, suggesting that the military and political leadership bore more responsibility than the individual guards who actually committed the abuse.

What the Experiment Claimed to Reveal About Human Nature

The Stanford Prison Experiment entered popular consciousness as a powerful demonstration that the line between good and evil is thinner than we’d like to believe. It suggested that civilized behavior represents a fragile veneer that can dissolve rapidly under certain conditions, revealing a darker human nature beneath. The experiment appeared to show that humans have a disturbing capacity for cruelty that emerges readily when authority legitimizes it, when victims are dehumanized, and when social roles provide scripts for abusive behavior.

The study resonated because it aligned with other disturbing findings from social psychology, particularly Stanley Milgram’s obedience experiments showing that most people would deliver apparently dangerous electric shocks to strangers when ordered by an authority figure. Together, these studies painted a picture of humans as highly susceptible to social influence and capable of cruelty when situations permit or encourage it. They challenged Enlightenment optimism about human rationality and goodness, suggesting instead that we’re vulnerable to being swept into collective evil through social and institutional pressures.

The experiment also seemed to reveal how quickly people internalize assigned roles and how powerfully these roles shape behavior. The guards didn’t just pretend to be authoritarian—many seemed to genuinely become cruel and domineering. The prisoners didn’t just act submissive—they appeared to genuinely lose their sense of agency and internalize their degraded status. This suggested that our sense of self is more malleable and situationally determined than we typically recognize, and that social roles can override our core values and personality traits.

The study appeared to show that power corrupts quickly and thoroughly. Within mere hours, college students given guard roles began abusing their power, and this abuse escalated rapidly over days. This seemed to demonstrate that the problem isn’t just that power attracts corrupt individuals but that power itself corrupts, transforming even decent people into abusers when institutional structures permit or encourage such behavior. This interpretation suggested that hierarchical institutions with unchecked power dynamics create conditions for abuse regardless of the individuals involved.

The dehumanization process the experiment demonstrated seemed particularly important for understanding how ordinary people commit atrocities. When prisoners were stripped of their names, given numbers, subjected to degrading uniforms and procedures, and treated as less than fully human, the guards found it easier to abuse them. This echoed historical atrocities where victims were systematically dehumanized before being subjected to violence. The experiment suggested that preventing atrocities requires vigilantly resisting dehumanization and maintaining recognition of others’ full humanity even in institutional settings.

The Criticisms: Was the Experiment Fundamentally Flawed?

In recent years, serious criticisms have emerged challenging the Stanford Prison Experiment’s methodology, interpretation, and conclusions. These critiques suggest that the study may tell us less about the inherent human capacity for evil than about how researchers can manipulate participants into behaving in predetermined ways through coaching and social pressure.

French researcher Thibault Le Texier’s 2019 investigation revealed that guards received much more specific instructions than Zimbardo had publicly acknowledged. Rather than simply being told to maintain order, guards attended orientation meetings where they were told the experiment’s purpose was to study prisoners, and researchers discussed ways to generate feelings of frustration and powerlessness in prisoners. One guard, Dave Eshelman, later revealed that he deliberately played an exaggerated, cruel character based on a movie he’d seen, consciously performing for the researchers rather than spontaneously becoming abusive. Far from guards naturally becoming cruel, some were essentially coached or encouraged toward harsh behavior.

The selection of participants also raises questions. The newspaper advertisement recruiting participants specifically mentioned a “prison life” study, which may have attracted individuals interested in or comfortable with power dynamics and control. While Zimbardo screened for psychological health, this screening didn’t assess attitudes toward authority, punishment, or dominance. The study’s findings might reflect characteristics of people attracted to prison studies rather than universal human tendencies.

The claim that prisoners became genuinely passive and broken has also been challenged. Prisoners knew they could leave by asking to be released, and several did leave early. Those who stayed may have been consciously playing along with the experiment or may have stayed for the payment rather than because they’d been psychologically broken. Some prisoners later reported that they were acting or exaggerating distress. The interpretation that they’d genuinely internalized degraded identities may overstate what was actually happening.

Zimbardo’s deep involvement as both researcher and prison superintendent created fundamental conflicts of interest. Rather than objectively observing what happened, he actively shaped events to create the outcomes he expected. He intervened to encourage guards to be tougher, discouraged prisoners from leaving, and became so absorbed in his superintendent role that he lost scientific objectivity. This dual role makes it impossible to know whether observed behaviors reflected natural responses to the situation or responses to Zimbardo’s active manipulation.

The lack of control groups or comparison conditions also limits what can be concluded. Without comparing the mock prison to other institutional settings or to conditions where guards received different instructions, we can’t know which aspects of the situation produced the observed behaviors. The study essentially demonstrated that when researchers create a harsh, dehumanizing environment and coach guards toward cruelty, some cruelty results—but this tells us little about whether similar outcomes would occur without such researcher involvement.

Alternative Explanations: Social Identity and Leadership

Contemporary social psychologists Stephen Reicher and Alexander Haslam have proposed an alternative interpretation of what the Stanford Prison Experiment actually revealed. Rather than demonstrating that situations automatically produce evil behavior, they argue the study shows how leaders can mobilize followers around particular social identities and norms. The guards became cruel not simply because they were put in guard roles but because Zimbardo actively encouraged harsh treatment and because certain guards like Dave Eshelman modeled extreme behavior that others then followed.

Reicher and Haslam conducted their own prison study for the BBC in 2002 with crucial differences from Zimbardo’s experiment. In their study, guards were not coached toward harshness, and the researchers observed without intervening to shape outcomes. Under these conditions, guards did not spontaneously become brutal. Instead, they were reluctant to assert authority, felt uncomfortable with their power, and the prisoner group actually became more cohesive and powerful over time. Only when researchers introduced a dominant personality to the guard group did harsh behavior emerge—and even then, it remained limited.

This alternative study suggests that tyranny and cruelty don’t arise automatically from power imbalances or institutional settings. Rather, they require active leadership that mobilizes people around identities and norms permitting cruelty. The Stanford Prison Experiment’s outcomes may tell us less about universal human nature and more about how authority figures can manipulate social dynamics to produce desired outcomes—which is itself an important finding but a quite different one than Zimbardo’s interpretation suggested.

Social identity theory, developed by Henri Tajfel and John Turner, provides a framework for understanding these dynamics. People define themselves partly through group memberships, and these social identities shape how they perceive and treat others. Leaders can influence which identities become salient and what norms those identities entail. In the Stanford Prison Experiment, Zimbardo and certain dominant guards established norms of harsh treatment that became associated with the guard identity, and other guards conformed to these norms. But this wasn’t automatic—it required active shaping by authority figures.

This reconceptualization has important practical implications. If cruelty requires active leadership mobilizing people around harsh norms rather than arising automatically from situations, then preventing atrocities involves identifying and countering such mobilization efforts. It suggests that most people won’t spontaneously become cruel even in hierarchical institutions unless leaders actively encourage or model such behavior. This view attributes more agency to individuals and suggests that institutional abuse isn’t inevitable but rather depends on specific leadership practices that can potentially be identified and prevented.

Ethical Issues: Should the Study Have Been Conducted?

Even setting aside questions about what the experiment actually demonstrated, the Stanford Prison Experiment raises profound ethical concerns about research practices. By contemporary standards, the study clearly violated ethical principles that now govern human subjects research, though it’s worth noting that these principles were developed partly in response to problematic studies like Zimbardo’s.

The study caused genuine psychological distress to participants. Several prisoners experienced severe emotional breakdowns requiring early release. Even guards reported discomfort with their behavior during the study and expressed regret afterward. While participants were debriefed and followed up with, the question remains whether any research finding justifies deliberately creating conditions likely to produce such distress. The principle of “do no harm” in research ethics suggests that studies should minimize risks to participants, and the Stanford Prison Experiment violated this principle.

Informed consent was compromised because participants couldn’t genuinely know what they were consenting to. They were told they’d participate in a prison study, but they couldn’t have anticipated the intensity of the experience or their own reactions to it. The arrests, strip searches, and degrading procedures went beyond what participants could reasonably have expected based on the study description. True informed consent requires that participants understand what they’re agreeing to, and this wasn’t fully possible given the experiment’s nature.

Participants’ right to withdraw was effectively compromised despite being technically present. When prisoners asked to leave, Zimbardo often discouraged them, and the social dynamics of the experiment made it psychologically difficult to quit despite participants theoretically being free to do so. Guards who were uncomfortable with the harsh treatment similarly felt pressure to continue. The principle that research participation should be voluntary without coercion was violated through subtle social pressures that made leaving difficult.

The question of whether the study’s findings justified its costs must be considered in light of what we now know about those findings. If the experiment provided unique and important insights about human nature that couldn’t be obtained otherwise, one might argue the costs were justified. However, given the methodological problems and the availability of less harmful ways to study similar questions, this justification becomes harder to sustain. The study’s fame may stem more from its dramatic nature than from unique scientific contributions.

The Stanford Prison Experiment serves as a cautionary tale about how researchers’ own biases and desired outcomes can shape studies in problematic ways. Zimbardo’s deep involvement in creating the harsh conditions he then observed violated the objectivity that scientific research requires. His later use of the study to explain real-world atrocities while downplaying his own role in creating the experimental outcomes raises questions about researcher responsibility and the ethics of extrapolating from flawed studies to real-world events.

What We Can Actually Learn: Nuanced Lessons

Despite its limitations, the Stanford Prison Experiment offers valuable lessons when interpreted carefully. Rather than demonstrating that situations automatically produce evil, it shows how complex social dynamics involving leadership, social influence, identity, and institutional structures can facilitate cruel behavior under certain conditions. Understanding these mechanisms remains important even if the study’s original interpretation was overstated.

The experiment demonstrates how dehumanization processes can make cruelty easier. When people are stripped of individual identities, referred to by numbers rather than names, and subjected to degrading procedures, treating them harshly requires less psychological work. This insight applies beyond prisons to any institutional setting where bureaucratic processes risk treating people as cases or numbers rather than individuals. Maintaining humanization—using names, recognizing individuality, preserving dignity—serves as a crucial safeguard against abuse.

The study reveals how unchecked power can enable abuse even among people who wouldn’t normally behave cruelly. When authority figures face no accountability, when victims have no recourse, and when institutional cultures permit or ignore abusive behavior, conditions exist for mistreatment regardless of individual personalities. This suggests that preventing institutional abuse requires oversight, accountability mechanisms, and cultures that encourage reporting and addressing problems rather than covering them up.

The research highlights the importance of individual resistance and moral courage. While most participants went along with the emerging norms, some guards remained relatively humane, some prisoners resisted, and Christina Maslach objected strongly enough to end the experiment. These examples show that situational pressures don’t determine behavior completely and that individuals can resist even strong social influences. Understanding what enables such resistance—moral clarity, social support, willingness to accept social costs—matters as much as understanding what facilitates conformity.

The Stanford Prison Experiment also teaches about researchers’ ethical obligations and the dangers of becoming too invested in particular outcomes. Zimbardo’s transformation from objective observer to prison superintendent demonstrates how researchers can become complicit in harmful dynamics they’re supposed to be studying objectively. This lesson applies broadly to research ethics and the importance of maintaining appropriate boundaries and objectivity.

Perhaps most importantly, the study and the subsequent critiques demonstrate the importance of skepticism, replication, and ongoing critical examination of influential findings. For decades, the Stanford Prison Experiment was accepted as demonstrating fundamental truths about human nature, but careful scrutiny revealed serious methodological problems that change how we should interpret the findings. This underscores the tentative nature of scientific knowledge and the importance of subjecting even famous studies to continued critical examination.

FAQs About The Stanford Prison Experiment

Was the Stanford Prison Experiment scientifically valid, or has it been discredited?

The experiment has serious methodological problems that limit what can be concluded from it, though “discredited” may be too strong. The main issues include lack of control groups, researcher coaching of participants toward expected behaviors, Zimbardo’s dual role as both researcher and active participant shaping events, small sample size, and selection effects in who volunteered for a prison study. Recent investigations have revealed that guards received more specific instructions than originally acknowledged and that some participants were deliberately performing for researchers rather than spontaneously becoming cruel. These problems mean the study doesn’t demonstrate that situations automatically produce evil behavior in anyone placed in them. However, it still provides a case study of how certain conditions—dehumanization, unchecked power, researcher encouragement, social influence—can facilitate cruel behavior. The study should be understood as a demonstration of specific dynamics rather than as proof of universal human tendencies. Contemporary psychologists generally view it as historically important but methodologically flawed, requiring careful interpretation rather than acceptance at face value.

Did the experiment prove that anyone can become evil in the right situation?

No, this interpretation oversimplifies what happened and ignores the experiment’s methodological limitations. The study showed that some college students, when assigned guard roles in a mock prison, coached by researchers, and influenced by particularly dominant participants modeling harsh behavior, engaged in cruel treatment of prisoners over several days. This doesn’t prove that anyone in any situation would behave similarly. Important factors include the researcher coaching, the self-selected sample of people who volunteered for a prison study, and the active resistance shown by some participants who remained relatively humane or objected to the conditions. The study suggests that situational factors can influence behavior powerfully, but individual differences, moral values, and personal choice still matter. Many people have been in positions of power without becoming abusive, and even within the experiment, behavior varied considerably among guards. A more accurate conclusion is that certain situational factors can facilitate cruel behavior in some people under certain conditions, particularly when authority figures encourage or model such behavior, but this is quite different from claiming anyone would inevitably become evil.

How does the Stanford Prison Experiment relate to real-world atrocities like the Holocaust or Abu Ghraib?

Zimbardo and others have drawn parallels between the experiment and real-world abuses, arguing that similar situational factors—dehumanization of victims, hierarchical authority, diffusion of responsibility, gradual escalation—operate in both contexts. These parallels have some validity in identifying mechanisms that can facilitate large-scale cruelty. However, the comparison has important limitations. Real-world atrocities typically involve much more complex factors including ideology, propaganda, economic interests, historical grievances, and systematic training in cruelty that weren’t present in a six-day college experiment. The Holocaust involved years of deliberate dehumanizing propaganda, elaborate bureaucratic systems, and ideological indoctrination—not just putting ordinary people in guard roles and seeing what happens. Similarly, the Abu Ghraib abuses occurred within a complex military and political context involving explicit discussions of interrogation techniques, intelligence priorities, and chain-of-command issues beyond simple situational pressures. While the Stanford Prison Experiment may illustrate some mechanisms relevant to understanding real-world abuses, it cannot fully explain them and shouldn’t be used to minimize the role of ideology, hatred, explicit policy decisions, and individual choice in atrocities.

What happened to the participants after the experiment ended?

Zimbardo conducted debriefing sessions with all participants immediately after the experiment ended and followed up with them over subsequent months and years. According to Zimbardo’s accounts, no participants showed long-lasting psychological harm from the experience. Many reported that participating had been educational and had given them insights into institutional dynamics and their own capacity for behavior they hadn’t expected from themselves. Several guards expressed discomfort and regret about how they’d behaved. Some prisoners reported that the experience had been genuinely distressing during the experiment but that they recovered after it ended. Critics have questioned whether follow-ups were extensive enough to identify potential long-term effects and whether participants might have been reluctant to report problems to the researcher who created the difficult situation. Some participants have given interviews over the years, with varying perspectives on the experience—some viewing it as valuable and others expressing criticism of how it was conducted. The lack of rigorous long-term follow-up studies means we can’t be certain about long-term impacts, though there’s no strong evidence of serious lasting harm.

Should the Stanford Prison Experiment still be taught in psychology classes?

Yes, but with critical analysis rather than uncritical acceptance of Zimbardo’s original interpretation. The experiment remains historically important as an influential study that shaped thinking about situational influences on behavior, and students should understand why it became famous. However, it should be taught alongside the methodological critiques, the alternative interpretations from researchers like Reicher and Haslam, and discussion of what the study actually demonstrated versus what it’s often claimed to demonstrate. Teaching the experiment provides opportunities to discuss research ethics, the importance of methodology in determining what can be concluded from studies, how researcher biases can shape outcomes, and how scientific understanding evolves through critical examination of even famous findings. The study serves as a valuable case study in critical thinking about research and the difference between a dramatic demonstration and rigorous science. Rather than presenting it as definitive proof about human nature, instructors should present it as a flawed but historically important study that raises important questions about power, institutions, and behavior while also demonstrating the importance of methodological rigor and ethical research practices.

By citing this article, you acknowledge the original source and allow readers to access the full content.

PsychologyFor. (2025). The Stanford Prison Experiment: The Concept of “Evil”. https://psychologyfor.com/the-stanford-prison-experiment-the-concept-of-evil/