The Marshmallow Test: Delayed Gratification, Self-Control, and the Problem with Willpower
Few psychology studies have been turned into a morality tale as efficiently as the marshmallow test.
A child sits in front of a treat. Eat it now and get one. Wait, and get two. From this, the popular version somehow built an entire sermon about willpower, success, and the kind of child who apparently grows up to have clean spreadsheets, stable relationships, and a pension plan.
The real research is more interesting and rather less smug.
Walter Mischel’s delayed gratification studies were not simply about whether children could resist a marshmallow. They were about attention, temptation, strategy, trust, context, and how people manage conflict between what they want now and what might benefit them later. The problem is that the public version of the marshmallow test became too neat. It turned a complex developmental task into a tiny edible personality test.
The later evidence makes that story harder to swallow. Self-control matters, but the ability to wait is not just a private act of willpower. It is also shaped by environment, experience, social class, cognitive development, and whether a child has learned that adults usually keep their promises.
In other words, the marshmallow test did not simply reveal who had discipline. It also revealed who had reason to believe waiting would pay off.
What was the marshmallow test?
The marshmallow test refers to a set of delay of gratification studies conducted by Walter Mischel and colleagues at Stanford University in the late 1960s and early 1970s.
The basic design was simple. A young child was placed in a room with a desirable treat, often described as a marshmallow, although the studies used other rewards too. The child was told they could eat the treat immediately, or they could wait until the researcher returned and receive a larger reward.
This is the part everyone remembers: one treat now, two treats later.
The task was designed to study delay of gratification, which means resisting an immediate reward in order to gain a better future reward. That sounds simple until you remember the participants were preschool children being asked to sit alone with something delicious and behave like tiny accountants.
Some children ate the treat quickly. Some tried to wait. Some looked away, covered their eyes, sang, fidgeted, talked to themselves, or turned the whole situation into a private endurance sport. These behaviours became important because they showed that delay was not just about wanting the second treat more. It was about how children managed attention and temptation.
That was one of Mischel’s key insights. Self-control was not simply a fixed trait sitting inside the child like a moral battery. It involved strategies.
What the original research seemed to show
The marshmallow test became famous partly because of later follow-up studies.
Mischel and colleagues reported that children who had delayed gratification for longer in the original studies tended to show better outcomes later in life. These included higher academic performance, better coping, greater social competence, and stronger emotional regulation.
The most widely cited follow-up was published in 1989 by Mischel, Shoda, and Rodriguez. It linked preschool delay of gratification to later adolescent outcomes, helping establish the test as one of psychology’s most famous examples of early self-control predicting future success.
This was a powerful idea. It was also very easy to oversell.
The public version became something like this: children who can resist the marshmallow become successful; children who cannot are doomed to a life of poor impulse control and possibly suspicious snack management.
That was never a fair reading of the research. But it was a very marketable one.
Parents, educators, journalists, policymakers, and self-help writers loved the marshmallow test because it seemed to offer a simple lesson. Teach children self-control and they will do better later. Delay gratification. Think long term. Resist temptation. Become the kind of person who owns matching storage containers.
There is some truth in that. Self-regulation is important. The trouble is that the story made it sound too individual, too clean, and too detached from the child’s actual world.
The hot and cool systems of self-control
One of the most useful parts of Mischel’s work is the hot/cool system model, developed with Janet Metcalfe.
The hot system is fast, emotional, impulsive, and reward-focused. It sees the marshmallow and says, quite reasonably, “There it is. Eat it.”
The cool system is slower, more reflective, and more cognitive. It helps a person think about future goals, use strategies, and manage immediate impulses.
Successful delay often involves cooling down the temptation. A child might look away from the treat, imagine it as a picture rather than food, distract themselves, sing a song, or focus on something else entirely. The aim is not to win a heroic internal battle through sheer moral force. The aim is to make the battle less interesting.
This is still one of the most useful lessons from the marshmallow research. Self-control often works best when people change the situation, change their attention, or change how they think about the temptation.
Adults do this too. We put biscuits out of sight, block distracting apps, avoid certain shops, set automatic savings, prepare workspaces, and make decisions in advance because we know perfectly well that our future selves cannot always be trusted. This is not weakness. This is self-knowledge with furniture.
How the marshmallow test became a willpower myth
The marshmallow test became popular because it seemed to offer a tidy moral: successful people delay gratification.
That sounds plausible. It also fits comfortably with a culture that likes to explain outcomes through personal discipline. If someone succeeds, they must have waited for the second marshmallow. If someone struggles, perhaps they lacked self-control.
Convenient. Also suspicious.
The problem with this interpretation is that it treats waiting as if it happens in a vacuum. It imagines the child and the marshmallow, but not the family, the classroom, the neighbourhood, the household income, the reliability of adults, the child’s previous experiences, or the basic question of whether waiting has usually been rewarded in that child’s life.
For some children, waiting makes sense. If adults usually keep promises, food is secure, routines are stable, and rewards reliably arrive, then delaying gratification may be a good strategy.
For other children, waiting may be less rational. If promises are often broken, resources are scarce, or adults are inconsistent, taking the available reward now may not be impulsive at all. It may be sensible.
This is where the simple willpower story starts to wobble.
A child who eats the marshmallow quickly may not be revealing a defective future. They may be revealing a perfectly reasonable expectation: things offered later do not always arrive.
The 2018 replication problem
In 2018, Tyler Watts, Greg Duncan, and Haonan Quan published a major conceptual replication of the marshmallow test using a larger and more diverse sample than the original Stanford work.
Their findings complicated the famous story.
Delay of gratification was still associated with later achievement, but the relationship was much smaller than earlier accounts suggested. Once the researchers controlled for factors such as family background, socioeconomic status, and early cognitive ability, the predictive power of waiting was substantially reduced.
That does not mean the original research was worthless. It means the popular interpretation was too confident.
The test was not measuring pure willpower. It was capturing a mixture of self-control, cognitive development, home environment, social class, trust, and opportunity. Which is less catchy, admittedly, but rather more psychologically honest.
This matters because self-control stories can easily become blame stories. If we say children succeed because they learn to wait, we may quietly imply that those who struggle simply failed to discipline themselves early enough. The replication work pushes against that. It reminds us that behaviour develops inside contexts, not inside motivational posters.
Trust, reliability, and the second marshmallow
Another important challenge came from research by Celeste Kidd, Holly Palmeri, and Richard Aslin.
Their study asked a beautifully awkward question: what if children’s waiting depends partly on whether they trust the adult?
Before the marshmallow-style task, children experienced either a reliable or unreliable adult. In the reliable condition, the adult kept a promise. In the unreliable condition, the adult failed to deliver what they had promised. Later, when children were asked to wait for a better reward, those who had experienced a reliable adult waited much longer.
This changes the meaning of the task.
Waiting is not only a sign of self-control. It can also be a judgement about the world. If the adult has already shown that promises are flimsy, why wait? Take the treat. The system has spoken, and it is not especially trustworthy.
This finding is one of the best antidotes to the lazy marshmallow myth. Children are not just impulse machines. They are also making predictions. They may not phrase it as “environmental reliability moderates my delay behaviour,” because they are four, but the logic is there.
The second marshmallow only works as a reward if the child believes it will actually arrive.
What later adult follow-ups found
The marshmallow test has also been revisited in adulthood.
A 2024 study examined whether early delay of gratification reliably predicted adult outcomes. The answer was not especially flattering to the simple version of the story. The researchers found that marshmallow test performance did not reliably predict adult functioning across a broad range of outcomes.
Again, this does not make self-control irrelevant. It means the original task should not be treated as a crystal ball.
Human lives are too complicated for that. Adult outcomes are shaped by education, income, family, health, opportunity, discrimination, personality, relationships, culture, timing, luck, and a depressing number of admin tasks. A preschool snack decision was never going to carry all of that on its tiny sugary back.
The more cautious conclusion is that delay of gratification can tell us something, but not everything. It may be one small behavioural clue inside a much larger developmental picture.
What the marshmallow test still teaches us
The marshmallow test is still useful if we stop asking it to do too much.
It teaches us that self-control is strategic. Children who waited successfully often used attention and distraction. They did not simply stare at temptation and radiate virtue.
It teaches us that environment matters. A child’s willingness to wait depends partly on whether waiting has been a good bet before.
It teaches us that self-regulation can be supported. People can learn strategies that make delay easier, especially when their surroundings are structured in ways that help rather than constantly sabotage them.
It teaches us that early behaviour may relate to later outcomes, but those links are not simple, fixed, or independent of social context.
Most importantly, it teaches us to be suspicious of psychological stories that are too morally satisfying.
The idea that successful people are simply better at waiting is attractive because it is tidy. It turns inequality into temperament and life chances into snack discipline. But psychology is rarely that neat, and childhood is certainly not.
Practical lessons without the smugness
There are still useful lessons to take from the marshmallow research.
For education, it suggests that children can benefit from learning self-regulation strategies. This includes distraction, planning, emotional regulation, goal-setting, and changing how they think about temptation.
For parenting, it suggests that consistency matters. Children are more likely to wait when adults are reliable, routines are stable, and promises mean something.
For therapy and clinical work, it supports the idea that impulse control is not simply a matter of trying harder. People often need strategies, environmental changes, emotional support, and realistic alternatives.
For adults, it offers a mildly irritating but useful reminder: do not build your life around heroic self-control if you can build a system instead. Move the temptation. Reduce the friction. Make the better choice easier. Do not rely on your 9 p.m. self to make the decision your 9 a.m. self wrote down with such touching optimism.
The best lesson from the marshmallow test is not “be stronger.” It is “understand the situation well enough to stop making strength your only plan.”
Simply Put
The marshmallow test became famous because it seemed to show that children who could delay gratification were more likely to succeed later in life.
The truth is more complicated.
Mischel’s research did show that self-control, attention, and delay strategies are important. Children who waited were not just being virtuous. They were often using clever ways to manage temptation.
But later research showed that waiting is not just about willpower. It is also about trust, family background, socioeconomic conditions, cognitive development, and whether a child has learned that delayed rewards usually arrive.
So the marshmallow test is still valuable, but not as a tiny edible destiny machine.
It tells us something about self-regulation. It tells us something about context. It tells us something about how easily psychology can become a morality tale when people prefer simple lessons to messy explanations.
The child who waits is not necessarily a future success story. The child who eats the marshmallow is not necessarily doomed. Sometimes they are just responding to the world they know.
And honestly, depending on the world, that may be the more rational choice.
References
Casey, B. J., Somerville, L. H., Gotlib, I. H., Ayduk, O., Franklin, N. T., Askren, M. K., Jonides, J., Berman, M. G., Wilson, N. L., Teslovich, T., Glover, G., Zayas, V., Mischel, W., & Shoda, Y. (2011). Behavioral and neural correlates of delay of gratification 40 years later. Proceedings of the National Academy of Sciences, 108(36), 14998–15003. https://doi.org/10.1073/pnas.1108561108
Mischel, W., Ebbesen, E. B., & Zeiss, A. R. (1972). Cognitive and attentional mechanisms in delay of gratification. Journal of Personality and Social Psychology, 21(2), 204–218. https://doi.org/10.1037/h0032198
Mischel, W., Shoda, Y., & Rodriguez, M. L. (1989). Delay of gratification in children. Science, 244(4907), 933–938. https://doi.org/10.1126/science.2658056
Watts, T. W., Duncan, G. J., & Quan, H. (2018). Revisiting the marshmallow test: A conceptual replication investigating links between early delay of gratification and later outcomes. Psychological Science, 29(7), 1159–1177. https://doi.org/10.1177/0956797618761661