What is the free energy principle in neuroscience, in plain language?

Key takeaways

  • The free energy principle states that the brain acts as a prediction engine, constantly working to minimize surprise and uncertainty to keep the organism alive and safe from entropy.
  • Organisms minimize surprise through active inference, either by updating their internal beliefs to match sensory data or taking physical action to change the world to match their predictions.
  • The brain relies on precision weighting to balance internal expectations against sensory data. Imbalances in this system can mechanically explain psychiatric conditions like schizophrenia and autism.
  • Studies with lab-grown neurons learning to play Pong show biological networks naturally reorganize to avoid unpredictable stimuli, offering a new framework for artificial intelligence.
  • While highly influential, critics argue the principle is so broad it borders on being a mathematical tautology rather than a falsifiable scientific theory.
The free energy principle fundamentally redefines the brain as a proactive prediction machine rather than a passive computer. By continuously generating top-down expectations about the world, the brain attempts to minimize surprise and resist the natural pull of entropy. When expectations fail, the brain either updates its internal models or physically acts upon the environment to make its predictions come true. Ultimately, this framework provides a unifying mathematical explanation for how biological life survives, and it is actively reshaping our approach to artificial intelligence.

What Is the Free Energy Principle in Neuroscience

The free energy principle is a unifying theory in neuroscience suggesting that the brain is fundamentally a prediction engine driven to minimize "surprise." Rather than passively processing sensory information, the brain constantly generates predictions about the world and acts to make those predictions come true, thereby keeping the organism in safe, survivable conditions.

The Physics of Life and the Problem of Existing

To understand how the brain works, we first have to ask a much more fundamental question: how do living things manage to stay alive? In the physical universe, everything is governed by the second law of thermodynamics. This law dictates that closed systems naturally move toward entropy, which is a state of disorder, randomness, and equilibrium. If you leave a biological organism to its own devices without an active drive to survive, it will inevitably decay, disintegrate, and merge with its environment. As neuroscientist Anil Seth has noted, without an active process to maintain our boundaries, we would simply "dissolve into mush on the floor" 1.

The question of how organisms resist this cosmic pull toward disorder is the foundation of the free energy principle. Developed largely by Karl Friston - a renowned psychiatrist, neuroscientist, and the pioneer of modern neuroimaging - the free energy principle attempts to explain life, perception, and action through a single mathematical imperative 23.

Biological systems are open systems that are in continuous exchange with their environments. However, to remain alive, they must occupy a very limited set of physical and chemical states. A human being, for instance, must maintain a core body temperature near 98.6 degrees Fahrenheit, specific blood oxygen levels, and precise hydration parameters. Out of all the possible atomic configurations a human body could take, only a microscopically small fraction of them are compatible with life 1. Therefore, the fundamental imperative of any living thing is to avoid surprising, unexpected states (like hypothermia or starvation) and remain within its narrow window of expected, survivable states 14.

The Concept of the Markov Blanket

For a living system to exist and resist entropy, it must maintain a physical and statistical boundary that separates it from the external world 1. In the mathematical framework of the free energy principle, this boundary is formalized using a concept from probability theory known as a "Markov blanket" 455.

A Markov blanket acts as a statistical firewall. It defines a system by separating the universe into two distinct domains: internal states (the inside of the organism, the cell, or the brain) and external states (the outside environment) 57. Crucially, the internal and external states do not interact directly. The outside world cannot instantly rewrite your internal biology, and your internal biology cannot instantly alter the physical world. They only interact through the "blanket" itself, which consists of two types of states:

State Type Role in the System Biological Example
Sensory States How the external world influences the internal system. The "input" layer. Photoreceptors in the retina detecting light; chemical receptors on a cell membrane 6.
Active States How the internal system influences the external world. The "output" layer. Muscles contracting to move a limb; a single-celled organism moving its flagellum 76.
Internal States The private workings of the system, insulated from the outside world. The firing of internal neurons; intracellular metabolic processes 57.
External States The hidden causes in the environment that the system cannot directly access. A physical predator; a sudden drop in ambient temperature 57.

For an organism to survive, its internal states must "track" or model the external states, but it can only do so by interpreting the activity on its Markov blanket 5. It must minimize its uncertainty about the environment to ensure it remains within its safe boundaries 77.

Defining "Free Energy" in Information Theory

When physicists talk about "free energy" (such as Gibbs or Helmholtz free energy), they are usually referring to the amount of energy in a thermodynamic system that is available to do physical work. However, in the context of Friston's principle, "variational free energy" is a concept borrowed from information theory and statistical physics 58.

In this mathematical framework, variational free energy is essentially a proxy for "surprise" or "uncertainty" 711. Because a biological system is trapped behind its Markov blanket, it cannot directly measure all the hidden complexities of the universe. Consequently, it cannot directly calculate the absolute probability of its own survival or its true "surprise." Instead, it computes and minimizes variational free energy, which serves as a mathematically tractable upper bound - or a safe ceiling - on surprise 49.

Because entropy is the average of surprise over time, continuously keeping this free energy as low as possible ensures that the organism restricts its exposure to unexpected, chaotic, and potentially fatal environmental states 48. Minimizing free energy is therefore mathematically equivalent to resisting entropy 7.

The Brain as a Prediction Machine

For decades, the dominant metaphor in cognitive science and traditional neuroscience was the brain as a computer processor 13. In this classical "bottom-up" model, perception is viewed as a passive event. Light enters the eyes, travels up the optic nerve as raw data, gets processed by the visual cortex, and eventually, the brain pieces these signals together to form a coherent picture of the world, subsequently deciding how to act 10.

The free energy principle completely upends this model. Building on ideas first proposed by the 19th-century physicist Hermann von Helmholtz - who suggested that perception is a process of "unconscious inference" - the modern framework suggests that the brain is not a passive receiver of data, but a proactive "prediction machine" 410111217.

The Mechanics of Predictive Coding

Under the umbrella of the free energy principle lies the neurobiological mechanism of predictive coding 1213. To understand why predictive coding is necessary, consider the brain's physical predicament: it is locked inside a dark, silent skull. It has no direct access to the outside world; all it receives are ambiguous, noisy electrical impulses arriving from the sensory organs 812.

To make sense of these meaningless electrical spikes, the brain relies on an internal "generative model" - a complex, hierarchical web of prior beliefs and historical experiences that simulate how the world works 817. Instead of waiting for sensory data to arrive and then calculating what it means from scratch, the brain actively anticipates the data. It constantly generates top-down predictions about what it expects to hear, see, or feel in the next fraction of a second 1017.

When these predictions cascade down through the neural hierarchy, they meet the incoming sensory data flowing from the bottom up. Two things can happen: * When predictions match reality: The sensory data is effectively ignored or "explained away" by the brain. The brain saves massive amounts of metabolic energy because it doesn't need to deeply process information it already perfectly anticipated 1217. * When predictions fail: The gap between the brain's expectation and the actual sensory input generates a "prediction error" 1713.

This prediction error is the exact biological manifestation of variational free energy 8. Only the prediction errors - the news, the surprises - are allowed to flow back up the neural hierarchy. The brain then uses these errors to refine and update its internal models, ensuring its future predictions will be more accurate, thereby suppressing future prediction errors 1713.

Perception as a "Controlled Hallucination"

Because our experience of reality is driven more by our brain's top-down predictions than by the bottom-up data of the world, researchers like Anil Seth have provocatively referred to waking perception as a "controlled hallucination" 1314.

Under this framework, we do not see the world as it objectively is. We are always hallucinating our reality based on our brain's best guesses 14. When those hallucinations are continuously tethered to reality and kept in check by sensory prediction errors, we call it normal perception 14. This explains a wealth of psychological phenomena. For example, if you hear someone begin the sentence "The grass is...", your brain automatically pre-activates the auditory and linguistic neural networks for the word "green" before the speaker even finishes the sentence 10. The brain builds reality from the inside out just as much as from the outside in 14.

Active Inference: Changing the Mind vs. Changing the World

While the free energy principle provides the overarching mathematical law (that systems must minimize surprise), the process by which biological systems actually achieve this is called active inference 511.

If the brain's sole purpose is to minimize prediction errors, there are theoretically two distinct ways it can do this. Active inference posits that humans and animals utilize both strategies constantly and seamlessly 7815.

Research chart 1

  1. Perception (Changing your mind): When confronted with a prediction error, the brain can update its internal generative model to reflect the new sensory reality. By changing its beliefs to fit the world, the brain ensures that the next time it encounters this situation, its predictions will be accurate, and surprise will be minimized 715.
  2. Action (Changing the world): Alternatively, the brain can rigidly hold onto its internal prediction, and instead move the body to physically change the environment (or its relationship to the environment) so that the incoming sensory data matches the prediction 5715.

Consider a basic survival scenario: an organism steps outside into the snow without a coat. Its internal, evolutionary model strongly predicts a core body temperature of 98.6°F. The freezing air causes sensory input that wildly contradicts this expectation, generating a massive prediction error (a dangerous spike in free energy) 713.

If the organism only used the perceptual strategy, it would simply update its internal model to believe, "I guess I am now a freezing creature," and it would rapidly die of hypothermia. Instead, through active inference, the organism uses action. It moves its muscles to put on a coat, build a fire, or walk back inside, thereby altering the incoming sensory data until it aligns once again with the brain's unyielding prediction of warmth 7.

Action as a Self-Fulfilling Prophecy

From the perspective of active inference, even basic motor control is driven by predictions. When you want to raise your arm to catch a ball, your brain does not act like a computer calculating a complex chain of kinetic motor commands and joint angles 13. Instead, your brain firmly and stubbornly predicts that your arm is already raised 7.

The discrepancy between the proprioceptive reality (your arm is currently resting at your side) and the brain's firm prediction (your arm is raised) creates an immediate, localized prediction error in the motor cortex. The fastest, most metabolically efficient way for the nervous system to resolve this error is for the spinal reflexes to simply move the arm to the predicted position, thereby wiping out the error 57. Action, in this view, is simply the body moving to fulfill a neurological prophecy.

This framework applies universally, regardless of biological complexity. Whether it is a single-celled bacterium swimming up a glucose gradient to maintain its metabolic expectations, a housecat seeking a patch of sunlight, or a human being navigating complex social relationships to avoid social surprise, all are engaging in active inference to minimize free energy 711.

Passive Processing vs. Active Inference

To grasp how radical this shift is, it is helpful to compare the traditional view of neuroscience with the new paradigm of active inference.

Conceptual Feature Passive Perception (Classical Neuroscience) Active Inference (Free Energy Principle)
Direction of Information Flow Primarily Bottom-up (Environment $\rightarrow$ Senses $\rightarrow$ Brain) 10 Primarily Top-down (Brain $\rightarrow$ Senses $\rightarrow$ Environment) 1017
Core Function of the Brain An input-processing computer mapping data to internal representations 13. A statistical inference engine actively projecting hypotheses onto the world 513.
Role of Sensory Data To build a faithful picture of external reality from scratch 10. To calibrate and correct the brain's internal predictions via prediction errors 14.
Mechanism of Action Computed and executed only after perception is fully processed 10. Simultaneous with perception; used to actively sample the environment to fulfill expectations 8.
Ultimate Biological Goal To accurately map the objective external world 1314. To minimize "surprise" (variational free energy) and maintain physiological boundaries 45.

The Bayesian Brain and Precision Weighting

Active inference is deeply rooted in a mathematical framework known as Bayesian inference 89. Named after the 18th-century statistician Thomas Bayes, Bayesian inference describes the optimal mathematical way to update a probability estimate when new, uncertain evidence is introduced 2.

In a Bayesian framework, an entity starts with a "prior" - an initial belief or expectation about the state of the world 214. When new evidence arrives (sensory data), the entity must weigh the reliability of its prior against the reliability of the new evidence. If the new evidence is highly reliable, the entity significantly updates its belief, forming a new "posterior" belief 2.

The free energy principle essentially argues that the brain evolved to be an approximate Bayesian inference machine 4811. However, calculating exact Bayesian probabilities for every single variable in a complex, chaotic world is mathematically intractable - it would require more computing power and time than any biological brain possesses, leading to a paralysis of analysis that would be fatal in the wild 9.

To solve this, the brain uses "variational" inference. Instead of calculating exact probabilities across an infinite hypothesis space, the brain transforms complex probability problems into simpler optimization problems 89. By minimizing variational free energy, the brain is effectively taking a mathematical shortcut, approximating ideal Bayesian inference without needing infinite computational power 89.

The Volume Dial of Attention

If the brain is constantly comparing top-down predictions with bottom-up sensory errors, how does it know which one to trust? If your prior belief says one thing, but your eyes say another, which wins? This is where the concept of "precision" comes into play 12.

The brain must constantly estimate the reliability, or precision, of both its predictions and its sensory inputs 1214. If you are walking through a thick, dark forest at night, the visual data you receive is noisy and unreliable (low precision). In this case, your brain turns down the volume on the sensory data and relies more heavily on its internal priors - which is exactly why you might mistake a shadowy tree stump for a crouching bear. Your brain is filling in the blanks based on its survival priors.

Conversely, in bright daylight, sensory data is highly precise, so the brain turns up the volume on bottom-up prediction errors, allowing them to overwrite internal models easily 12.

In the free energy framework, "attention" is not a magical spotlight shining on objects in the world. Instead, attention is simply the brain increasing the "precision weighting" of specific sensory channels 512. Paying attention to a specific sound in a crowded room simply means your brain has decided the auditory prediction errors arriving from that specific source are currently highly reliable and should be heavily weighted in updating your internal model of the world 12.

When the Machine Misfires: Implications for Psychiatry

Because Karl Friston's background is in psychiatry, one of the most compelling applications of the free energy principle is its ability to explain mental health conditions and neurological disorders. If healthy brain function is a delicate, dynamic balance of minimizing prediction error through well-weighted priors and sensory data, then psychopathology can be viewed as a mechanical breakdown in this predictive machinery 1316.

Hallucinations and Schizophrenia

Under the active inference framework, a hallucination is not seen as a mysterious or supernatural break from reality, but as a severe imbalance in precision weighting 1416.

As established, all normal perception is a "controlled hallucination" constrained by sensory data 14. However, if the brain systematically overweights the precision of its own top-down prior beliefs - and simultaneously underweights the bottom-up sensory prediction errors coming from the eyes and ears - it will effectively ignore the outside world. The brain's internal generative model runs unchecked and uncorrected. An individual will begin to hear voices or see visions because their brain is prioritizing its internal predictions over the physical evidence 1416. To the brain, the hallucinations are entirely real; they are simply uncorrected by the environment.

Autism and Sensory Overload

Conversely, predictive coding offers a compelling mechanical explanation for the sensory processing differences often seen in autism spectrum disorder. If a brain consistently overweights the precision of bottom-up sensory prediction errors (or underweights the reliability of its top-down priors), it will struggle to "explain away" normal, everyday stimuli.

In a neurotypical brain, the background hum of a refrigerator is quickly predicted, explained away, and subsequently ignored. If a brain places too much weight on sensory errors, every sound, flickering light, and scratchy fabric feels brand new, unpredictable, and highly surprising. This inability to use prior beliefs to dampen sensory input can lead to chronic sensory overload. It also elegantly explains why many individuals on the autism spectrum prefer highly structured, repetitive, and predictable environments - these environments naturally minimize free energy and provide relief from a chaotic world 13.

Expected Free Energy: Anxiety and Depression

While variational free energy deals with resolving immediate surprise in the present moment, biological organisms also have to plan for the future. To do this, they compute "expected free energy" 822. An organism must choose actions that minimize the surprise it expects to encounter later 811.

Chronic anxiety can be interpreted through this lens as a generative model that constantly predicts high future uncertainty and disastrous outcomes. The brain is trapped in a state of hyper-vigilance as it struggles to map a behavioral policy that leads to safety.

Depression might be viewed as an overly rigid internal model where the brain firmly predicts that no action it takes will successfully minimize expected free energy. If the brain believes that all environments are hostile and all actions are futile, the most mathematically optimal response is lethargy - shutting down energy expenditure because the system expects all motor actions to fail 713.

Recent Empirical Evidence in Neuroscience

For years, the free energy principle was criticized as being purely theoretical - a beautiful, elegant mathematical formalism imported from physics that was too abstract to prove in wet, messy biology 917. Critics argued that the theory was unfalsifiable 924. However, recent advances in cellular neuroscience and bioengineering have begun to establish its predictive validity in undeniable ways.

In a landmark 2023 study published in major scientific journals, researchers provided empirical evidence for the free energy principle using biological neurons in a petri dish (often referred to in the media as the "DishBrain" experiment) 17. The researchers took in vitro networks of rat cortical neurons and hooked them up to a computer simulation, effectively teaching the brain cells to play a simplified version of the vintage video game Pong.

Crucially, the researchers did not use traditional "rewards" (like a simulated dopamine chemical drip) to train the cells, which is how traditional artificial intelligence is trained. Instead, they relied purely on the mechanics of the free energy principle 817. * When the neural network successfully moved the digital paddle and hit the ball, the researchers sent a predictable, highly structured electrical stimulus to the cells. * When the network missed the ball, the researchers delivered a random, chaotic, and highly unpredictable electrical stimulus (white noise) 8.

According to the free energy principle, biological systems inherently despise surprise and unpredictability, as unpredictability equates to high free energy 5. True to the theory, the neurons rapidly reorganized their synaptic connections to get better at hitting the ball, simply because hitting the ball was the only way to ensure their environment remained predictable 17. The study successfully proved that minimizing variational free energy can quantitatively predict the self-organization, learning, and plasticity of biological neural networks, even without a body or a survival instinct 817.

From Biology to Artificial Intelligence

The implications of the free energy principle extend far beyond the wetware of biology. As artificial intelligence researchers search for the elusive path toward Artificial General Intelligence (AGI), many are turning to Karl Friston's theories as a powerful, biomimetic alternative to current machine learning paradigms 322.

Today's dominant AI systems, such as Large Language Models (LLMs) like ChatGPT, rely on Deep Learning and backpropagation 22. These models are trained on massive datasets using brute-force computational power. They are passive processors; they do not have an active drive to minimize uncertainty, nor do they possess a true "world model" that they test against reality through physical action 2218. Furthermore, modern AI demands monstrous, highly unsustainable amounts of energy to train and operate 22.

Deep Active Inference vs. Reinforcement Learning

In the realm of autonomous agents, self-driving cars, and robotics, the current industry standard is Reinforcement Learning (RL). In RL, an AI is programmed to maximize a specific, human-engineered external reward (like maximizing points in a video game or reaching a destination) 2426. The core problem with RL is that the reward function must be manually created, and the agents are incredibly brittle - they often struggle to adapt if the environment changes slightly 2619.

Active Inference offers a fundamentally different architecture for artificial intelligence 318. Instead of seeking a human-defined reward, an active inference agent is driven entirely by a single, intrinsic objective: to minimize expected free energy 26.

Because expected free energy mathematically decomposes into two parts - an imperative to achieve a preferred state (exploitation) and an imperative to resolve uncertainty (exploration) - active inference agents naturally exhibit curiosity 826. If an active inference robot is dropped into an unknown environment, it will intrinsically actively explore to reduce its uncertainty about the world, without needing a human to specifically program an "exploration bonus" 28.

Proponents argue that building AI based on active inference and the free energy principle will lead to agents that are more adaptable, more autonomous, and far more energy-efficient 2218. By mimicking the principle of "least action" found in natural biological intelligence, future AI could learn continuous causal structures from their environments using a fraction of the data and electricity required by today's deep learning models, bridging the gap between artificial and natural intelligence 222.

Critiques, Controversies, and the Limits of the Theory

Despite its sweeping explanatory power, the free energy principle is not without its detractors. Because the theory attempts to explain everything from the folding of microscopic proteins to the structure of human societies, it is frequently criticized for being so broad that it borders on being unfalsifiable 7924.

Is it a Scientific Theory or a Mathematical Truism?

Some physicists and computational neuroscientists argue that the free energy principle is less a biological theory and more a mathematical truism 59. In a sense, saying "living things minimize free energy" is tautological; it is somewhat like saying "living things continue to exist." Since free energy is a measure of the likelihood of a state, any system that remains structurally intact over time is, by mathematical definition, minimizing its free energy 59.

Critics argue that because it is a foundational mathematical principle - akin to calculus - it cannot be "falsified" in the traditional scientific sense. Consequently, it struggles to make specific, testable predictions about how particular neural circuits in the brain work without bolting on highly specific, auxiliary assumptions 924. As neuroscientist Samuel Gershman has pointed out, while the mathematical soundness of the principle is unobjectionable, its utility as a scientific theory depends entirely on how it is practically implemented 9. In many cases, when applied practically to models of cognition, active inference requires strict computational constraints that make it almost indistinguishable from pre-existing theories of Bayesian inference 9.

The Panpsychism Problem

Another major critique arises from the principle's scale-free nature. The mathematics of Markov blankets and free energy minimization theoretically apply to any self-organizing system 7. Karl Friston himself has noted that a single-celled paramecium, a plant, a human brain, and even a sprawling economic system can all be described as minimizing free energy 711.

Some evolutionary biologists and philosophers argue this stretches the definition of "cognition" and "inference" to the breaking point. If a drop of oil suspended in water maintains its boundary by minimizing thermodynamic free energy, is it performing "inference"? Is a rock minimizing surprise? While supporters argue this provides a beautiful, seamless continuum from physics to biology to cognition, skeptics view it as a category mistake, applying psychological and neuroscientific terms ("belief," "surprise," "inference") inappropriately to simple physical phenomena 720.

Nevertheless, despite these philosophical debates, the framework continues to gain immense traction across multiple disciplines precisely because of its ambitious ability to bridge the gap between physics and biology, providing a rigorous mathematical language to describe the behavior of all adaptive systems 3.

Bottom line

The free energy principle suggests that the brain is not a passive computer, but a proactive statistical prediction engine hardwired to minimize surprise and resist the disorder of the natural world. By actively constructing an internal model of reality and either updating its beliefs (perception) or altering its physical environment (action), the brain ensures an organism stays within the narrow biological boundaries required for survival. While critics debate its falsifiability and its sweeping, scale-free nature, recent biological experiments and advancements in artificial intelligence suggest this framework offers one of the most profound, unified explanations for how life, cognition, and intelligent behavior emerge from the fundamental laws of physics.

About this research

This article was produced using AI-assisted research using mmresearch.app and reviewed by human. (NobleMarten_34)