
<(From Left) Professor Sang Wan Lee, Ph.D candidate Yoondo Sung, (Upper Left) Dr. Mattia Rigotti>
Humans possess a remarkable balance between stability and flexibility, enabling them to quickly establish new plans and adjust goals even in the face of sudden changes. However, "Model-Free reinforcement learning," which is widely used in robotics and exemplified by AlphaGo’s famous match against Lee Sedol, struggle to achieve these two capabilities simultaneously. KAIST's research team has discovered that the secret lies in the unique information processing method within the prefrontal cortex, a principle that could serve as the foundation for developing "Brain-like AI” that is both flexible and stable.
KAIST announced on December 14th that a research team led by Professor Sang Wan Lee from the Department of Brain and Cognitive Sciences, in collaboration with IBM AI Research, has deciphered how the human brain manages goal changes in uncertain situations, suggesting a new direction for next-generation reinforcement learning.
The research team highlighted a critical limitation of current reinforcement learning models: they lose the balance between flexibility for goals pursuit and stability in uncertain environments. Humans, however, achieve both simultaneously. The team hypothesized that this difference arises from how the prefrontal cortex represents information.
Using functional MRI (fMRI) experiments, reinforcement learning models, and advanced AI analyses, the team revealed that the human prefrontal cortex has a unique embedding structure that represents "goal information" and "uncertainty information" separately to prevent interference. Individuals with more distinct separation between these channels were able to adapt strategies when goals shifted, while maintaining stable judgment despite environmental uncertainty. The team likened this mechanism to "multiplexing" in communication technology, where multiple signals are transmitted simultaneously without interference.
In this way, the human prefrontal cortex operates through two "channels": one that sensitively tracks goal changes to ensure flexibility in decision-making, and another that isolate environmental uncertainty to maintain stable judgment.
An interesting point is that the prefrontal cortex goes beyond simple executing control guided by the first channel; it uses the second channel to actually choose which learning strategy to use depending on the situation.
This demonstrates the brain’s "meta-learning capabilities," meaning it learns not only what to learn but also how to learn – by choosing which learning strategy to use. This is why humans remain resilient in constantly changing situations.
The implication of this research extend across various fields, including the analysis of individual reinforcement and meta-learning abilities, personalized education design, cognitive diagnosis, and human-computer interaction (HCI). Moreover, embedding brain-inspired representation structures into AI could lead to "brain-like thinking AI", allowing AI to better understand human intentions and values, reducing dangerous judgments, and enabling safer cooperation with humans.

<Figure 1. Balance between Flexibility and Stability in Humans and AI>

<Figure 2. Topological Structure of Goal Representation in the Prefrontal Cortex and Environmental Uncertainty Information>
Lead researcher Professor Sang Wan Lee emphasized the significance of the findings: "This study clarifies the brain's fundamental operating principles—from flexibly following changing goals to stably establishing plans—from an AI perspective. These principles will serve as a core foundation for next-generation AI, allowing it to adapt like a human and learn more safely and intelligently."
This study featured PhD candidate Yoondo Sung as the first author and Dr. Mattia Rigotti of IBM AI Research as the second author, with Professor Sang Wan Lee serving as the corresponding author. The research results were published on November 26 in the international academic journal Nature Communications.
(Paper Title: Factorized embedding of goal and uncertainty in the lateral prefrontal cortex guides stably flexible learning / DOI: 10.1038/s41467-025-66677-w)
Notably, this research was conducted with support from the "Frontier R&D Project" of the Ministry of Science and ICT.
The Graduate School of Global Digital Innovation (GDI) of KAIST will host the "AI⁺ Global Prosperity Forum 2026" on June 24 at the Chung Kunmo Conference Hall (5F), KAIST Academic Cultural Complex (E9). KAIST Graduate School of Global Digital Innovation (GDI) is carrying out the "ICT Global Specialized Convergence Talent Cultivation Program" supported by the Ministry of Science and ICT and the Institute of Information & Communications Technology Planning & Evaluation (IITP). Since t
2026-06-11< (From left) Professor Chang D. Yoo, Tung M. Luu (PhD candidate, first author) at the back center, and Hwanhee Kim (M.S candidate, second author) at the front right > “Robots that make judgments like humans are coming faster than we think.” A core technology that will accelerate the era where robots understand human intentions and choose the correct actions on their own has been developed in South Korea. KAIST researchers solved a key challenge in the commercialization o
2026-06-10<Human Behavior and Mental Health Symposium Poster> KAIST announced the official launch of the KAIST Mind Care & Growth Center (KMCG), a new integrated platform that strengthens mental health support for students and faculty while advancing digital mental health research. To mark the occasion, KAIST hosted an international symposium titled "Human Behavior and Mental Health" on June 10, 2026, at the Cho Su-mi Hall in the Chang Young Shin Student Activity Center on its main Daejeon ca
2026-06-10<(From Left) Hyun-Bin Oh, Takida Yuhta, Uesaka Toshimitsu, Tae-Hyun Oh, Mitsufuji Yuki> When people watch a scene in the film Jurassic Park where a giant dinosaur walks toward them, they naturally imagine a heavy, rumbling sound, as if the ground were shaking. This is because humans predict sound by considering not only the shape of an object, but also physical properties such as its size, weight, and speed of movement. However, existing video-to-audio generation AI mainly generates sou
2026-05-27KAIST announced on May 22nd that the entire faculty of the Graduate School of AI welcomes South Korea's hosting of the 'Global AI Hub.' The faculty determined that hosting this will serve as a crucial momentum builder for South Korea to earnestly contribute to international cooperation and the responsible use of technology in the artificial intelligence (AI) era. In a joint statement, the faculty of the KAIST Graduate School of AI expressed, "Hosting the Global AI Hub goes beyond simply attr
2026-05-22