
<(From Left) M.S candidate Soyoung Choi, Ph.D candidate Seong-Hyeon Hwang, Professor Steven Euijong Whang>
Just as human eyes tend to focus on pictures before reading accompanying text, multimodal artificial intelligence (AI)—which processes multiple types of sensory data at once—also tends to depend more heavily on certain types of data. KAIST researchers have now developed a new multimodal AI training technology that enables models to recognize both text and images evenly, enabling far more accurate predictions.
KAIST (President Kwang Hyung Lee) announced on the 14th that a research team led by Professor Steven Euijong Whang from the School of Electrical Engineering has developed a novel data augmentation method that enables multimodal AI systems—those that must process multiple data types simultaneously—to make balanced use of all input data.
Multimodal AI combines various forms of information, such as text and video, to make judgments. However, AI models often show a tendency to rely excessively on one particular type of data, resulting in degraded prediction performance.
To solve this problem, the research team deliberately trained AI models using mismatched or incongruent data pairs. By doing so, the model learned to rely on all modalities—text, images, and even audio—in a balanced way, regardless of context.
The team further improved performance stability by incorporating a training strategy that compensates for low-quality data while emphasizing more challenging examples. The method is not tied to any specific model architecture and can be easily applied to various data types, making it highly scalable and practical.

<Model Prediction Changes with a Data-Centric Multimodal AI Training Framework>

Professor Steven Euijong Whang explained, “Improving AI performance is not just about changing model architectures or algorithms—it’s much more important how we design and use the data for training.” He continued, “This research demonstrates that designing and refining the data itself can be an effective approach to help multimodal AI utilize information more evenly, without becoming biased toward a specific modality such as images or text.”
The study was co-led by doctoral student Seong-Hyeon Hwang and master’s student Soyoung Choi, with Professor Steven Euijong Whang serving as the corresponding author. The results will be presented at NeurIPS 2025 (Conference on Neural Information Processing Systems), the world’s premier conference in the field of AI, which will be held this December in San Diego, USA, and Mexico City, Mexico.
※ Paper title: “MIDAS: Misalignment-based Data Augmentation Strategy for Imbalanced Multimodal Learning,” Original paper: https://arxiv.org/pdf/2509.25831
The research was supported by the Institute for Information & Communications Technology Planning & Evaluation (IITP) under the projects “Robust, Fair, and Scalable Data-Centric Continual Learning” (RS-2022-II220157) and “AI Technology for Non-Invasive Near-Infrared-Based Diagnosis and Treatment of Brain Disorders” (RS-2024-00444862).
<(From Left) Ph.D candidate Hyojin Son, Professor Gwan-su Yi> Proteins in our body function like switches. When a drug binds to a protein, the structure at the binding site changes, and this structural change propagates throughout the protein, turning its function on or off. Google DeepMind’s AlphaFold3 successfully predicted whether drugs bind to proteins and the three-dimensional structure of binding sites. However, it could not predict how signals propagate inside the protein a
2026-03-09<(From left) Professor Sang Wan Lee, Myoung Hoon Ha, and Dr. Yoondo Sung> Artificial intelligence now plays Go, paints pictures, and even converses like a human. However, there remains a decisive difference: AI requires far more electricity than the human brain to operate. Scientists have long asked the question, “How can the brain learn so intelligently using so little energy?” KAIST researchers have moved one step closer to the answer. KAIST (President Kwang Hyung Lee) an
2026-03-06<(From left) KAIST Ph.D. Candidate HyunWoo Chang, Professor EunAe Cho. (Top, from left) Seoul National University Professor Won Bo Lee, Dr. Jae Hyun Ryu.> In the era of climate crisis, hydrogen vehicles are emerging as an alternative for eco-friendly mobility. However, the fuel cell, known as the ‘heart of the hydrogen car,’ still faces limitations of high cost and short lifespan. The core cause is the platinum catalyst. While it is a decisive material for generating electri
2026-02-27< Sang Yup Lee, Senior Vice President for Research at KAIST (Inaugural Chairman of the Korea Synthetic Biology Association) > KAIST announced on February 27th that Sang Yup Lee, Distinguished Professor of the Department of Chemical and Biomolecular Engineering and Senior Vice President for Research, has been appointed as the inaugural chairman of the Korea Synthetic Biology Association (KSBA). This appointment was officially ratified during the association's 5th regular general meeting
2026-02-27< Progress Report Meeting of the Deep-Tech Scale-up Valley Project > KAIST announced on February 27th that it held the "Deep-Tech Scale-up Valley Project Progress Report Meeting" at its main campus in Daejeon on the 26th. During the meeting, the university unveiled its Physical AI strategies and execution structures, currently being developed with a focus on robotics. The Deep-Tech Scale-up Valley Promotion Project is a joint initiative by the Ministry of Science and ICT, Daejeon Metro
2026-02-27