AI unlocks cardiac MRI reading without manual labels, beating general models by 35%

· Medical Xpress

by Cleveland Clinic

edited by Sadie Harley, reviewed by Andrew Zinin

Sadie Harley

Scientific Editor

Meet our editorial team
Behind our editorial process

Andrew Zinin

Lead Editor

Meet our editorial team
Behind our editorial process Editors' notes

This article has been reviewed according to Science X's editorial process and policies. Editors have highlighted the following attributes while ensuring the content's credibility:

fact-checked

peer-reviewed publication

trusted source

proofread

The GIST Add as preferred source


Credit: Pixabay/CC0 Public Domain

A team of researchers from Carnegie Mellon University, in collaboration with Cleveland Clinic's Cardiovascular Innovation Research Center, has developed an artificial intelligence (AI) system capable of interpreting some of the most complex heart scans in medicine, cardiac magnetic resonance imaging (MRI), without the need for manually labeled training data.

The novel system, called CMR-CLIP, is designed to interpret cardiac MRI scans by connecting moving images of the heart with corresponding clinical radiology reports.

The research was published in Nature Communications.

In testing, it significantly outperformed general-purpose AI models, in some cases by more than 35%. The system also showed strong potential for improving cardiac imaging analysis, case retrieval, and clinical decision support.

"This work demonstrates that domain-specific foundation models can significantly outperform general-purpose AI systems in specialized clinical applications," said Ding Zhao, associate professor in Carnegie Mellon University's Department of Mechanical Engineering and co-principal investigator on the study.

"By designing models that reflect the structure and complexity of cardiac MRI data, rather than adapting generic image models, we can unlock new levels of performance and clinical utility."

David Chen, Ph.D., of Cleveland Clinic, a co-principal investigator on the project, emphasized the clinical implications of the work, saying, "Cardiac MRI interpretation is highly specialized and time intensive. Systems like CMR-CLIP have the potential to support clinicians through automated screening and interpretation support, particularly in settings where expert readers are limited. Such reader assistant tools are critical to improving patient access to this powerful diagnostic technology."

Cardiac MRI is widely regarded as the gold standard for evaluating heart structure, function, and tissue health. A single scan can provide a comprehensive view of the heart, including pumping performance, muscle damage, blood flow, and structural abnormalities.

However, each study can contain hundreds to thousands of images across multiple views and time points. Even for trained specialists, interpreting a single exam can take 40 minutes or more. Because the technology is expensive and concentrated in major medical centers, there is a limited supply of experts available to meet growing clinical demand.

This combination of complexity and limited data has also made cardiac MRI one of the most challenging domains for AI. Most machine learning systems rely on large, carefully labeled datasets, but in cardiac imaging, expert annotations are scarce, time-consuming to produce, and costly to scale.

To overcome this barrier, the research team leveraged a resource already embedded in routine clinical workflows: radiology reports. Every cardiac MRI exam is paired with a written summary in which clinicians document key findings in an "impression" section.

Instead of relying on manual labels, the team trained CMR-CLIP to align MRI image sequences with these natural language clinical summaries, enabling the model to learn directly from how physicians describe and interpret scans in practice.

Rather than treating cardiac MRI as a collection of static images, CMR-CLIP represents each study as a video of the beating heart. The model processes multiple standard views of the heart alongside time-resolved sequences that capture motion and tissue behavior. This lets the model capture both structure and movement, much like a cardiologist does when reviewing a scan.

Trained on more than 13,000 de-identified real patient studies from Cleveland Clinic, the system learned from over a million images and hundreds of thousands of motion sequences collected over more than a decade.

When tested, CMR-CLIP was able to identify cardiac conditions in a "zero-shot" setting, meaning it had never been directly trained on those specific labels, simply by matching images to descriptive prompts like "enlarged left ventricle."

Even more striking, with just a single example of a condition, CMR-CLIP could often match the performance of other systems that required dozens of labeled cases.

In more specialized diagnostic tasks, the model reached near-clinical levels of performance, including accuracy rates as high as 99% for certain heart conditions. It also demonstrated the ability to search through large databases of scans using natural language, retrieving similar cases in a way that could one day help clinicians quickly compare patients with rare or complex presentations.

A key test of whether the system was truly learning meaningful representations came when it was evaluated outside the institution where it was trained. The model still performed strongly on two entirely separate datasets (one collected in France, one in Cleveland Clinic, Florida), suggesting it could generalize beyond a single hospital system.

"This work highlights a new direction for medical AI by showing how large-scale clinical data can be used to train models without requiring time-consuming manual labeling," said Deborah Kwon, M.D., Director of Cardiac MRI at Cleveland Clinic, clinical lead and co-author of this study.

"This technology has the potential to not only improve efficiency but also quality of reporting to support more consistent and clinically meaningful interpretations, as well as serve as an important teaching tool in a highly specialized and complex imaging field."

Looking ahead, the research team plans to extend the model to additional cardiac imaging sequences, including perfusion imaging, T2-weighted imaging, and parametric mapping, as well as explore applications in automated report generation and interactive clinical decision support systems in resource-limited applications.

Publication details

Contrastive language image pretraining for a cardiac magnetic resonance image embedding with zero-shot capabilities", Nature Communications (2026). DOI: 10.1038/s41467-026-73022-2

Journal information: Nature Communications

Key medical concepts

Left ventricular hypertrophy

Clinical categories

CardiologyDiagnostic radiology Provided by Cleveland Clinic Who's behind this story?

Sadie Harley

BSc Life Sciences & Ecology. Microbiology lab background with pharmaceutical news experience in oil, gas, and renewable industries. Full profile →

Andrew Zinin

Master's in physics with research experience. Long-time science news enthusiast. Plays key role in Science X's editorial success. Full profile →

Citation: AI unlocks cardiac MRI reading without manual labels, beating general models by 35% (2026, May 21) retrieved 21 May 2026 from https://medicalxpress.com/news/2026-05-ai-cardiac-mri-manual-general.html This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.