AI and Annotated Medical Images – OpenPath, PLIP, and “Medical Twitter”

By Michael Awood

September 1, 2023

The lack of available annotated medical images has historically hindered healthcare innovation. However, a solution is emerging as healthcare professionals start to share anonymised images and insights on public platforms – which includes the social media site previously known as Twitter (X). This has led to the creation of OpenPath, a comprehensive dataset of over 200,000 pathology images coupled with natural language descriptions, marking it as the largest public dataset of its kind.

Researchers have used OpenPath to develop Pathology Language–Image Pre-training (PLIP), a multimodal AI trained on this dataset. PLIP has demonstrated impressive results in zero-shot learning and transfer learning for classifying new pathology images across various tasks. Additionally, PLIP enables users to locate similar cases using either image or natural language search, encouraging knowledge sharing.

The researchers collected over 240,000 public pathology images using popular pathology-related hashtags and expanded the collection with data from other online sources. After thorough data quality checks, they assembled over 200,000 pathology image-text pairs named OpenPath, which they used to develop the versatile PLIP.

PLIP outperformed previous models in tasks such as zero-shot learning, linear probing, and text-to-image and image-to-image retrieval. Unlike other digital pathology machine learning methods, PLIP can adapt to new datasets and provide zero-shot predictions based on any text input, making it a flexible tool for potential new disease subtypes.

The study did note some limitations, including irrelevant data in the image-text pairs and challenges in accounting for varying magnification levels and staining styles. However, researchers are optimistic that PLIP can adjust to images with diverse magnification levels and staining protocols. They expect that OpenPath and PLIP will significantly contribute to advancing AI in pathology and encourage a data-focused approach in this area.

Reference url

Recent Posts

Donanemab Alzheimer’s Approval: A New Era in Treatment

By João L. Carapinha

August 8, 2025

The EMA granted marketing authorization for donanemab, a monoclonal antibody targeting beta-amyloid plaques. It treats early symptomatic Alzheimer's disease (mild cognitive impairment or mild dementia with confirmed amyloid pathology). This approval highlights donanemab's role in Alzheimer's trea...
Building a Better Ecosystem for Digitally Enabled Diabetes Care in Europe
Digitally enabled diabetes care is rapidly transforming the management of Type 2 diabetes (T2D) across Europe. How does expanding access to these digital solutions improve health outcomes and reduce costs for individuals and health systems? Early evidence and recent policy recommendations (publis...
AI in Real-World Evidence: Insights from Mitch Higashi

By Staff Writer

August 7, 2025

In this update, we review the Q&A with Mitch Higashi, PhD, ISPOR’s Associate Chief Science Officer published on Pharmalive. It discusses AI in Real-World Evidence (RWE) and health technology assessments (HTA). Key themes include AI-driven early disease detection and NLP tackling unstructured ...