AI and Annotated Medical Images – OpenPath, PLIP, and “Medical Twitter”

By Michael Awood

September 1, 2023

The lack of available annotated medical images has historically hindered healthcare innovation. However, a solution is emerging as healthcare professionals start to share anonymised images and insights on public platforms – which includes the social media site previously known as Twitter (X). This has led to the creation of OpenPath, a comprehensive dataset of over 200,000 pathology images coupled with natural language descriptions, marking it as the largest public dataset of its kind.

Researchers have used OpenPath to develop Pathology Language–Image Pre-training (PLIP), a multimodal AI trained on this dataset. PLIP has demonstrated impressive results in zero-shot learning and transfer learning for classifying new pathology images across various tasks. Additionally, PLIP enables users to locate similar cases using either image or natural language search, encouraging knowledge sharing.

The researchers collected over 240,000 public pathology images using popular pathology-related hashtags and expanded the collection with data from other online sources. After thorough data quality checks, they assembled over 200,000 pathology image-text pairs named OpenPath, which they used to develop the versatile PLIP.

PLIP outperformed previous models in tasks such as zero-shot learning, linear probing, and text-to-image and image-to-image retrieval. Unlike other digital pathology machine learning methods, PLIP can adapt to new datasets and provide zero-shot predictions based on any text input, making it a flexible tool for potential new disease subtypes.

The study did note some limitations, including irrelevant data in the image-text pairs and challenges in accounting for varying magnification levels and staining styles. However, researchers are optimistic that PLIP can adjust to images with diverse magnification levels and staining protocols. They expect that OpenPath and PLIP will significantly contribute to advancing AI in pathology and encourage a data-focused approach in this area.

Reference url

Recent Posts

regulatory validation taletrectinib
Advancing Regulatory Validation Taletrectinib for Enhanced NSCLC Treatment

By João L. Carapinha

June 30, 2026

The MHRA’s validation of Nuvation Bio’s marketing authorisation application marks an important regulatory validation taletrectinib for adults with advanced ROS1-positive non-small cell lung cancer. This milestone, achieved via the International Recognition Procedure in parallel with EMA review, f...
Portuguese Biotech Innovation
Global Impact of Portuguese Biotech Innovation

By João L. Carapinha

June 30, 2026

Portuguese Biotech Innovation has matured into a compelling global force, with Solfarcos marking a decade of translating University of Minho research into commercial technologies that span cosmetics and pharmaceuticals. The Braga-based spin-off has built a unified platform of peptides and protein...
cannabis regulatory exploitation
Exposing Cannabis Regulatory Exploitation in Portugal’s Medicinal Sector

By João L. Carapinha

June 30, 2026

Portugal’s medicines regulator INFARMED has been shaken by a major case of cannabis regulatory exploitation in which criminal networks weaponised medicinal cannabis licences to traffic drugs on an industrial scale. Insufficient inspectors, rapid sector growth and the recruitment of former agency ...