18 Free and Open-source Whole Slide Imaging Pathology Projects and Libraries, a Comprehensive Guide for Bioengineers and Bio Data Scientists (2024)


Digital pathology is a cutting-edge field that transforms traditional pathology by digitizing glass slides into high-resolution whole slide images (WSIs). These WSIs capture the entire tissue sample on a slide, enabling detailed analysis and diagnostics through digital means.

By using advanced imaging techniques, digital pathology allows pathologists to view, analyze, and share pathology data more efficiently.

WSIs are essential in various applications, including cancer diagnosis, research, education, and telepathology, where they support remote consultations and collaborative studies, improving the accuracy and accessibility of pathological assessments.


The list below covers 18 of the most active free digital pathology and whole slide image (WSI) projects as of 2024.

1. QuPath

QuPath is an active open-source project for bioimage analysis and digital pathology.

Features & Tools

  • Lots of tools to annotate and view images, including whole slide & microscopy images
  • Workflows for brightfield & fluorescence image analysis
  • New algorithms for common tasks, including cell segmentation, tissue microarray dearraying
  • Interactive machine learning for object & pixel classification
  • Customization, batch-processing & data interrogation by scripting
  • Easy integration with other tools, including ImageJ
GitHub - qupath/qupath: QuPath - Bioimage analysis & digital pathology

2. CLAM

CLAM (Clustering-constrained Attention Multiple-instance learning) is open-source software developed by the Mahmood Lab. It uses deep learning to analyze whole-slide images (WSIs) of histopathological specimens.

CLAM is a data-efficient, weakly supervised method: it trains on slide-level labels rather than pixel-level annotations, classifies whole slides, and uses attention to highlight the tissue regions that drive each prediction.

This makes it useful for cancer research and diagnosis, where it offers a scalable way to process large pathology datasets.
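To make the weakly supervised idea more concrete, here is a minimal sketch of attention-based pooling over patch features, the general mechanism behind attention multiple-instance learning. It is not CLAM's implementation: the function name `attention_pool`, the toy feature values, and the fixed scoring weights are assumptions for illustration. In a real model the scoring weights are learned during training.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_pool(patch_features, scoring_weights):
    """Pool patch feature vectors into one slide-level vector.

    Each patch gets a scalar score (dot product with scoring_weights),
    the scores become attention weights through softmax, and the slide
    representation is the attention-weighted sum of the patch features.
    """
    scores = [
        sum(f * w for f, w in zip(feat, scoring_weights))
        for feat in patch_features
    ]
    attention = softmax(scores)
    dim = len(patch_features[0])
    slide_vector = [
        sum(a * feat[i] for a, feat in zip(attention, patch_features))
        for i in range(dim)
    ]
    return slide_vector, attention


# Toy example: three patches with 2-D features (hypothetical values).
patches = [[0.1, 0.2], [2.0, 1.5], [0.0, 0.3]]
weights = [1.0, 1.0]  # stand-in for learned parameters
slide_vec, attn = attention_pool(patches, weights)
```

The attention weights sum to one, so they can also be read as a rough per-patch importance map, which is how attention heatmaps over a slide are usually derived.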

GitHub - mahmoodlab/CLAM: Data-efficient and weakly supervised computational pathology on whole slide images - Nature Biomedical Engineering

3. Tifffile Python Library

Tifffile is a versatile Python library designed for storing and reading images and metadata from various TIFF (Tagged Image File Format) and TIFF-like files commonly used in bioimaging. The library supports a wide range of file formats, including TIFF, BigTIFF, OME-TIFF, GeoTIFF, Adobe DNG, and several specialized formats like Zeiss LSM, ImageJ hyperstack, and more.

Tifffile is particularly useful for researchers working with large-scale image data in bioinformatics and related fields, providing robust tools for both reading and writing complex image file formats.

Features

  • Storage and Reading: Tifffile allows for the storage of NumPy arrays in TIFF files and supports reading image data and metadata from numerous TIFF and TIFF-like file formats.
  • Supported Formats: The library can handle files such as BigTIFF, OME-TIFF, GeoTIFF, Adobe DNG, and others used in various bioimaging and scientific applications.
  • Image Data Handling: It can read image data as NumPy arrays or Zarr arrays/groups from different structures like strips, tiles, pages, SubIFDs, higher-order series, and pyramidal levels.
  • Writing Capabilities: Tifffile supports writing image data to TIFF, BigTIFF, OME-TIFF, and ImageJ hyperstack files in various forms, including multi-page, volumetric, pyramidal, and compressed formats.
  • Compression Support: The library integrates with imagecodecs to support multiple compression and predictor schemes like LZW, JPEG, JPEG 2000, WebP, and others.
  • Advanced Functionality: Tifffile can inspect TIFF structures, handle multi-dimensional file sequences, patch TIFF tag values, and parse proprietary metadata formats, making it a comprehensive tool for managing TIFF files in scientific research.
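
The difference between classic TIFF and BigTIFF mentioned above comes down to a few header bytes. The sketch below uses only the Python standard library, not Tifffile itself, to classify a header; the function name `sniff_tiff_header` is hypothetical. With Tifffile installed, files are normally read through the library's own functions such as `tifffile.imread`.

```python
import struct


def sniff_tiff_header(header: bytes) -> dict:
    """Classify the first bytes of a file as classic TIFF or BigTIFF."""
    if len(header) < 8:
        raise ValueError("need at least 8 bytes")
    byte_order = header[:2]
    if byte_order == b"II":
        endian = "<"  # little-endian
    elif byte_order == b"MM":
        endian = ">"  # big-endian
    else:
        raise ValueError("not a TIFF file")

    (version,) = struct.unpack(endian + "H", header[2:4])
    if version == 42:
        # Classic TIFF: 32-bit offset to the first image file directory (IFD).
        (first_ifd,) = struct.unpack(endian + "I", header[4:8])
        kind = "TIFF"
    elif version == 43:
        # BigTIFF: 64-bit offset stored after the offset-size fields.
        if len(header) < 16:
            raise ValueError("need 16 bytes for BigTIFF")
        (first_ifd,) = struct.unpack(endian + "Q", header[8:16])
        kind = "BigTIFF"
    else:
        raise ValueError(f"unknown TIFF version {version}")

    return {
        "kind": kind,
        "little_endian": endian == "<",
        "first_ifd_offset": first_ifd,
    }


# Synthetic headers built in memory, so no file is needed to try this out.
classic = sniff_tiff_header(b"II" + struct.pack("<HI", 42, 8))
big = sniff_tiff_header(b"MM" + struct.pack(">HHHQ", 43, 8, 0, 16))
```

BigTIFF exists because a 32-bit offset cannot address files larger than 4 GB, a limit that whole slide images routinely exceed.
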
GitHub - cgohlke/tifffile: Read and write TIFF files

4. OpenSlide

OpenSlide is an open-source C library that facilitates the reading and manipulation of whole-slide images (WSIs), which are high-resolution images of tissue samples used in digital pathology. This library is crucial for bioinformatics and medical imaging research, enabling the efficient handling of large image files produced by digital slide scanners.

OpenSlide can read WSI formats from various scanner vendors, including Aperio, Hamamatsu, Leica, MIRAX, Sakura, Trestle, and Ventana, making it a versatile tool for researchers working with slides from different scanners.
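
WSIs are stored as multi-resolution pyramids, and OpenSlide's Python binding exposes this through `level_downsamples` and `read_region(location, level, size)`, where `location` is given in level-0 pixels and `size` in pixels of the requested level. The sketch below does not call OpenSlide; it only shows the coordinate arithmetic a caller typically needs. The helper names and the three-level pyramid are assumptions for illustration, and the downsample list is assumed to be sorted in ascending order.

```python
def best_level_for_downsample(level_downsamples, target):
    """Return the deepest pyramid level whose downsample does not exceed target.

    Falls back to level 0 when target is smaller than every downsample.
    """
    best = 0
    for level, downsample in enumerate(level_downsamples):
        if downsample <= target:
            best = level
    return best


def tile_request(col, row, tile_size, level, level_downsamples):
    """Build (location, level, size) for the tile at grid position (col, row).

    location is expressed in level-0 pixels, size in pixels of the chosen level.
    """
    downsample = level_downsamples[level]
    x0 = int(col * tile_size * downsample)
    y0 = int(row * tile_size * downsample)
    return (x0, y0), level, (tile_size, tile_size)


downsamples = [1.0, 4.0, 16.0]  # hypothetical three-level pyramid
level = best_level_for_downsample(downsamples, target=8.0)
request = tile_request(col=2, row=1, tile_size=256, level=level,
                       level_downsamples=downsamples)
```

The resulting tuple has the same argument order as `read_region`, so with a real slide object it could be passed on as `slide.read_region(*request)`.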

GitHub - openslide/openslide: C library for reading virtual slide images

5. Slideflow

Slideflow is an open-source Python library designed to facilitate deep learning workflows with whole-slide images (WSIs) in computational pathology. Developed with flexibility and scalability in mind, Slideflow enables researchers to efficiently manage, preprocess, and analyze large-scale WSI datasets using deep learning models.

Slideflow is particularly valuable for researchers in digital pathology and bioinformatics, offering a comprehensive toolkit for developing and deploying deep learning models on WSI data.

Primary Features

  • WSI Management: Slideflow offers tools for managing and processing WSIs, including tissue masking, patch extraction, and image normalization, which are essential steps in preparing data for deep learning.
  • Deep Learning Integration: The library is built to integrate seamlessly with popular deep learning frameworks, making it easier to train, evaluate, and deploy models on pathology datasets.
  • Dataset Handling: Slideflow provides functionalities for creating and managing large-scale WSI datasets, including support for distributed computing to handle extensive datasets efficiently.
  • Flexible Architecture: The library is designed to be adaptable, allowing users to customize and extend it for specific research needs in computational pathology.
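
The tissue masking and patch extraction steps listed above can be sketched in plain Python. This is not Slideflow's API: the function names are hypothetical, and a simple brightness threshold stands in for whatever masking method a real pipeline uses. The input is assumed to be a small grayscale thumbnail given as a list of rows.

```python
def tile_origins(width, height, tile_size, stride):
    """Yield the top-left (x, y) of every full tile inside the image."""
    for y in range(0, height - tile_size + 1, stride):
        for x in range(0, width - tile_size + 1, stride):
            yield x, y


def tissue_fraction(gray, x, y, tile_size, background_threshold=220):
    """Fraction of tile pixels darker than the background threshold."""
    tissue = 0
    for row in gray[y:y + tile_size]:
        tissue += sum(1 for v in row[x:x + tile_size] if v < background_threshold)
    return tissue / (tile_size * tile_size)


def select_tissue_tiles(gray, tile_size, stride, min_fraction=0.5):
    """Keep only the tiles that contain enough tissue."""
    height, width = len(gray), len(gray[0])
    return [
        (x, y)
        for x, y in tile_origins(width, height, tile_size, stride)
        if tissue_fraction(gray, x, y, tile_size) >= min_fraction
    ]


# Toy 4x4 thumbnail: left half is tissue (dark), right half is background.
thumb = [[100, 100, 255, 255] for _ in range(4)]
kept = select_tissue_tiles(thumb, tile_size=2, stride=2)
```

Filtering on a low-resolution thumbnail first keeps the expensive full-resolution reads limited to tiles that actually contain tissue.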

Other Features

  • Easy-to-use, highly customizable training pipelines
  • Robust slide processing and stain normalization toolkit
  • Support for training with weakly-supervised or strongly-supervised labels
  • Multiple-instance learning (MIL)
  • Self-supervised learning (SSL)
  • Generative adversarial networks (GANs)
  • Explainability tools: Heatmaps, mosaic maps, saliency maps, synthetic histology
  • Robust layer activation analysis tools
  • Uncertainty quantification
  • Interactive user interface for model deployment
GitHub - jamesdolezal/slideflow: Deep learning library for digital pathology, with both Tensorflow and PyTorch support.

6. Bio-Formats

Bio-Formats is a Java library for reading and writing data in life sciences image file formats. It is developed by the Open Microscopy Environment. Bio-Formats is released under the GNU General Public License (GPL); commercial licenses are available from Glencoe Software.

GitHub - ome/bioformats: Bio-Formats is a Java library for reading and writing data in life sciences image file formats. It is developed by the Open Microscopy Environment. Bio-Formats is released under the GNU General Public License (GPL); commercial licenses are available from Glencoe Software.

7. DeepSlide

DeepSlide is an open-source Python library designed for deep learning-based analysis of whole-slide images (WSIs) in digital pathology.

Developed by the BMIRDS team, DeepSlide provides a streamlined workflow for researchers and clinicians working on computational pathology projects, enabling efficient training, validation, and deployment of deep learning models on WSIs.

DeepSlide is a valuable resource for the computational pathology community, providing a comprehensive toolkit for developing and deploying deep learning models on WSIs. Whether for research or clinical applications, DeepSlide offers the features needed to advance the field of digital pathology.

Features

  • Whole-Slide Image Handling: DeepSlide simplifies the processing of WSIs, including patch extraction, tissue segmentation, and image preprocessing, which are essential for accurate deep learning analysis.
  • Model Training and Evaluation: The library provides tools to facilitate the training, validation, and testing of deep learning models on pathology image data, making it easier to achieve high-performance results.
  • Customizable Pipelines: DeepSlide allows users to create and customize pipelines tailored to specific research needs, offering flexibility in how data is processed and models are trained.
  • Integration with Popular Frameworks: DeepSlide is compatible with major deep learning frameworks, such as TensorFlow and PyTorch, enabling seamless integration into existing machine learning workflows.
  • Scalability: The library supports distributed computing, making it suitable for handling large datasets and complex analyses in a scalable manner.
  • Visualization Tools: DeepSlide includes visualization tools that help researchers interpret model outputs, such as heatmaps and overlay images, to better understand the results of their analysis.
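
The repository describes itself as a sliding window framework, and the idea can be sketched in a few lines: enumerate overlapping windows, classify each one, then aggregate the patch predictions into a slide-level label. The code below is an illustration under assumptions, not DeepSlide's code; the function names, the confidence threshold, the majority-vote rule, and the example labels are all hypothetical.

```python
from collections import Counter


def sliding_windows(width, height, window, overlap):
    """Top-left corners of windows overlapping by a fraction (0 <= overlap < 1)."""
    stride = max(1, int(window * (1 - overlap)))
    xs = range(0, max(width - window, 0) + 1, stride)
    ys = range(0, max(height - window, 0) + 1, stride)
    return [(x, y) for y in ys for x in xs]


def aggregate(patch_predictions, min_confidence=0.5):
    """Majority vote over patch predictions at or above min_confidence.

    patch_predictions is a list of (label, confidence) pairs.
    Returns None when no prediction is confident enough.
    """
    votes = Counter(
        label for label, conf in patch_predictions if conf >= min_confidence
    )
    if not votes:
        return None
    return votes.most_common(1)[0][0]


corners = sliding_windows(width=448, height=224, window=224, overlap=0.5)

# Hypothetical per-patch classifier outputs for one slide.
preds = [("acinar", 0.9), ("solid", 0.4), ("acinar", 0.7), ("solid", 0.8)]
slide_label = aggregate(preds)
```

In practice the classifier call sits between the two functions: each window is cropped from the slide, passed through the model, and its (label, confidence) pair is appended to the prediction list.
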
GitHub - BMIRDS/deepslide: Code for the Nature Scientific Reports paper “Pathologist-level classification of histologic patterns on resected lung adenocarcinoma slides with deep neural networks.” A sliding window framework for classification of high resolution whole-slide images, often microscopy or histopathology images.

8. DeepLIIF

DeepLIIF (Deep Learning Inferred Multiplex ImmunoFluorescence) is an open-source Python library developed by the Nadeem Lab for quantifying immunohistochemistry (IHC) images in computational pathology.

It uses deep learning to infer multiplex immunofluorescence (IF) style images from IHC input and to automate cell segmentation and quantification, tasks that otherwise rely on manual scoring.

By replacing a manual, time-consuming process with a more efficient and reproducible one, DeepLIIF supports the interpretation of complex biological data in fields like cancer research and cellular biology.

Its ability to handle large datasets and produce consistent results makes it a useful tool for researchers working with IHC and immunofluorescence images.

GitHub - nadeemlab/DeepLIIF: Deep Learning Inferred Multiplex ImmunoFluorescence for IHC Image Quantification (https://deepliif.org) [Nature Machine Intelligence′22, CVPR′22, MICCAI′23, Histopathology′23, MICCAI′24]

9. SlideSeg (Python)

This is an open-source (MIT) Python module that produces image patches and annotation masks from whole slide images for deep learning in digital pathology.
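
Producing an annotation mask for a patch amounts to rasterizing the annotation polygon over the patch's pixel grid. The sketch below shows one way to do that with a ray-casting point-in-polygon test. It is not SlideSeg's code; the function names and the square annotation are assumptions for illustration.

```python
def point_in_polygon(px, py, polygon):
    """Ray-casting test: True if (px, py) lies inside the polygon."""
    inside = False
    n = len(polygon)
    for i in range(n):
        x1, y1 = polygon[i]
        x2, y2 = polygon[(i + 1) % n]
        # Only edges that straddle the horizontal line y = py can be crossed.
        if (y1 > py) != (y2 > py):
            x_cross = x1 + (py - y1) * (x2 - x1) / (y2 - y1)
            if px < x_cross:
                inside = not inside
    return inside


def patch_mask(polygon, origin, size):
    """Binary mask (list of rows) for a size x size patch with top-left origin.

    Pixel centers are tested against the polygon, given in slide coordinates.
    """
    ox, oy = origin
    return [
        [
            1 if point_in_polygon(ox + c + 0.5, oy + r + 0.5, polygon) else 0
            for c in range(size)
        ]
        for r in range(size)
    ]


# Hypothetical square annotation covering x in [2, 4) and y in [1, 3).
annotation = [(2, 1), (4, 1), (4, 3), (2, 3)]
mask = patch_mask(annotation, origin=(0, 0), size=4)
```

Testing every pixel is slow for large patches, so real tools usually rely on an optimized polygon fill, but the resulting mask is the same kind of object: one label per pixel, aligned with the image patch.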

GitHub - btcrabb/SlideSeg: A Python module that produces image patches and annotation masks from whole slide images for deep learning in digital pathology.

10. WSITools

WSITools is a set of tools for whole slide image (WSI) processing, especially (pairwise) patch extraction, annotation parsing, and data preparation for deep learning.

GitHub - smujiang/WSITools: Tools for whole slide image (WSI) processing. Especially for (pairwise) patch extraction, annotation parsing and data preparation for deep learning purposes.

11. WholeSlideData

WholeSlideData is an open-source package for working with whole-slide images (WSIs) in digital pathology. It includes tools for handling, processing, and analyzing WSIs, along with a fast batch iterator that can be used to train deep learning models, making it easier for researchers to manage large-scale pathology datasets.

The framework is designed for efficient data handling, supporting various formats and enabling streamlined workflows in computational pathology.

GitHub - DIAGNijmegen/pathology-whole-slide-data: A package for working with whole-slide data including a fast batch iterator that can be used to train deep learning models.

12. A graph-transformer for whole slide image classification

The tmi2022 repository by vkola-lab contains the code and resources for a study published in IEEE Transactions on Medical Imaging in 2022. The project implements a graph-transformer for whole slide image classification. It provides the scripts, models, and datasets used in the study, allowing for replication and further research in medical imaging.
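
A common way to represent a slide as a graph is to treat each tissue patch as a node and connect patches that are spatial neighbours. The sketch below shows only that construction step, under stated assumptions: the function name is hypothetical, 8-connectivity is an assumed choice, and nothing here is taken from the repository's code.

```python
def build_patch_graph(patch_coords):
    """Adjacency list linking each patch to its 8 spatial neighbours.

    patch_coords is a list of (col, row) grid positions of tissue patches.
    Returns {node_index: sorted list of neighbouring node indices}.
    """
    index = {coord: i for i, coord in enumerate(patch_coords)}
    graph = {i: [] for i in index.values()}
    for (col, row), i in index.items():
        for dc in (-1, 0, 1):
            for dr in (-1, 0, 1):
                if dc == 0 and dr == 0:
                    continue  # skip the patch itself
                j = index.get((col + dc, row + dr))
                if j is not None:
                    graph[i].append(j)
        graph[i].sort()
    return graph


coords = [(0, 0), (1, 0), (1, 1), (5, 5)]  # the last patch is isolated
graph = build_patch_graph(coords)
```

Each node would then carry a feature vector extracted from its patch, and the adjacency structure tells the model which patches can exchange information.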

GitHub - vkola-lab/tmi2022: A graph-transformer for whole slide image classification

13. HistomicsStream

HistomicsStream is an open-source whole-slide image reader for TensorFlow from the DigitalSlideArchive project. It reads regions of WSIs so that they can be fed into TensorFlow pipelines for analysis.

This is useful for deep learning workflows on pathology images that are too large to load into memory at once.

GitHub - DigitalSlideArchive/HistomicsStream: A whole-slide image reader for TensorFlow

14. wsireg

wsireg performs multi-modal or mono-modal whole slide image registration in a graph structure for complex registration tasks using elastix. For a detailed introduction, installation, and usage instructions, see the docs.

Features

  • Graph based approach to defining modalities and arbitrary transformation paths between associated images
  • Use of elastix (through ITKElastix) to perform registration
  • Support for linear and non-linear transformation models
  • Transform associated data (masks, shape data) along the same path as the images.
  • Supports images converted to OME-TIFF using the bioformats2raw -> raw2ometiff pipeline as well as array-like images from memory (np.ndarray, zarr.Array, da.core.Array from numpy, zarr, and dask, respectively)
  • All registered images exported as pyramidal OME-TIFF or OME-zarr that can be viewed in software such as Vitessce, vizarr, QuPath, OMERO, or any platform that supports these formats.
  • All transforms for complex registration paths are internally composited and only 1 interpolation step is performed, avoiding accumulation of interpolation error from many registrations
  • Shape data (polygons, point sets, etc.) in GeoJSON format (portable format for QuPath detection/annotation data since v0.3.0) can be imported and transformations applied producing a modified GeoJSON
  • Some support for reading native WSI formats: currently reads .czi and .scn but could be expanded to other formats supported by tifffile
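
Two of the ideas above, finding a transformation path through a graph of modalities and compositing the transforms so that only one interpolation is needed, can be sketched with plain affine matrices. This is not wsireg's API: wsireg works with elastix transforms, including non-linear ones, while the sketch is restricted to 3x3 affine matrices. The function names and modality labels are hypothetical.

```python
from collections import deque


def find_path(edges, source, target):
    """Breadth-first search for a registration path between two modalities.

    edges maps each modality to the modalities it is registered to.
    """
    queue = deque([[source]])
    seen = {source}
    while queue:
        path = queue.popleft()
        if path[-1] == target:
            return path
        for nxt in edges.get(path[-1], []):
            if nxt not in seen:
                seen.add(nxt)
                queue.append(path + [nxt])
    return None


def compose(a, b):
    """Return the 3x3 matrix product a @ b (apply b first, then a)."""
    return [
        [sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
        for i in range(3)
    ]


def composite_transform(path, transforms):
    """Fold the per-edge affine matrices along a path into one matrix."""
    total = [[1, 0, 0], [0, 1, 0], [0, 0, 1]]
    for src, dst in zip(path, path[1:]):
        total = compose(transforms[(src, dst)], total)
    return total


def translation(tx, ty):
    """Homogeneous 3x3 matrix for a pure translation."""
    return [[1, 0, tx], [0, 1, ty], [0, 0, 1]]


# Hypothetical setup: IF is registered to HE through an intermediate IHC image.
edges = {"IF": ["IHC"], "IHC": ["HE"]}
transforms = {
    ("IF", "IHC"): translation(10, 0),
    ("IHC", "HE"): translation(0, -5),
}
path = find_path(edges, "IF", "HE")
matrix = composite_transform(path, transforms)
```

Because the two steps collapse into a single matrix, the IF image would be resampled once into HE space instead of once per registration step.
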
GitHub - NHPatterson/wsireg: multimodal whole slide image registration in a graph structure

15. WSInfer (Python)

This is an open-source Python-based tool for patch-based classification of whole-slide images (WSIs) in digital pathology. It runs trained machine learning models over the patches of a WSI and collects the per-patch outputs, aiding the study and diagnosis of diseases.

The tool is designed to fit into existing computational pathology workflows, offering fast patch-level inference on high-resolution pathology images.

WSInfer: blazingly fast inference on whole slide images — WSInfer documentation
GitHub - SBU-BMI/wsinfer: 🔥 🚀 Blazingly fast pipeline for patch-based classification in whole slide images

16. Exact

Exact is a free and open source online platform for collaborative image labeling of almost everything.

GitHub - DeepMicroscopy/Exact: An open source online platform for collaborative image labeling of almost everything

17. pyslide

pyslide is a Python library for the analysis and processing of whole-slide images (WSIs). It provides tools for extracting, handling, and analyzing regions of interest within WSIs, making it useful for tasks in computational pathology and related research areas.

GitHub - PingjunChen/pyslide: Digital Pathology Whole Slide Image Analysis Toolbox

18. wsic

wsic is a whole slide image (WSI) conversion and compression tool for brightfield histology images.

Features

  • Reading and writing several container formats.
  • Support for a wide range of compression codecs.
  • Custom tile size
  • Lossless repackaging / transcoding (to zarr/NGFF or TIFF) from:
    • SVS (JPEG compressed)
    • OME-TIFF (single image, JPEG and JPEG2000 (J2K) compressed)
    • Generic Tiled TIFF (JPEG, JPEG2000, and WebP compressed)
    • DICOM WSI (JPEG and JPEG2000 (J2K) compressed)
GitHub - John-P/wsic: Whole Slide image (WSI) conversion for brightfield histology images







