Yipei Song1, Francesco Cisternino5, Joost Mekke2, Gert Jan de Borst2, Dominique P.V. de Kleijn2, Gerard Pasterkamp3, Aryan Vink4, Craig Glastonbury5, Sander W. van der Laan3*, Clint L. Miller1*. * Authors contributed equally.
1) Center for Public Health Genomics, Department of Public Health Sciences, Department of Biochemistry and Molecular Genetics, University of Virginia, Charlottesville, VA 22908, USA. 2) Department of Vascular Surgery, Division Surgical Specialties, University Medical Center Utrecht, Utrecht University, Utrecht, the Netherlands. 3) Central Diagnostics Laboratory, Division Laboratories, Pharmacy, and Biomedical Genetics, University Medical Center Utrecht, Utrecht University, Utrecht, the Netherlands. 4) Department of Pathology, Division Laboratories, Pharmacy, and Biomedical Genetics, University Medical Center Utrecht, Utrecht University, Utrecht, the Netherlands. 5) Human Technopole, Viale Rita Levi-Montalcini, 1, 20157, Milano, Italy.
EntropyMasker
is a fully automated approach for separating foreground (tissue) and background in bright-field microscopic whole-slide images of (immuno)histologically stained samples. This method is unaffected by changes in scanning or image processing conditions, by using a measure of local entropy and generating corresponding binary tissue masks.
Background: Tissue segmentation of histology whole-slide images remains a critical challenge in digital pathology for both disease diagnosis and phenotyping for research purposes. When the tissue structure of a specimen is relatively porous and heterogeneous, such as for atherosclerotic plaques, precise tissue segmentation is essential for a correct interpretation.
Method: In this study, we developed a unique approach called EntropyMasker
based on statistical analysis and binary morphology to tackle the issue of fore- and background segmentation (masking) in histopathological whole-slide images (WSIs), which has yet to be solved.
Results and conclusion: We tested our system on large extracts from 97 high-resolution WSIs of hematoxylin and eosin and 8 other staining types on atherosclerotic plaques from the Athero-Express Biobank Study each with different staining quality and acquired under different imaging settings. We compared our method with four other widely used methods and found that our method had the highest sensitivity and Jaccard similarity index. We envision a positive impact of EntropyMasker
to WSI-preprocessing in image analyses for disease phenotyping beyond the field of atherosclerosis.
EntropyMasker
is dependent on python3
. You can run it locally or through a high-performance compute cluster (HPC).
To run EntropyMasker
run the following code:
python EntropyMaster/entropyMasker.py --input_img --output_img
Flag | Required/Optional | Description |
---|---|---|
-i/--input_img |
required | complete path to the whole-slide image to mask |
-o/--output_img |
required | complete path to the masked output-image |
-h/--help |
optional | to get a usage-description |
Description of commands.
File | Description | Usage |
---|---|---|
README.md | Description of project | Human editable |
EntropyMasker.Rproj | R project file to generate this README.md | Loads R project; reads only |
LICENSE | User permissions | Read only |
EntropyMaster | The EntropyMasker program | Human editable |
AEDB.EM.baseline.Rmd | R Markdown notebook to produce baseline table and plots | Human editable |
ae_baseline | Baseline for the 97 samples used | Human editable |
ae_output | Output folder for the R Markdown notebook | Human editable |
ae_plots | Plotss folder for the R Markdown notebook | Human editable |
images | Images used in this README.md | Human editable |
notebook | Jupyter notebook for this project | Human editable |
scripts | QuPath scripts used for this project | Human editable |
Do you have burning questions or do you want to discuss usage with other users? Do you want to report an issue? Or do you have an idea for improvement or adding new features to our method and tool? Please use the Issues tab.
Using our EntropyMasker
method? Please cite our work:
An automatic entropy method to efficiently mask histology whole-slide images
Yipei Song, Francesco Cisternino, Joost Mekke, Gert Jan de Borst, Dominique P.V. de Kleijn, Gerard Pasterkamp, Aryan Vink, Craig Glastonbury, Sander W. van der Laan, Clint L. Miller.
medRxiv 2022.09.01.22279487; doi: https://doi.org/10.1101/2022.09.01.22279487
The whole-slide images used in this project are available through a DataverseNL repository. There are restrictions on use by commercial parties, and on sharing openly based on (inter)national laws, regulations and the written informed consent. Therefore these data (and additional clinical data) are only available upon discussion and signing a Data Sharing Agreement (see Terms of Access) and within a specially designed UMC Utrecht provided environment.
We are thankful for the support of the Netherlands CardioVascular Research Initiative of the Netherlands Heart Foundation (CVON 2011/B019 and CVON 2017-20: Generating the best evidence-based pharmaceutical targets for atherosclerosis [GENIUS I&II]), the ERA-CVD program 'druggable-MI-targets' (grant number: 01KL1802), and the Leducq Fondation 'PlaqOmics'.
Funding for this research was provided by National Institutes of Health (NIH) grant nos. R00HL125912 and R01HL14823 (to Clint L. Miller), and a Leducq Foundation Transatlantic Network of Excellence ('PlaqOmics') grant no. 18CVD02 (to Dr. Clint L. Miller and Dr. Sander W. van der Laan), and EU H2020 TO_AITION grant no. 848146 (to Dr. Sander W. van der Laan).
Dr. Sander W. van der Laan has received Roche funding for unrelated work.
Dr Craig A. Glastonbury has stock options in BenevolentAI and is a paid consultant for BenevolentAI, unrelated to this work.
Plaque samples are derived from arterial endarterectomies as part of the Athero-Express Biobank Study which is an ongoing study in the UMC Utrecht. We would like to thank all the (former) employees involved in the Athero-Express Biobank Study of the Departments of Surgery of the St. Antonius Hospital Nieuwegein and University Medical Center Utrecht for their continuing work. In particular we would like to thank (in no particular order) Marijke Linschoten, Arjan Samani, Petra H. Homoed-van der Kraak, Tim Bezemer, Tim van de Kerkhof, Joyce Vrijenhoek, Evelyn Velema, Ben van Middelaar, Sander Reukema, Robin Reijers, Joëlle van Bennekom, and Bas Nelissen. Lastly, we would like to thank all participants of the Athero-Express Biobank Study; without you these studies would not be possible.
Version: v1.0.1</br>
Last update: 2022-09-05</br>
Written by: Yipei Song; Craig Glastonbury; Sander W. van der Laan; Clint L. Miller.
* v1.0.1 Made public. Added citation to preprint. Fix in the readme.
* v1.0.0 Initial version.
Copyright (c) 2022. Yipei Song | Craig Glastonbury | Sander W. van der Laan | Clint L. Miller.
This is a human-readable summary of (and not a substitute for) the license. You are free to: Share, copy and redistribute the material in any medium or format. The licencor cannot revoke these freedoms as long as you follow the license terms.
Under the following terms:
- Attribution --- You must give appropriate credit, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.
- NonCommercial --- You may not use the material for commercial purposes.
- NoDerivatives --- If you remix, transform, or build upon the material, you may not distribute the modified material.
- No additional restrictions --- You may not apply legal terms or technological measures that legally restrict others from doing anything the license permits.
Notices: You do not have to comply with the license for elements of the material in the public domain or where your use is permitted by an applicable exception or limitation. No warranties are given. The license may not give you all of the permissions necessary for your intended use. For example, other rights such as publicity, privacy, or moral rights may limit how you use the material.