Lung and Colon Cancer Histopathological Image Dataset (LC25000)

Bibliographic Details
Title: Lung and Colon Cancer Histopathological Image Dataset (LC25000)
Authors: Borkowski, Andrew A., Bui, Marilyn M., Thomas, L. Brannon, Wilson, Catherine P., DeLand, Lauren A., Mastorides, Stephen M.
Publication Year: 2019
Collection: Computer Science
Quantitative Biology
Subject Terms: Electrical Engineering and Systems Science - Image and Video Processing, Computer Science - Computer Vision and Pattern Recognition, Quantitative Biology - Quantitative Methods
More Details: The field of Machine Learning, a subset of Artificial Intelligence, has led to remarkable advancements in many areas, including medicine. Machine Learning algorithms require large datasets to train computer models successfully. Although there are medical image datasets available, more image datasets are needed from a variety of medical entities, especially cancer pathology. Even more scarce are ML-ready image datasets. To address this need, we created an image dataset (LC25000) with 25,000 color images in 5 classes. Each class contains 5,000 images of the following histologic entities: colon adenocarcinoma, benign colonic tissue, lung adenocarcinoma, lung squamous cell carcinoma, and benign lung tissue. All images are de-identified, HIPAA compliant, validated, and freely available for download to AI researchers.
Comment: 2 pages
Document Type: Working Paper
Access URL: http://arxiv.org/abs/1912.12142
Accession Number: edsarx.1912.12142
Database: arXiv
More Details
Description not available.