Dataset RibCXR

logo-VinBigData-2020-02

VinDr-RibCXR: A Benchmark Dataset for Automatic Segmentation and Labeling of Individual Ribs on Chest X-rays

Dataset description

A wide range of diagnostic tasks can benefit from an automatic system that is able to segment and label individual ribs on chest X-ray (CXR) images. Recently, deep learning (DL) has shown superior performance to other methods in the segmentation and labeling of individual ribs [1]. However, developing DL algorithms for this task requires annotated images for each rib structure at pixel-level. To the best of our knowledge, there exists no such benchmark datasets and protocols. Hence, we introduce a new benchmark dataset, namely VinDr-RibCXR, for automatic segmentation and labeling of individual ribs from chest X-ray (CXR) scans. The VinDr-RibCXR contains 245 CXRs with corresponding ground truth annotations provided by human experts.

The raw images in DICOM format were sourced from VinDrCXR dataset [2], for which all scans have been de-identified to protect patient privacy. Each image was assigned to an expert, who manually segmented and annotated each of 20 ribs, denoted as L1→L10 (left ribs) and R1→R10 (right ribs). The masks of ribs (see Figure 1) were then stored in a JSON file that can later be used for training instance segmentation models.

To the best of our knowledge, the VinDr-RibCXR is the first publicly released dataset that includes segmentation annotations of the individual ribs, and for both anterior and posterior ribs. To develop and evaluate segmentation algorithms, we divided the whole dataset into a training set of 196 images and a validation set of 49 images.

Figure 1: Ground truth masks provided by the VinDr-SpineXR and segmentation results obtained from our deep learning model [2.]

Download Dataset

To download the VinDr-RibCXR dataset, please sign our Data Use Agreement (DUA) and send the signed DUA to Hieu Pham (v.hieuph4@vinbigdata.org) for obtaining the downloadable link.

Visualization

The image and annotation quality of the dataset can be via VinDr Laboratory – our hub for all public datasets. To access the data hub, users are required to complete a request access form.

Author List and Affiliations

Hoang C. Nguyen [1], Tung T. Le [1], Hieu H. Pham [1,2], Ha Q. Nguyen [1,2]

[1] Medical Imaging Center, Vingroup Big Data Institute, Hanoi, Vietnam

[2] College of Engineering & Computer Science, VinUniversity, Hanoi, Vietnam

*  Corresponding author: Hieu H. Pham (hieuph4@vinbigdata.org)

 

Citation

For any publication that explores this resource, the authors must cite this original paper as follows:

 

@article{nguyen2021ribcxr,
 title={VinDr-RibCXR: A Benchmark Dataset for Automatic Segmentation and Labeling of  
 Individual Ribs on Chest X-rays},
 author={Nguyen, Hoang C and Le, Tung T and Pham, Hieu H and Nguyen, Ha Q},
 year={2021}
}

Contact

We welcome any comments, suggestions or feedback you have for us that help improve the dataset, correspondence should be addressed to: Hieu H. Pham (hieuph4@vinbigdata.org).

References

[1] Joran Wessel, Mattias P Heinrich, Jens von Berg, Astrid Franz, and Axel Saalbach. Sequential rib labeling and segmentation in chest X-ray using Mask R-CNN. arXiv preprint arXiv:1908.08329, 2019.

[2] Ha Q Nguyen, Khanh Lam, Linh T Le, Hieu H Pham, Dat Q Tran, Dung B Nguyen, Dung D Le, Chi M Pham, Hang TT Tong, Diep H Dinh, et al. VinDr-CXR: An open dataset of chest X-rays with radiologist’s annotations. arXiv preprint arXiv:2012.15029, 2020.

en_USEN