VinDr-RibCXR: A Benchmark Dataset for Automatic Segmentation and Labeling of Individual Ribs on Chest X-rays

Dataset Description

A wide range of diagnostic tasks can benefit from an automatic system that is able to segment and label individual ribs on chest X-ray (CXR) images. Recently, deep learning (DL) has shown superior performance to other methods in the segmentation and labeling of individual ribs [1]. However, developing DL algorithms for this task requires annotated images for each rib structure at pixel-level. To the best of our knowledge, there exists no such benchmark datasets and protocols. Hence, we introduce a new benchmark dataset, namely VinDr-RibCXR, for automatic segmentation and labeling of individual ribs from chest X-ray (CXR) scans. The VinDr-RibCXR contains 245 CXRs with corresponding ground truth annotations provided by human experts.

The raw images in DICOM format were sourced from VinDrCXR dataset [2], for which all scans have been de-identified to protect patient privacy. Each image was assigned to an expert, who manually segmented and annotated each of 20 ribs, denoted as L1→L10 (left ribs) and R1→R10 (right ribs). The masks of ribs (see Figure 1) were then stored in a JSON file that can later be used for training instance segmentation models.

To the best of our knowledge, the VinDr-RibCXR is the first publicly released dataset that includes segmentation annotations of the individual ribs, and for both anterior and posterior ribs. To develop and evaluate segmentation algorithms, we divided the whole dataset into a training set of 196 images and a validation set of 49 images.

Figure 1: Ground truth masks provided by the VinDr-SpineXR and segmentation results obtained from our deep learning model [2.]

Download

To download the VinDr-RibCXR dataset, please sign our Data Use Agreement (DUA) and send the signed DUA to Ha Nguyen (v.hanq3@vinbigdata.com) for obtaining the downloadable link.

Visualization

The images and annotations of the dataset can be visualized via VinDr Laboratory – our hub for all public datasets.

Citation

For any publication that explores this resource, the authors must cite this original paper:

Hoang C. Nguyen, Tung T. Le, Hieu H. Pham, and Ha Q. Nguyen, “VinDr-RibCXR: A Benchmark Dataset for Automatic Segmentation and Labeling of Individual Ribs on Chest X-rays,” in Proceedings of the 2021 International Conference on Medical Imaging with Deep Learning (MIDL 2021)

Contact

Correspondence should be addressed to: Ha Nguyen (v.hanq3@vinbigdata.com).

References

[1] Joran Wessel, Mattias P Heinrich, Jens von Berg, Astrid Franz, and Axel Saalbach. Sequential rib labeling and segmentation in chest X-ray using Mask R-CNN. arXiv preprint arXiv:1908.08329, 2019.

[2] Ha Q Nguyen, Khanh Lam, Linh T Le, Hieu H Pham, Dat Q Tran, Dung B Nguyen, Dung D Le, Chi M Pham, Hang TT Tong, Diep H Dinh, et al. VinDr-CXR: An open dataset of chest X-rays with radiologist’s annotations. arXiv preprint arXiv:2012.15029, 2020.

Ribs Segmentation