VinDr-CXR: An open dataset of chest X-rays with radiologist’s annotations
Date
2022
Author
Nguyen, Ha Q.
Lam, Khanh
Le, Linh T.
Pham, Hieu H.
Tran, Dat Q.
Nguyen, Dung B.
Le, Dung D.
Pham, Chi M.
Tong, Hang T. T.
Dinh, Diep H.
Do, Cuong D.
Doan, Luu T.
Nguyen, Cuong N.
Nguyen, Binh T.
Nguyen, Que V.
Hoang, Au D.
Phan, Hien N.
Nguyen, Anh T.
Ho, Phuong H.
Ngo, Dat T.
Nguyen, Nghia T.
Nguyen, Nhan T.
Dao, Minh
Vu, Van
Abstract
Most of the existing chest X-ray datasets include labels from a list of findings without specifying their locations on the radiographs. This limits the development of machine learning algorithms for the detection and localization of chest abnormalities. In this work, we describe a dataset of more than 100,000 chest X-ray scans that were retrospectively collected from two major hospitals in Vietnam. Out of this raw data, we release 18,000 images that were manually annotated by a total of 17 experienced radiologists with 22 local labels of rectangles surrounding abnormalities and 6 global labels of suspected diseases. The released dataset is divided into a training set of 15,000 and a test set of 3,000. Each scan in the training set was independently labeled by 3 radiologists, while each scan in the test set was labeled by the consensus of 5 radiologists. We designed and built a labeling platform for DICOM images to facilitate these annotation procedures. All images are made publicly available in DICOM format along with the labels of both the training set and the test set.
Collections
- Pham Huy Hieu, PhD.