Show simple item record

dc.contributor.authorWong, Kok-Seng
dc.contributor.authorKaur, Wandeep
dc.contributor.authorBalakrishnan, Vimala
dc.date.accessioned2024-06-28T13:21:31Z
dc.date.available2024-06-28T13:21:31Z
dc.date.issued2022
dc.identifier.urihttps://vinspace.edu.vn/handle/VIN/105
dc.description.abstractOver recent years, the emergence of electronic text processing systems has generated a vast amount of structured and unstructured data, thus creating a challenging situation for users to rummage through irrelevant information. Therefore, studies are continually looking to improve the classification process to produce more accurate results that would benefit users. This paper looks into the weighted information gain method that re-assigns wrongly classified features with new weights to provide better classification. The method focuses on the weights of the frequency bins, assuming every time a certain word frequency bin is iterated, it provides information on the target word feature. Therefore, the more iteration and re-assigning of weight occur within the bin, the more important the bin becomes, eventually providing better classification. The proposed algorithm was trained and tested using a corpus extracted from dedicated Facebook pages related to diabetes. The weighted information gain feature selection technique is then fed into a co-trained Multinomial Naïve Bayes classification algorithm that captures the labels' dependencies. The algorithm incorporates class value dependencies since the dataset used multi-label data before converting string vectors that allow the sparse distribution between features to be minimized, thus producing more accurate results. The results of this study show an improvement in classification to 61%.en_US
dc.subjecttext classificationen_US
dc.subjectmulti-labelen_US
dc.subjectfeature selectionen_US
dc.subjectweighted information gainen_US
dc.subjectmultinomial naïve bayesen_US
dc.titleImproving multi-label text classification using weighted information gain and co-trained multinomial naïve bayes classifieren_US
dc.typeArticleen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

  • Kok-Seng Wong, PhD [16]
    Associate Professor, Computer Science program, College of Engineering and Computer Science

Show simple item record


Vin University Library
Da Ton, Gia Lam
Vinhomes Oceanpark, Ha Noi, Viet Nam
Phone: +84-2471-089-779 | 1800-8189
Contact: library@vinuni.edu.vn