AUC Maximization for Low-Resource Named Entity Recognition

Nguyen, Ngoc Dang; Tan, Wei; Du, Lan; Buntine, Wray; Beare, Richard; Chen, Changyou

dc.contributor.author	Nguyen, Ngoc Dang
dc.contributor.author	Tan, Wei
dc.contributor.author	Du, Lan
dc.contributor.author	Buntine, Wray
dc.contributor.author	Beare, Richard
dc.contributor.author	Chen, Changyou
dc.date.accessioned	2024-08-22T03:46:46Z
dc.date.available	2024-08-22T03:46:46Z
dc.date.issued	2023-04-13
dc.identifier.uri	https://vinspace.edu.vn/handle/VIN/221
dc.description.abstract	Current work in named entity recognition (NER) uses either cross entropy (CE) or conditional random fields (CRF) as the objective (loss) function to optimize the underlying NER model. Both of these traditional objective functions generally produce adequate performance when the data distribution is balanced and there are sufficient annotated training examples. However, because NER is inherently an imbalanced tagging problem, model performance in low-resource settings can suffer under these standard objective functions. Building on recent advances in area under the ROC curve (AUC) maximization, we propose to optimize the NER model by maximizing the AUC score. We give evidence that simply combining two binary classifiers that maximize the AUC score yields significant performance improvements over traditional loss functions in low-resource NER settings. We also conduct extensive experiments to demonstrate the advantages of our method under low-resource and highly imbalanced data distributions. To the best of our knowledge, this is the first work that brings AUC maximization to the NER setting. Furthermore, we show that our method is agnostic to different types of NER embeddings, models, and domains. The code for this work is available at https://github.com/dngu0061/NER-AUC-2T.	en_US
dc.language.iso	en	en_US
dc.title	AUC Maximization for Low-Resource Named Entity Recognition	en_US
dc.type	Article	en_US
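
Illustrative note on the abstract: the sketch below shows what "maximizing the AUC score" can mean for a single binary token classifier on imbalanced data. It is a minimal example under stated assumptions, not the authors' implementation: the function names `empirical_auc` and `pairwise_squared_hinge`, the `margin` parameter, and the toy scores are hypothetical, and the pairwise squared-hinge surrogate is one common stand-in for the non-differentiable AUC, which may differ from the objective used in the paper. How the two binary classifiers are defined and combined is not specified in this record and is not shown here.

```python
import numpy as np


def empirical_auc(scores, labels):
    """Fraction of (positive, negative) pairs ranked correctly; ties count as 0.5.

    Assumes at least one positive and one negative example.
    """
    scores = np.asarray(scores, dtype=float)
    labels = np.asarray(labels, dtype=int)
    pos = scores[labels == 1]
    neg = scores[labels == 0]
    # diff[i, j] = score of positive i minus score of negative j
    diff = pos[:, None] - neg[None, :]
    return float(np.mean((diff > 0) + 0.5 * (diff == 0)))


def pairwise_squared_hinge(scores, labels, margin=1.0):
    """Differentiable surrogate: penalize positive/negative pairs whose
    score gap is smaller than `margin` (hypothetical choice of surrogate)."""
    scores = np.asarray(scores, dtype=float)
    labels = np.asarray(labels, dtype=int)
    pos = scores[labels == 1]
    neg = scores[labels == 0]
    diff = pos[:, None] - neg[None, :]
    return float(np.mean(np.maximum(0.0, margin - diff) ** 2))


# Toy imbalanced batch: one entity token (label 1) among three non-entity tokens.
labels = [1, 0, 0, 0]
good_scores = [0.9, 0.8, 0.3, 0.1]  # the entity token is ranked first
bad_scores = [0.2, 0.8, 0.3, 0.1]   # the entity token is ranked below two negatives

auc_good = empirical_auc(good_scores, labels)
auc_bad = empirical_auc(bad_scores, labels)
loss_good = pairwise_squared_hinge(good_scores, labels)
loss_bad = pairwise_squared_hinge(bad_scores, labels)
```

Because both quantities depend only on how positives are ranked against negatives, they are not dominated by the majority class in the way a per-token average can be. That is the usual motivation for AUC-based objectives on imbalanced tagging data.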

Files in this item

Name: AUC Maximization for Low-Resource ...
Size: 1.228Mb
Format: PDF

This item appears in the following Collection(s)

Wray Buntine, PhD. [13]
College of Engineering and Computer Science; Director, Computer Science program
