ID | 118256 |
Authors |
Shi, Xuefeng
Hefei University of Technology
呉, 雨濃
Chengdu Senton Netease Co., Ltd.
|
Keywords | Active learning
complementary sampling
class-biased multi-label classification
text emotion
|
Resource type |
Journal article
|
Abstract | High-quality corpora have been very scarce for text emotion research. Existing corpora with multi-label emotion annotations have been either too small or too class-biased to properly support supervised emotion learning. In this paper, we propose a novel active learning method for efficiently guiding human annotation toward a less-biased, high-quality multi-label emotion corpus. Specifically, to compensate for the under-annotation of minority-class examples, we propose a complementary sampling strategy based on unlabeled resources that measures a probabilistic distance between the expected emotion label distribution in a temporary corpus and a uniform distribution. We also evaluate the unlabeled examples qualitatively, assessing the model uncertainty of their multi-label emotion predictions, their syntactic representativeness of the other unlabeled examples, and their diversity with respect to the labeled examples, to ensure high-quality sampling. Through active learning, a supervised emotion classifier improves progressively by learning from these new examples. Experimental results suggest that by following these sampling strategies we can develop a corpus of high-quality examples with significantly reduced bias across emotion classes. Compared to learning procedures based on traditional active learning algorithms, our learning procedure yields the most efficient learning curve and the best multi-label emotion predictions.
|
Journal title |
IEEE Transactions on Affective Computing
|
ISSN | 1949-3045
|
Publisher | IEEE
|
Volume | 14
|
Issue | 1
|
Start page | 523
|
End page | 536
|
Publication date | 2020-11-16
|
Rights | © 2020 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
|
EDB ID | |
Publisher version DOI | |
Publisher version URL | |
Full-text file | |
Language |
eng
|
Author version flag |
Author version
|
Department |
Science and Engineering
|