ID | 113245 |
タイトル別表記 | Emotion Recognition for Japanese Short Sentences Including Slangs
|
著者 | |
キーワード | Youth Slang
Unknown Words
Bag of Concepts
Word Embedding
k-nearest neighbor algorithm
Maximum Entropy Method
Unsupervised Clustering
|
資料タイプ |
学術雑誌論文
|
抄録 | The growth of Internet communication sites such as weblogs and social networking sites brought younger people especially in teens and in their 20s to create new words and to use them very often. We prepared an emotion corpus by collecting weblog article texts including new words, analyzed the corpus statistically, and proposed a method to estimate emotions of the texts. Most slang words such as Youth Slang are too ambiguous in sense classification to be registered into the existing dictionaries such as thesaurus. To cope with these words, we created a large scale of Twitter corpus and calculated sense similarities between words. We proposed to convert unknown word to semantic class id so that we might be able to process the words that were not included in the learning data. For calculation similarities between words and converting the word into word cluster id, we used the word embedding algorithms such as word2vec, or GloVe. We defined this method as a method using Bag of Concepts as feature. As a result of the evaluation experiment using several classifiers, the proposed method was proved its robustness for unknown expressions.
|
掲載誌名 |
Current Analysis on Instrumentation and Control
|
出版者 | Mesford Publisher
|
巻 | 2019
|
号 | 2
|
開始ページ | 9
|
終了ページ | 18
|
発行日 | 2019-02-01
|
権利情報 | This is an open access article licensed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (Attribution-NonCommercial 4.0 International CC-BY-NC 4.0)(https://creativecommons.org/licenses/by-nc/4.0/deed.ja)
© 2018 Mesford Publisher INC |
EDB ID | |
フルテキストファイル | |
言語 |
eng
|
著者版フラグ |
出版社版
|
部局 |
理工学系
|