直近一年間の累計
アクセス数 : ?
ダウンロード数 : ?
ID 113245
タイトル別表記
Emotion Recognition for Japanese Short Sentences Including Slangs
著者
キーワード
Youth Slang
Unknown Words
Bag of Concepts
Word Embedding
k-nearest neighbor algorithm
Maximum Entropy Method
Unsupervised Clustering
資料タイプ
学術雑誌論文
抄録
The growth of Internet communication sites such as weblogs and social networking sites brought younger people especially in teens and in their 20s to create new words and to use them very often. We prepared an emotion corpus by collecting weblog article texts including new words, analyzed the corpus statistically, and proposed a method to estimate emotions of the texts. Most slang words such as Youth Slang are too ambiguous in sense classification to be registered into the existing dictionaries such as thesaurus. To cope with these words, we created a large scale of Twitter corpus and calculated sense similarities between words. We proposed to convert unknown word to semantic class id so that we might be able to process the words that were not included in the learning data. For calculation similarities between words and converting the word into word cluster id, we used the word embedding algorithms such as word2vec, or GloVe. We defined this method as a method using Bag of Concepts as feature. As a result of the evaluation experiment using several classifiers, the proposed method was proved its robustness for unknown expressions.
掲載誌名
Current Analysis on Instrumentation and Control
出版者
Mesford Publisher
2019
2
開始ページ
9
終了ページ
18
発行日
2019-02-01
権利情報
This is an open access article licensed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (Attribution-NonCommercial 4.0 International CC-BY-NC 4.0)(https://creativecommons.org/licenses/by-nc/4.0/deed.ja)
© 2018 Mesford Publisher INC
EDB ID
フルテキストファイル
言語
eng
著者版フラグ
出版社版
部局
理工学系