ID | 478 |
Title Transcription | センケイ ハンベツシキ ニヨル ブンタイ ノ ケイリョウ
|
Title Alternative | Stylometrics with Linear Discriminant
|
Author | |
Content Type |
Departmental Bulletin Paper
|
Description | This paper deals with subjects of authorship attribution through statistical analysis. Critics often assume that there are stylistic differences between authors, but it is obscure, how to discriminate the styles. In this paper, 12 works by two German writers, Theodor Storm and Adalbert Stifter, are analysed through one of the multivariate analysis methods, Linear Discriminant Analysis. The basis for analysis is a word list of 8 high frequent particles in the 12 works, which is made by Java and Perl programs. Based on the list, ten of the works are used to compute discriminants. The rest two works are set aside for validation estimate. And the difference between the styles are computed with Mahalanobis distance. Various equations can be composed according to combinations of selected variables, and they are compared based on statistical tests. The best equation is a non-linear one, but for simplicity, a second best equation, which is linear, is preferred. This equation, using two variables, manages to discriminate the two authors, and a validy of Discriminant Analysis for literary styles is confirmed.
|
Journal Title |
言語文化研究
|
ISSN | 13405632
|
NCID | AN10436724
|
Volume | 12
|
Start Page | 85
|
End Page | 113
|
Sort Key | 85
|
Published Date | 2005-02
|
Remark | 公開日:2010年1月24日で登録したコンテンツは、国立情報学研究所において電子化したものです。
|
EDB ID | |
FullText File | |
language |
jpn
|
departments |
Integrated Arts and Sciences
|