CGMVQA: A New Classification and Generative Model for Medical Visual Question Answering

Ren, Fuji; Zhou, Yangyang

doi:10.1109/ACCESS.2020.2980024


Please use the following URL to cite this record: https://repo.lib.tokushima-u.ac.jp/115141

ID	115141
Alternative title	New Classification and Generative Model for Medical Visual Question Answering
Authors	Ren, Fuji (任, 福継), University of Tokushima; Zhou, Yangyang, University of Tokushima
Keywords	Classification model; generative model; medical image; transformer; visual question answering
Resource type	Journal article
Abstract	Medical images are playing an important role in the medical domain. A mature medical visual question answering system can aid diagnosis, but there is no satisfactory method to solve this comprehensive problem so far. Considering that there are many different types of questions, we propose a model called CGMVQA, including classification and answer generation capabilities to turn this complex problem into multiple simple problems in this paper. We adopt data augmentation on images and tokenization on texts. We use pre-trained ResNet152 to extract image features and add three kinds of embeddings together to deal with texts. We reduce the parameters of the multi-head self-attention transformer to cut the computational cost down. We adjust the masking and output layers to change the functions of the model. This model establishes new state-of-the-art results: 0.640 of classification accuracy, 0.659 of word matching and 0.678 of semantic similarity in ImageCLEF 2019 VQA-Med data set. It suggests that the CGMVQA is effective in medical visual question answering and can better assist doctors in clinical analysis and diagnosis.
Journal title	IEEE Access
ISSN	2169-3536
Publisher	IEEE
Volume	8
Start page	50626
End page	50636
Publication date	2020-03-11
Rights	This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/
EDB ID	364179
Publisher version DOI	10.1109/ACCESS.2020.2980024
Publisher version URL	https://doi.org/10.1109/ACCESS.2020.2980024
Full-text file	access_8_50626.pdf 2.16 MB
Language	eng
Author version flag	Publisher version
Department	Science and Technology