Paper Abstract and Keywords |
Presentation |
2022-09-10 14:00
An Analysis of Stopwords in Document Classification Tasks with BERT Yuki Kuwabara, Yu Suzuki (Gifu Univ.) DE2022-14 |
Abstract |
(in Japanese) |
(See Japanese page) |
(in English) |
When researchers classify documents, they sometimes use stopwords to improve accuracy without checking their effectiveness. We tested the effectiveness of stopwords in improving accuracy in document classification tasks with BERT. We classified documents using different stopwords. We compared the accuracy of the document classification tasks. We did not see a significant improvement in accuracy by removing stopwords. We found that the stopwords we used were not effective for document classification tasks with BERT. We would like to find stopwords that are effective in improving accuracy in document classification tasks with BERT. |
Keyword |
(in Japanese) |
(See Japanese page) |
(in English) |
stopwords / document classification / BERT / / / / / |
Reference Info. |
IEICE Tech. Rep., vol. 122, no. 176, DE2022-14, pp. 41-46, Sept. 2022. |
Paper # |
DE2022-14 |
Date of Issue |
2022-09-02 (DE) |
ISSN |
Online edition: ISSN 2432-6380 |
Copyright and reproduction |
All rights are reserved and no part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopy, recording, or any information storage and retrieval system, without permission in writing from the publisher. Notwithstanding, instructors are permitted to photocopy isolated articles for noncommercial classroom use without fee. (License No.: 10GA0019/12GB0052/13GB0056/17GB0034/18GB0034) |
Download PDF |
DE2022-14 |
Conference Information |
Committee |
DE IPSJ-DBS IPSJ-IFAT |
Conference Date |
2022-09-09 - 2022-09-10 |
Place (in Japanese) |
(See Japanese page) |
Place (in English) |
Toyama Prefectural Hall |
Topics (in Japanese) |
(See Japanese page) |
Topics (in English) |
Bigdata management, information retrieval, knowledge discovery, etc. |
Paper Information |
Registration To |
DE |
Conference Code |
2022-09-DE-DBS-IFAT |
Language |
Japanese |
Title (in Japanese) |
(See Japanese page) |
Sub Title (in Japanese) |
(See Japanese page) |
Title (in English) |
An Analysis of Stopwords in Document Classification Tasks with BERT |
Sub Title (in English) |
|
Keyword(1) |
stopwords |
Keyword(2) |
document classification |
Keyword(3) |
BERT |
Keyword(4) |
|
Keyword(5) |
|
Keyword(6) |
|
Keyword(7) |
|
Keyword(8) |
|
1st Author's Name |
Yuki Kuwabara |
1st Author's Affiliation |
Gifu University (Gifu Univ.) |
2nd Author's Name |
Yu Suzuki |
2nd Author's Affiliation |
Gifu University (Gifu Univ.) |
3rd Author's Name |
|
3rd Author's Affiliation |
() |
4th Author's Name |
|
4th Author's Affiliation |
() |
5th Author's Name |
|
5th Author's Affiliation |
() |
6th Author's Name |
|
6th Author's Affiliation |
() |
7th Author's Name |
|
7th Author's Affiliation |
() |
8th Author's Name |
|
8th Author's Affiliation |
() |
9th Author's Name |
|
9th Author's Affiliation |
() |
10th Author's Name |
|
10th Author's Affiliation |
() |
11th Author's Name |
|
11th Author's Affiliation |
() |
12th Author's Name |
|
12th Author's Affiliation |
() |
13th Author's Name |
|
13th Author's Affiliation |
() |
14th Author's Name |
|
14th Author's Affiliation |
() |
15th Author's Name |
|
15th Author's Affiliation |
() |
16th Author's Name |
|
16th Author's Affiliation |
() |
17th Author's Name |
|
17th Author's Affiliation |
() |
18th Author's Name |
|
18th Author's Affiliation |
() |
19th Author's Name |
|
19th Author's Affiliation |
() |
20th Author's Name |
|
20th Author's Affiliation |
() |
Speaker |
Author-1 |
Date Time |
2022-09-10 14:00:00 |
Presentation Time |
25 minutes |
Registration for |
DE |
Paper # |
DE2022-14 |
Volume (vol) |
vol.122 |
Number (no) |
no.176 |
Page |
pp.41-46 |
#Pages |
6 |
Date of Issue |
2022-09-02 (DE) |
|