AUTHOR(S):
|
TITLE Automatic Labeling to Classify News Articles Based on Paragraph Vector |
ABSTRACT Getting useful information from the Internet plays an important role. A news site is one of the Internet services often used for obtaining information on the Internet. The news site has advantages such that information update is fast and there are abundant kinds of information, and in recent years there are sites that collaborate with multiple newspaper companies and post bulk content. However, as there are a lot of articles, there are problems that it is difficult to find the articles we would like to read. Therefore, how to classify and present articles is an important issue. In this study, we consider the category classification of documents using a distributed representation of sentences. Specifically, we propose a method to classify articles by extracting words with similar meanings from sentence vectors of each category and assigning them as labels. |
KEYWORDS Distributed representation, paragraph vector, neural network, automatic labeling, text classification, category classification |
REFERENCES [1] Fujitsu Laboratory, http://www.fujitsu.com/jp/group/fri/report/ cyber/research/4/title07.html |
Cite this paper Taishi Saito, Osamu Uchida. (2018) Automatic Labeling to Classify News Articles Based on Paragraph Vector. International Journal of Computers, 3, 27-32 |
|