Nursing-care text classification using additional term information from Web.

Conference Paper in IEEE International Conference on Fuzzy Systems · January 2011
DOI: 10.1109/FUZZY.2011.6007540 · Source: DBLP
Conference: FUZZ-IEEE 2011, IEEE International Conference on Fuzzy Systems, Taipei, Taiwan, 27-30 June, 2011, Proceedings


    In this paper, we introduce a mechanism for retrieving terms from the Web to improve the performance of nursing-care text classification. Every year, nursing-care texts are collected through a Web application in order to improve nursing-care quality in Japan. The collected texts are decomposed into morphemes (i.e., terms), and the terms are stored in a term list. Each text is represented as a feature vector over the term list and classified by an SVM-based classification system. The training data sets used to construct the SVM-based classification system differ from the evaluation data sets: because the nursing-care texts are collected and evaluated every year, the term lists of the training and evaluation texts differ. To cover this difference, we retrieve terms from the Web. A new term that appears in the evaluation data sets is used as a query to a search engine, and the terms in the term list are used as queries as well. Each term is represented by its search results, and the terms are then compared with each other. We use the most similar term in the term list as a substitute for the new term. Experimental results show the effectiveness of the proposed method.
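    The substitution step described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract does not say which search engine, term representation, or similarity measure is used, so the sketch assumes bag-of-words counts over result snippets and cosine similarity. The search engine is replaced by a hypothetical in-memory table (FAKE_SEARCH_RESULTS) with invented snippets, and all function names are illustrative.

```python
import math
from collections import Counter

# Hypothetical stand-in for a search engine. A real system would query a
# search API; these snippets are invented purely for illustration.
FAKE_SEARCH_RESULTS = {
    "wheelchair": ["mobility aid chair wheels patient transfer",
                   "chair wheels mobility support"],
    "bathing": ["bath wash body hygiene care", "wash hygiene water bath"],
    "meal": ["food eating nutrition dinner", "eating food diet"],
    "rollator": ["walking mobility aid wheels support",
                 "wheels mobility frame walking"],
}


def search_snippets(term):
    """Return result snippets for a query term (empty list if none)."""
    return FAKE_SEARCH_RESULTS.get(term, [])


def term_vector(term):
    """Represent a term by word counts over its search-result snippets."""
    counts = Counter()
    for snippet in search_snippets(term):
        counts.update(snippet.split())
    return counts


def cosine(a, b):
    """Cosine similarity between two sparse count vectors (assumed measure)."""
    dot = sum(a[w] * b[w] for w in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def most_similar_term(new_term, term_list):
    """Pick the known term closest to new_term, or None if nothing overlaps."""
    new_vec = term_vector(new_term)
    best_term, best_sim = None, 0.0
    for known in term_list:
        sim = cosine(new_vec, term_vector(known))
        if sim > best_sim:
            best_term, best_sim = known, sim
    return best_term


def feature_vector(tokens, term_list):
    """Count-based feature vector; unseen tokens map to their substitute."""
    index = {term: i for i, term in enumerate(term_list)}
    vec = [0] * len(term_list)
    for tok in tokens:
        if tok not in index:
            tok = most_similar_term(tok, term_list)
        if tok is not None:
            vec[index[tok]] += 1
    return vec


term_list = ["wheelchair", "bathing", "meal"]   # terms seen in training
print(most_similar_term("rollator", term_list))
print(feature_vector(["rollator", "meal"], term_list))
```

    In this toy setup the unseen term "rollator" shares snippet words only with "wheelchair", so it contributes to the "wheelchair" dimension of the feature vector. The resulting vectors would then be passed to whatever SVM classifier was trained on the original term list.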