%0 Journal Article %A ZHANG Yanping1 %A 2 %A LIU Chao1 %A 2 %A QU Yonghua3 %T Text categorization model based on WCBVSM and SACA %D 2012 %R %J Computer Engineering and Applications %P 137-142 %V 48 %N 11 %X A new text categorization model based on the method which combines WCBVSM with SACA is proposed. The traditional methods of vector space model adopt the key words as the document semantic carrier. These traditional methods ignore the semantic information between the words of text. According to the word co-occurrence model, the Word Co-Occurrence Model Based VSM(WCBVSM) is presented. The model counts the word co-occurrence information of the texts, and adds this information into VSM. Therefore, it is easy to get the semantic information. In addition, because of the conflict between validity and extensibility in cross covering algorithm, this paper presents a Cross Cover Algorithm based on Simulated Annealing algorithm(SACA). This algorithm improves the situation that the selection of cross cover’s center is random, and reduces the number of cover by increasing the sample number in each cover. It enhances the extensibility of the cover classification. The test results show that the proposed model accelerates the speed of recognition and improves the classification accuracy. %U http://cea.ceaj.org/EN/abstract/article_28437.shtml