Full-Text Search:
Home|Journal Papers|About CNKI|User Service|FAQ|Contact Us|中文
《Application Research of Computers》 2001-09
Add to Favorite Get Latest Update

Research and Implementation of Text Categorization System Based on VSM

PANG Jian feng,BU Dong bo,BAI Shuo (Institute of Computing Technology,Chinese Academy of Sciences,Beijing 100080,China)  
In recent years , information processing turns more and more important for us to get useful information . Text categorization, the automated assigning of natural language texts to predefined categories based on their contents, is a task of increasing importance. This paper gives a research to several key techniques about text categorization , including vector space model , feature extraction , machine learning . It also describes a text categorization model based on VSM, and gives the evaluations and results .
【CateGory Index】: TP393
Download(CAJ format) Download(PDF format)
CAJViewer7.0 supports all the CNKI file formats; AdobeReader only supports the PDF format.
【References】
Chinese Journal Full-text Database 10 Hits
1 HONG Ying(Computer Information Center,Beijing Institute of Fashion Technology,Beijing 100029,China);Research of Intelligent Personalized Information Retrieval System Based on the Improved VSM Algorithm[J];Journal of Beijing Institute of Clothing Technology(Natural Science Edition);2010-01
2 ZHAN Shou-yi, JING Xin(Department of Computer Science and Engineering,School of Information Science and Technology, Beijing Institude of Technology, Beijing 100081, China);Personal Information Filtering Technology with Time Factor[J];Journal of Beijing Institute of Technology;2005-09
3 He Yuanjiao1 Zhang Guoying2(1 School of Information Science & Technology,Beijing University of Chemical Technology,Beijing 1000291;2 Department of Automation,Beijing Institute of Petro-chemical Technology,Beijing 102617);Semantic Simple Vector Distance Classification Based on Ontology[J];Journal of Beijing Institute of Petro-Chemical Technology;2007-03
4 DAI Jin,HU Feng,WANG Guo-yin(Institute of Computer Science and Technology,Chongqing University of Posts and Telecommunications,Chongqing 400065,P.R.China);Research and application of text classification based on incomplete information system[J];Journal of Chongqing University of Posts and Telecommunications(Natural Science);2006-03
5 MA Jian-bin~1 LI Ying~2 TENG Gui-fa~1 WANG Fang~1 ZHAO Yang~1 1.College of Information Science and Technology,Agricultural University of Hebei,Baoding 071001,China; 2.College of Science,Agricultural University of Hebei,Baoding 071001,China;The comparison studies on the algorithm of KNN and SVM for chinese text classification[J];Journal of Agricultural University of Hebei;2008-03
6 CHEN Zi-xing City College,Dongguan Univarsity of Technology,Dongguan 523106,China;Design and Lmplementation of Text Classificaton System Based on SVM[J];Journal of Dongguan University of Technology;2008-03
7 LI Yue~1,AN Jie~2,LI Xing~1(1.Department of Electronic Engineering,Tsinghua Univ.,Beijing 100084,China;2.Network Center,Tsinghua Univ.,Beijing 100084,China);Application of rank aggregation to campus network search engine[J];Journal of Dalian University of Technology;2005-S1
8 SHAO Le,YU Hong,Liu Xi-jing,QI Xiao-ji,LIANG Xiao-na(School of Information Engineering,Dalian Fisheries Univ.,Dalian 116023,China);Fishery text classification based on number of Nave Bayes[J];Journal of Dalian Fisheries University;2010-01
9 Zhao Junjie Sheng Jianfeng Tao Xinmin;A KNN Algorithm in Text Classification Based on Feature Weighting[J];Computer Study;2010-02
10 ;Research of Information Filtering Technology based on BP ANN[J];Computer Development & Applications;2007-06
China Proceedings of conference Full-text Database 10 Hits
1 Zhao Shuanzhu Chen Junjie Guo Xin College of Computer and Software Taiyuan University of Technology,Taiyuan,Shanxi,Chnia,030024;Research on the Frame Structure and Its Implement of a Special Field-based Content Information Mining System on Web[A];[C];2005
2 Zhang Juan, Wang Huifeng(East China University of Science and Technology, Shanghai, 200237);Application of Text Classification to Mass Financial Information Processing[A];[C];2005
3 NIU Qiang, WANG Zhi-xiao, CHEN Dai, XIA Shi-xiong (School of Computer Science & Technology, CUMT, Xuzhou 221008 China);Web Document Classification Based on SVM[A];[C];2006
4 DU Luyan, MIAO Zhengjiang(Beijing Jiaotong University, Institute of Information Sciences, Beijing, 100044);Chinese Text Classification System Based on Language Model[A];[C];2009
5 CHEN Qing-xuan,ZHENG De-quan,ZHEN Bo-wen,ZHAO Tie-jun,LI Sheng (MOE-MS Key Laboratory of Natural Language Processing and Speech,Harbin Institute of Technology,Harbin 150001,China);Text feature selection based on document frequency distribution for Chinese text classification[A];[C];2010
6 WANG Xiu-juan, GUO Jun, ZHENG Kang-feng (School of Information and Engineering, Beijing University of Posts and Telecommunications, Beijing, 100876 China);Feature Selection Based on Mutual Information[A];[C];2006
7 Dan Wang~1 Hongliu Cai~1 Bin Wang~2 (1 Engineering department of information,Project institutes of armoured forces,Beijing 100072 2 The navy stays in professional representative's room of guided missile of area of Shenyang,Shenyang 110043);Digital Watermark Algorithm on the Basis of the Chaos Array[A];[C];2007
8 Shi Yanrong Sun Danning He Yongqiang Shandong Institute of Business and Technology,Yan Tai,264005;Research and Performance Analysis of A Content-Based Email Filtering System[A];[C];2007
9 Wang Jingzhong Zhang Lu Department of Information Engineering,North China University of Technology,Beijing 100041;Text Similar Computing Based on HNC Context Framework Model[A];[C];2008
10 Cheng xinrong Yang Rengang (College of Information and Electrical Engineering,China Agricultural University,Beijing 100083,China);Automatic classification of web texts using on search engine[A];[C];2007
【Co-citations】
Chinese Journal Full-text Database 10 Hits
1 SHI Lei et al(College of Information and Management Science,Henan Agricultural University,Zhengzhou,Henan 450002);Application of Ensemble Learning Technique in Agriculture[J];Journal of Anhui Agricultural Sciences;2008-26
2 SHI Lei et al (College of Information and Management Science,Henan Agricultural University,Zhengzhou,Henan 450002);Research on the Classification of Agricultural Data Based on Support Vector Machine[J];Journal of Anhui Agricultural Sciences;2009-05
3 SHI Lei et al(College of Information and Management Science,Henan Agricultural University,Zhengzhou,Henan 450002);Research on the Diagnosis of Soybean Diseases Based on Naive Bayes Algorithm[J];Journal of Anhui Agricultural Sciences;2009-11
4 LIU Xiao-zhi,HUANG Hou-kuan,SHANG Wen-qian(School of Computer and Information Technology,Beijing Jiaotong University, Beijing 100044,China);Feature Selection with Term Library[J];Journal of Beijing Jiaotong University;2006-02
5 SUN Jian, WANG Wei, ZHONG Yi xin (Information Engineering School, Beijing University of Posts and Telecommunications, Beijing 100876, China);Automatic Text Categorization Based on K-Nearest Neighbor[J];Journal of Beijing University of Posts and Telecommunications;2001-01
6 LI Ning,XU Hong(Dept.of Computers,CUIT,Chengdu 610225,China);Application of semantic smoothing based on categorization to language model[J];Journal of Chengdu University of Information Technology;2008-03
7 Xiong Xiaomei Liu Yonglang (Jiangxi BlueSky University,Nanchang 330098);Application of quadratic dimension reduction method based on LSA in classification of the chinese legal text[J];Electronic Measurement Technology;2007-10
8 Zheng De-quan Li Sheng Zhao Tie-jun Yu Hao (MOE-MS Key Laboratory of Natural Language Processing and Speech, Harbin Institute of Technology, Harbin 150001, China);Research on Automatic Text Classification Based on a Hybrid Language Model[J];Journal of Electronics & Information Technology;2007-03
9 SHI Lei,HU Xiao-hong,XI Lei(College of Information and Management Science,HeNan Agricultural University,Henan Zhengzhou 450002);Naive Bayes Classification Algorithm and its Application Research[J];CD Technology;2008-11
10 TANG Yi-fang,NIU Li,FU Sai-xiang,YAN Xiao-wei(The Key Laboratory of Intelligent Information Processing,Institute of Computing Technology,Beijing 100080,China; Department of Computer Science,Guangxi Normal University,Guilin 541004,China);AUTOMATED TEXT CLASSIFICATION[J];Journal of Guangxi Normal University(Natural Science);2001-04
China Proceedings of conference Full-text Database 10 Hits
1 ZHU Yan-hui, WANG Ping, ZHOU Yong-mei (Department of Computer Science and Technology, Hunan University of Technology, Zhuzhou 412008 China);An Automatic Chinese Web Information Retrieving System Based on Agent[A];[C];2006
2 Chenggen Shi and Jie Lu Faculty of Information Technology, University of Technology, Sydney Po Box 123, Broadway, NSW 2007, Australia;An Information Retrieval Model by Using Weighting Technology[A];[C];2003
3 Nuanwan Soonthornphisaj, Kanokwan Chaikulseriwat, Piyanan Tang-On Department of Computer Science,Faculty of Science, Kasetsart University Bangkok, Thailand;Anti-Spam Filtering: A Centroid-Based Classification Approach[A];[C];2002
4 SHI Hong-Bo;WANG Zhi-Hai;HUANG Hou-Kuan;Jing Li-Ping School of Computer and Information Technology, Northem Jiaotong University, Beiing, 100044;Text Classification Based on the TAN Model[A];[C];2002
5 Huang Ke;Ma Shaoping State Key Lab of Intelligent Technology and Systems, Department of Computer Science and Technology, Tsinghua University, 100084, Beijing, China;Text Categorization Based On Concept Indexing and Principal Component Analysis[A];[C];2002
6 Son Doan and Susumu Horiguchi Graduate School of Information Science Japan Advance Institute of Science and Technology Asahidai 1-1, Tatsunokuchi, Ishikawa 923-1292, Japan Graduate School of Information Science Tohoku University, Aoba 09, Sendai, 980-8579, Japan;A COMPARATIVE STUDY OF ROCHIO AND NAIVE BAYES ALGORITHMS ON REUTERS DATASET IN TEXT CATEGORIZATION[A];[C];2005
7 Zhou Xuezhong Fang Qing Wu Zhaohui College of Computer Science,Zhejiang University,Hangzhou 310027;A Comparative Study on Text Representation and Classifiers in Chinese Text Categorization[A];[C];2003
8 Liu Gongshen Li Jianhua Li Shenghong (School of Information Security Engineering,Shanghai Jiantong University,Shanghai 200030);New Feature Selection and Weighting Methods Based on Category Information[A];[C];2004
9 Wuzheng Lv Xiaoli Jin Yaohong ( Linguistry Management Institute&Com.Ltd, Dazheng, Beijing, Beijing, 100081 ) ( Institute of Acoustics, CAS, Beijing, 100080);Discussion about Introducing HNC Domain into Text Categorization[A];[C];2005
10 Xinhao WANG, Dingsheng LUO, Xihong WU, Huisheng CHI National Laboratory on Machine Perception, School of Electronics Engineering & Computer Science, Peking University, No.5 Summer Palace Road, Handian District, Beijing, 100871;Improving Chinese Text Categorization by Outlier Learning[A];[C];2005
©2006 Tsinghua Tongfang Knowledge Network Technology Co., Ltd.(Beijing)(TTKN) All rights reserved