Full-Text Search:
Home|Journal Papers|About CNKI|User Service|FAQ|Contact Us|中文
《Journal of Chinese Information Processing》 2004-04
Add to Favorite Get Latest Update

Rule-based Automatic Category Application on Text Category

LI Yu qin 1,SUN Li hua 2 (1.Beijing Information Technology Institute,Beijing 100101, China; 2.TRS Infromaton Technology Limited Company,Beijing 100101, China)  
The technique of text automatic category is to classify texts into one or more classes according to some strategy.This paper firstly reports three kinds of technique of text automatic category based on statistic ( k nearest neighbor ,support vector machine and nave bayes),and analyses their advantages and disadvantages.The weakness of statistic based automatic category is the category precision decrease while the character intersect within classes increase, especially in the case of multi layers classifying. In order to improve statistic based automatic category performance, rule based automatic category is used. we combine statistic based category with rule based classifying method , design and realize a system of mixing category lastly, which has and has had very good performance in category.
【CateGory Index】: TP391.1
Download(CAJ format) Download(PDF format)
CAJViewer7.0 supports all the CNKI file formats; AdobeReader only supports the PDF format.
【References】
Chinese Journal Full-text Database 10 Hits
1 Xu Chaojun;Designing an Educational Resources Sharing Platform Supported by Theme-based Retrieval Technology[J];Distance Education in China;2010-03
2 CAI Hua-li,LIU Lu,WANG Li(School of Economics and Management,Beihang University,Beijing 100191,China);Automated Multiple Hierarchical Classification of Web News of Unexpected Events[J];Journal of Beijing University of Technology;2011-06
3 WANG Bin,PAN Wen-feng (Institute of Computing Technology, Chinese Academy of Sciences,Beiji ng 100080,China);A Survey of Content-based Anti-spam Email Filtering[J];Journal of Chinese Information Processing;2005-05
4 ZOU Juan~1,ZHOU Jing-ye~1,DENG Cheng~1,GAO Nan-sha~2(1.Information Engineering College of Xiangtan University,Xiangtan,Hunan 411105,China;2.Software institute of Dongnan University,Nanjing,Jiangsu 210000,China);A New Method for Synonymous Processing in Feature Word Extraction of Text Categorization[J];Journal of Chinese Information Processing;2005-06
5 WANG Ji-ming,YANG Guo-lin(College of Information Engineering,Inner Mongolia University of Technology,Huhhot 010051,China);Research and Application of Web Mining in Building Website[J];Journal of Inner Mongolia Normal University(Natural Science Edition);2007-02
6 XU Guixian1,2,XIANG Chuncheng1,WENG Yu1,2,ZHAO Xiaobing1,2,YANG Guosheng1(1.College of Information Engineering,Minzu University of China,Beijing 100081,China;2.Minority Languages Branch,National Language Resource Monitoring & Research Center,Beijing 100081,China);Automatic Text Classification of Tibetan Web Pages Based on Column[J];Journal of Chinese Information Processing;2011-04
7 ;文本自动分类技术及其对图书馆学的影响[J];;2006-09
8 Tan Jinbo(Department of Educational Technology, Shandong Normal University, Jinan 250014, China);A Rule-Based Classification Approach of Web Pages Using Ontology[J];New Technology of Library and Information Service;2007-03
9 Shi Congying Xu Chaojun Yang Xiaojiang(Department of Educational Technology,Nanjing Normal University,Nanjing 210097,China);Preschool Integrated Education Resources Classification Based on Rule and Rocchio Classifier[J];New Technology of Library and Information Service;2009-Z1
10 Xia Yan He Lin Pan Yunlai Ouyang Chenchen(College of Information and Technology,Nanjing Agricultural University,Nanjing 210095,China);Research on Recognition of Sudden Events on Web Based on Combination of Rules and Statistical Method[J];New Technology of Library and Information Service;2010-10
China Proceedings of conference Full-text Database 1 Hits
1 Sun Lihua Xiao shibin Shi shuicai TRS Information Technology Limited Company, Beijing 100101;Rule Category Technology Based on Vector Space Model[A];[C];2005
【Citations】
Chinese Journal Full-text Database 6 Hits
1 YUE Xi-Cai; WU Xiao-Yu;ZHENG Chong-Xun; YE Da-Tian (Department of Electrical Engineering, Tsinghua University, Beijing 100084) (Institute of Biomedicine Engineering, Xi'an Jiaotong University, Xi'an 710049);A NEURAL NETWORK METHOD OF CLASSIFICATION FOR LARGE NUMBER OF CATALOGS[J];JOURNAL OF COMPUTER RESEARCH AND DEVELOPMENT;2000-03
2 ;Text Mining on the Internet[J];Computer Science;2000-04
3 Diao Qian Wang Yongcheng Zhang Huihui * He Ji Institute of Computer Technology, Shanghai Jiao Tong University Shanghai 200030 * Bao Zhaolong Library of Shanghai Jiao Tong University Shanghai 200030;Term Weighting and Classification Algorithms[J];JOURNAL OF CHINESE INFORMATION PROCESSING;2000-03
4 HUANG Xuan jing 1 WU Li de 1 Ishizaki Hiroyuki 2 XU Guo wei 2 (1.Dept.of Computer Science,Fudan University Shanghai 200433; 2.FRDC Beijing 100081);Language Independent Text Categorization[J];JOURNAL OF CHINESE INFORMATION PROCESSING;2000-06
5 LI Hui 1,2 SHI Zhong zhi 1 XU Zhuo qun 2 (1.Key Laboratory of Intelligent Information Processing, The Institute of Computing Technology Chinese Academy of Sciences Beijing 100080 2.Computer Science and Technology Department Peking Univer;Improving the Performance of the Text Classifier Based on Support Vector Machine Using the Common Sense in Text Domain[J];Journal of Chinese Information Processing;2002-02
6 SUN Xue-gang,CHEN Qun-xiu,MA Liang (State Key Laboratory of Intelligent Technology and System Dept. of Computer Science & Technology, Tsinghua University,Beijing 100084,China);Study on Topic-Based Web Clustering[J];Journal of Chinese Information Processing;2003-03
【Co-references】
Chinese Journal Full-text Database 10 Hits
1 CHENG Wei-hua,YOU Jin-yuan(School of Software Engineering,Shanghai Jiaotong University,Shanghai 200030,China);The design and implementation of content-based anti-spam E-mail system[J];Journal of Anhui University(Natural Sciences);2007-03
2 ZHANG Jie~1,ZHAN Xue-gang~2,FENG Jin-ping~1,CHEN Wen-liang~3 (1.MOR Teaching and Researching Department,Artillery College of PLA,Hefei 230031,China; 2.Network Center,Anshan University of Science and Technology,Anshan 114044,China; 3.Institute of Computer Software,Northeastern University,Shenyang 110004,China);Evaluation of classifiers for Chinese text categorization[J];Journal of Anshan University of Science and Technology;2005-Z1
3 LI Wen-bin, LIU Chun-nian, HUANG Jia-jin ( Multimedia and Intelligent Software Technology Lab, College of Computer Science, Beijing University of Technology, Beijing 100022, China );Junk E-mail Filtering Method Based on Data Mining[J];Journal of Beijing Polytechnic University;2003-02
4 GONG Han-ming,ZHOU Chang-sheng (Department of Computer Science & Automation,Beijing Institute of Machinery, Beijing 100085, China);Chinese word segmentation system research[J];Journal of Beijing Institute of Machinery;2004-03
5 SONG Li-zhe~1,NIU Zhen-dong~(2,3),SONG Han-tao~1,YU Zheng-tao~1,SHI Xue-lin~1 (1.Department of Computer Science and Engineering, School of Information Science and Technology, Beijing Institute of Technology, Beijing100081,China; 2.School of Computer Software, Beijing Institute of Technology, Beijing 100081, China; 3.Beijing National Library Digital Technology Corp Ltd, Beijing100083,China);Study on the User Profile of Personalized Service in Digital Library[J];Journal of Beijing Institute of Technology;2005-01
6 LI Guan-jun~1,CHEN Xue-song~2,XU Jian-suo~3(1.School of Management,Tianjin University,Tianjin300072,China;2.School of Management,University of Beijing Science and Technology,Beijing100083,China;3.Henan Electricity Power Corporation,Zhengzhou,Henan450015,China);Method and Application of Decreasing Text Feature Based on Pattern Aggregation[J];Journal of Beijing Institute of Technology;2005-12
7 ZHONG Yi-xin (Center of Intelligence Science and Technology Research, Beijing University of Posts and Telecommunications, Beijing 100876, China);Comprehensive Information Based Methodology for Natural Language Understanding[J];Journal of Beijing University of Posts and Telecommunications;2004-04
8 Zhang Junli Zhang Fan(Department of Information Management,HuaZhong Normal University,Wuhan,Hubei,430079);The Application of KNN-FCM Clustering Algorithm in Text Filtering of Chinese Search Engine[J];Library and Information;2007-04
9 LIU Ming-chuan,PENG Chang-sheng(Chongqing University of Posts and Telecommunications,Chongqing 400065,P.R.China);Research on mail filter algorithm based on bayes probability model[J];Journal of Chongqing University of Posts and Telecommunications;2005-05
10 LIN Hong fei,ZHAN Xue gang,YAO Tian shun (School of Information Science and Engineering, Northeastern University, Shenyang 110006,China);Features Navigation for Chinese Text Mining[J];JOURNAL OF NORTHEASTERN UNIVERSITY;2000-03
【Secondary References】
Chinese Journal Full-text Database 10 Hits
1 CHENG Wei-hua,YOU Jin-yuan(School of Software Engineering,Shanghai Jiaotong University,Shanghai 200030,China);The design and implementation of content-based anti-spam E-mail system[J];Journal of Anhui University(Natural Sciences);2007-03
2 TAI De-yi,XIE Fei,HU Xue-gang(School of Computer and Information,Hefei University of Technology,Hefei 230009,China);Text categorization based on position weight of feature term[J];Journal of Anhui Technical College of Water Resources and Hydroelectric Power;2008-01
3 HUANG Wen-liang1,2,LI Shi-jian1,LIU Ju-xin1,XU Cong-fu1(1.College of Computer Science,Zhejiang University,Hangzhou 310027,China;2.Zhejiang Branch of China Unicom Corporation Limited,Hangzhou 310006,China);A Large-Scale Online Spam Short Message Filtering System[J];Journal of Beijing University of Posts and Telecommunications;2008-03
4 DONG Zhen-xing1,2,LI Rong1,CHEN Long1(1.College of Computer Science and Technology,Chongqing University of Posts and Telecommunications,Chongqing 400065,P.R.China;2.School of Information Science and Technology,SouthWest JiaoTong University,Chengdu 610031,P.R.China);An email filtering method based on active learning and TCM-EKNN[J];Journal of Chongqing University of Posts and Telecommunications(Natural Science Edition);2011-01
5 LIU Yang (College of Information Science and Engineering,Bohai University,Jinzhou 121013,China);Study on spam email treatment model based on Bayesian method[J];Journal of Changchun Institute of Technology(Natural Sciences Edition);2007-03
6 TIAN Lin(Department of Computer,Sichuan University,Chengdu 610065,China);An Active Model Spam Filtering Technology Based on SMTP Session Control[J];Journal of Chuxiong Normal University;2009-06
7 SUN Jing-tao1,2,ZHANG Qiu-yu1,YUAN Zhan-ting1,and DONG Jian-she1(1.College of Computer and Communication,Lanzhou University of Technology Lanzhou 730050; 2.Gansu Oil Products Company,China Petroleum & Chemical Corporation Lanzhou 730030);Application of Game Theory for Email Feature Selection[J];Journal of University of Electronic Science and Technology of China;2011-01
8 DU Yingguo,SUN Siliang(College of Mathematics and Computer Science,Dali University,Dali,Yunnan 671003,China);Personal Information Constructing Method and Sharing Research Based on Ontology[J];Journal of Dali University;2010-04
9 ;Methods of Feature Weighted Value Computing based on Text Representation[J];Computer Development & Applications;2008-02
10 ZHUANG Suo-fa;CHEN Xin-mei (College of Sciechces,Anhui Science and Technology University,Fengyang 233100,China);Resource of Preventing Junk Mails at Network Terminal[J];Computer Knowledge and Technology;2006-23
China Proceedings of conference Full-text Database 4 Hits
1 Huang Wenliang1,2 Li Shijian1 Liu Jiuxin1 Xu Congfu1 (1 College of Computer Science, Zhejiang University, Hangzhou, Zhejiang, 310027; 2 Zhejiang Branch of China Unicom Corporation Lid, Hangzhou, Zhejiang, 310006);The Designing and Realizing of Large-Scale Online Spam Message Filtering System[A];[C];2008
2 Sui Su Hongfei Lin Zheng Ye Department of Computer Science and Engineering,Dalian University of Technology,Dalian 116024;Character-based Language Modeling Approach for Spam Filtering[A];[C];2008
3 LI Jin~1 YUE Kun~2 HANG Fei-lu~1 (School of Software,Yunnan University,Kunming 650091,China)1 (School of Information Science and Engineering,Yunnan University,Kunming 650091,China)2;Method for Filtering Chinese Spam Based on the Adaptive Markov Model[A];[C];2008
4 Yan LI Department of Information Engineering Henan Technical College of Construction ZhengZhou City,China;Bagging eEP-based classifiers for junk mail classification[A];[C];2010
【Secondary Citations】
Chinese Journal Full-text Database 3 Hits
1 HAN Ke song WANG Yong cheng CHEN Gui lin (School of Electronics & Information,Shanghai Jiaotong University Shanghai 200030) E mail:HKS80916@MAIL1.SJTU.EDU.CN;Research on Fast High-frequency Strings Extracting and Statistics Algorithm with no Thesaurus[J];Journal of Chinese Information Processing;2001-02
2 Jun Wu, Zuoying Wang, Feng Yu, Xia Wang(Department of Electronic Engineering,Tsinghua UniversityBeijing 100084,P.R.China);Automatic Classification of Chinese Texts[J];JOURNAL OF CHINESE INFORMATION PROCESSING;1995-04
3 Zou TaoWang JichengHuang YuanZhang Fuyan Department of Computer Science and TechnologyNanjing UniversityNanjing210093Email:tzou@graphics.nju.edu.cn;The Design and Implementation of an Automatic Chinese DocumentsClassification System[J];JOURNAL OF CHINESE INFORMATION PROCESSING;1999-03
©2006 Tsinghua Tongfang Knowledge Network Technology Co., Ltd.(Beijing)(TTKN) All rights reserved