《Journal of Changsha University》 2013-05

Application of Improved Gini Index in the Text Classification

TANG Wei; LIU Fengnian; CHEN Chongbang; OU Xinliang; WANG Su (College of Computer and Communication, Hunan University of Technology; Department of Computer Science and Technology, Changsha University)
In this paper, the TF-IDF algorithm is used for text preprocessing, and the measure function of the traditional Gini coefficient method is improved according to the purity principle of the Gini coefficient, so as to reduce the dimensionality of the original text feature space. A comparison of the experimental data indicates that the improvement is feasible and effective: the time and space complexity are low and the precision is high.
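【Fund】: Supported by the Natural Science Foundation of Hunan Province (Grant No. 11JJ3002); Key Science and Technology Project of the Education Department of Hunan Province (Grant No. 09A010)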
【Category Index】: TP391.1
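
The abstract names two building blocks, TF-IDF weighting and Gini-based feature selection, without giving formulas. The sketch below illustrates one common textbook formulation of each and is not the authors' improved measure, which the abstract does not specify. It assumes documents are already tokenized, estimates P(c|t) from document counts, and scores a term by the purity sum of P(c|t)^2 over classes; the function names `tf_idf`, `gini_purity`, and `select_features` are hypothetical.

```python
import math
from collections import Counter, defaultdict


def tf_idf(docs):
    """Return one {term: weight} dict per document (docs are token lists)."""
    n_docs = len(docs)
    # Document frequency: number of documents containing each term.
    df = Counter(term for doc in docs for term in set(doc))
    weights = []
    for doc in docs:
        tf = Counter(doc)
        weights.append(
            {t: (c / len(doc)) * math.log(n_docs / df[t]) for t, c in tf.items()}
        )
    return weights


def gini_purity(docs, labels):
    """Score each term by sum_c P(c | t)^2 (assumed purity form, not the paper's).

    A score of 1.0 means the term occurs in documents of a single class only.
    """
    per_term = defaultdict(Counter)
    for doc, label in zip(docs, labels):
        for term in set(doc):
            per_term[term][label] += 1
    scores = {}
    for term, counts in per_term.items():
        total = sum(counts.values())
        scores[term] = sum((n / total) ** 2 for n in counts.values())
    return scores


def select_features(docs, labels, k):
    """Keep the k purest terms; ties are broken alphabetically."""
    scores = gini_purity(docs, labels)
    ranked = sorted(scores, key=lambda t: (-scores[t], t))
    return ranked[:k]


if __name__ == "__main__":
    # Toy corpus, invented for illustration only.
    docs = [
        ["goal", "match", "team"],
        ["match", "team", "score"],
        ["stock", "market", "team"],
        ["market", "price", "stock"],
    ]
    labels = ["sports", "sports", "finance", "finance"]
    print(select_features(docs, labels, 3))
    print(tf_idf(docs)[0])
```

Keeping only the top-ranked terms shrinks the feature space before classification. Note that this plain purity score also rewards rare terms that appear in a single document, a known weakness of the unmodified form.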