《Journal of Jinan University(Natural Science & Medicine Edition)》 2009-01

A rare-class classification approach based on Clustering and Ripper

YU Wen 1, JIANG Sheng-yi 1, HUANG Xing-quan 2 (1. College of Information, Guangdong University of Foreign Studies, Guangzhou 510006, China; 2. Guangdong Lancoo Co, Limited, Guangzhou 510540, China)
Rare-class classification is an important issue in many real-life applications. Because rare-class data make up only a small proportion of the whole dataset, they are easily ignored during classification. This paper applies a rare-class classification approach based on clustering and Ripper. The approach first clusters the data and treats every cluster whose share of the whole dataset is below 15% as rare-class data. The Ripper algorithm is then used to classify the rare-class data and the normal-class data separately. A rule set for the whole dataset is then generated from the models built in the preceding steps. Experiments on benchmark datasets from the UCI Machine Learning Repository show that the approach produces high-quality classification. The approach can also be applied to rare-class classification in practical applications.
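The cluster-selection step described in the abstract can be illustrated with a minimal sketch. It assumes cluster assignments are already available as one cluster id per record; the function name `split_rare_clusters` and the sample data are hypothetical and do not come from the paper. Only the 15% threshold is taken from the abstract. The clustering itself and the Ripper rule learning are not shown.

```python
from collections import Counter


def split_rare_clusters(cluster_ids, threshold=0.15):
    """Split record indices into rare and normal groups.

    cluster_ids: one cluster id per record (assumed to come from a prior
                 clustering step that is not shown here).
    threshold:   a cluster is treated as rare when its share of all
                 records is strictly below this value (15% in the abstract).
    """
    total = len(cluster_ids)
    if total == 0:
        return [], []

    # Count how many records fall into each cluster.
    sizes = Counter(cluster_ids)

    # Clusters whose proportion of the whole dataset is below the threshold.
    rare_clusters = {c for c, n in sizes.items() if n / total < threshold}

    rare_idx = [i for i, c in enumerate(cluster_ids) if c in rare_clusters]
    normal_idx = [i for i, c in enumerate(cluster_ids) if c not in rare_clusters]
    return rare_idx, normal_idx


# Hypothetical example: 20 records in three clusters of sizes 9, 9 and 2.
cluster_ids = [0] * 9 + [1] * 9 + [2] * 2
rare_idx, normal_idx = split_rare_clusters(cluster_ids)
```

In this example cluster 2 holds 2 of 20 records (10%), so its two records form the rare partition and the remaining 18 form the normal partition. Each partition would then be passed separately to a rule learner such as Ripper.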
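【Fund】: National Natural Science Foundation of China (60673191); Key Natural Science Research Project of Guangdong Higher Education Institutions (06Z012); Research and Innovation Team Project of Guangdong University of Foreign Studies (GW2006-TA-005)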
【Category Index】: TP18