《Journal of Chinese Information Processing》 2005-05

A Survey of Content-based Anti-spam Email Filtering

WANG Bin, PAN Wen-feng (Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100080, China)
The volume of junk email on the Internet has grown tremendously in the past few years and is causing serious problems. Content-based filtering is one of the mainstream technologies used so far. This paper aims to provide an overview of the state of the art in this research field, including benchmark corpora, evaluation methods and filtering approaches. Many filtering approaches, including Ripper, Decision Trees, Rough Sets, Rocchio, Boosting, Bayes, kNN, SVM and Winnow, are discussed and compared in this paper. The experimental results show that some approaches, such as Boosting, Flexible Bayes, SVM and Winnow, can achieve very good results on research corpora. However, much more work remains to be done for practical use.
【Fund】: Supported by the National 973 Program of China (2004CB318109)
【Category Index】: TP393.098