Full-Text Search:
Home|Journal Papers|About CNKI|User Service|FAQ|Contact Us|中文
《Journal of Chinese Information Processing》 1989-01
Add to Favorite Get Latest Update

On Methods of Chinese Automatic Segmentation

Jie Chunyu, Liu Yuan, Liang Nanyuan  
Automatic segmentation of character string into words is now referred to as another bottle-neck problem after Chinese character code in the field of Chinese information processing, On the base of reviewing and analyzing the previous methods of Chinese automatic segmentation, this article established a structure model ASM(d,a,m)to represent all basic methods systematically. And with this model,two new kinds of basic meshods were put forth. Furthermore,the calculation was made on the time complexity of each basic method;the influence of time complexity upon segmentation speed,and that of each basic method upon segmentation accurracacy and intelligent processing were analyzed in detail. Some wrong points of view on the methods of Chinese automatic segmentation were criticized as well.
Download(CAJ format) Download(PDF format)
CAJViewer7.0 supports all the CNKI file formats; AdobeReader only supports the PDF format.
【References】
Chinese Journal Full-text Database 10 Hits
1 LI Guo-he1,LIU Guang-sheng1,WU Wei-jiang1,SUN Hong-jun2,3,TANG Xian-ming2,3,HAN Bao-dong2,3(1.College of Geophysics and Information Engineering,China University of Petroleum,Beijing 102249,China;2.The State Key Laboratory of Petroleum Resource and Prospecting3.Research Institute of Petroleum Exploration and Development,Sinopec,Beijing 100083,China);Method of Chinese word rough segmentation based on maximum match and ambiguity detection[J];Journal of Beijing Information Science & Technology University;2010-S2
2 ZHAO Chun-hong 1,GAO Xi-long 2,WANG Ning 1,3,ZHAO Wei 1,LIU Guo-hua 1(1.College of Information Science and Engineering,Yanshan University,Qinhuangdao,Hebei 066004,China;2.Hebei Vocation Institute of Architecture and Material Technolgy,Qinhuangdao,Hebei 066004,China;3.Department of Computer,Qiqihar University,Qiqihar,Heilongjiang 161006,China);A Chinese segmention method by applying divide-and-conquer strategy[J];Journal of Yanshan University;2009-05
3 Sun, Maosong, and Zou Jiayan;A critical appraisal of the research on Chinese word segmentation[J];Contemporary Linguistics;2001-01
4 HUANG De\|gen 1,2 , ZHU He\|he 2, WANG Kun\|lun 2, YANG Yuan\|sheng 2, ZHONG Wan\|xie 1 ( 1. Res. Inst. of Eng. Mech., Dalian Univ. of Technol., Dalian 116024, China; 2. Dept. of Comput. Sci. and Eng., Dalian Univ. of Technol., Dalian;Chinese automatic words segmentation based on maximum matching and second\|maximum matching[J];JOURNAL OF DALIAN UNIVERSITY OF TECHNOLOGY;1999-06
5 QU Wei-hua,WANG Qun(School of Information Engineering ,China University of Geosciences(Beijing),Beijing 100083,China);Introduce and Analyzing of Search Engine Principle[J];Computer Knowledge and Technology;2006-35
6 LI Yan-xin (department of computer,Shijiazhuang Railway Institute,Shijiazhuang 050043,China);Research of Chinese Word Segmentation in Search Engine[J];Computer Knowledge and Technology(Academic Exchange);2007-08
7 GUO Yi(School of Software Engineering,Tongji University,Shanghai 201804,China);An Improved Mechanism on the Chinese Word Segmentation[J];Computer Knowledge and Technology;2008-07
8 CUI Hong-yan(School of Information Engineering,Lanzhou Commercial College,Lanzhou 730020,China);Research on an improved Chinese segmentation algorithm based on word frequency statistic[J];Information Technology;2008-04
9 Li Honghin Fong Lianzhong (Dept.of Computer Science and Engineering);The Complexities of the Maximum Matching Method of Words with the Same Head Chineses Character and the Maximum Matching Method[J];Journal of Harbin Institute of Technology;1993-05
10 Qi Wenqing(School of Computer Science,Huangshi Institute of Technology,Huangshi Hubei 435003);An Improved Maximum Matching Method for Chinese Word Segmentation[J];Journal of Huangshi Institute of Technology;2007-04
China Proceedings of conference Full-text Database 2 Hits
1 FENG Xia College of Computer Science & Technology Civil Aviation University of China Tianjin,300300,China TANG Xian-chao College of Computer Science & Technology Civil Aviation University of China Tianjin,300300,China;An Improved Dictionary-based Chinese Word Segmentation Approach in Lucene[A];[C];2010
2 Guo Jing (Beijing Publishing House of Electronics Industry 100036);A Prototype of Search Engine Based on Automatic Chinese Phrase Segmentation[A];[C];2001
【Citations】
Chinese Journal Full-text Database 3 Hits
1 Liang Nanyuan;WRITTEN CHINESE AUTOMATIC DISTINGUISHING WORDS & A AUTOMATIC DISTINGUISHING WORDS SYSTEM-CDWS[J];;1984-04
2 LIANG NANYUAN (Beijing Institute of Aeronautics and Astronautics);AN INTRODUCTION TO AUTOMATIC DISTINGUISHING OF WRITTEN CHINESE WORDS[J];Computer Applications and Software;1987-03
3 Liu Yuan Liang NanyuanDept. of Computer Science. Beijing Instituue ofAeronautics and Astronautics;Basic Engineemig for Chinese Processing——Modern Chinese Words Frequency Count[J];Journal of Chinese Information Processing;1986-01
【Co-citations】
Chinese Journal Full-text Database 10 Hits
1 Liang Nanyuan;THE KNOWLEDGE OF CHINESE WORDS AUTOMATIC SEGMENTATION[J];;1988-04
2 Cheng Hua\ \ Yin Baolin (Beijing University of Aeronautics and Astronautics,Dept. Computer Science and Engineering);DESIGN AND IMPLEMENTATION OF A PINYIN CHINESE WORD CONVERSION SYSTEM[J];JOURNAL OF BEIJING UNIVERSITY OF AERONAUTICS AND ASTRONAUTICS;1996-04
3 Sun, Maosong, and Zou Jiayan;A critical appraisal of the research on Chinese word segmentation[J];Contemporary Linguistics;2001-01
4 Liu Xiaoying (Xiangtan University, Xiangtan 411105,China);The Development Trend of Chinese Automatic Segmentation Research[J];Library Work in Colleges and Universities;2005-04
5 Zhao Fujun; Huang Houkuan ;Yu Jingshan (Dept. of Computer and Information Science);Design of an Expectation-Based Chinese Word Distinguishing Model[J];;1990-02
6 Wu Yan;Li Xiukun ;Wang Kaizhu(Dept. of Computer, HIT);Mathematical Model of Text Semantic Paragraph Partition[J];JOURNAL OF HARBIN INSTITUTE OF TECHNOLOGY;1998-06
7 Wu Shengyuan(Department of Computer Engineering,Shandong University of Technology,jinan 250061);A NEW CHINESE PHRASE SEGMENTATION METHOD[J];JOURNAL OF COMPUTER RESEARCH AND DEVELOPMENT;1996-04
8 Ma Xiaona Yang Chenglei(School of Computer Science and Technology,Shandong University,Ji'nan 250061);Design and Implementation of a Limited Nature Language Query System Based on Object-oriented[J];Computer Engineering and Applications;2005-10
9 ;ON THE METHODS OF AUTOMATIC DISTINGUISHING OF WRITTEN CHINESE WORDS[J];Computer Engineering;1989-06
10 Pan Lingyun and Yang Changsheng(Zhejiang University);AN AUTO-SYSTEM FOR CONVERTING HANYUPINYIN TO CHINESE CHARACTERS[J];Chinese Journal of Computers;1990-04
China Proceedings of conference Full-text Database 2 Hits
1 ;一个特定人手写汉字识别系统的实现[A];[C];2002
2 Chen Xinying,Li Wenwen,Wang Yan,Wang Lu,Kan Minggang Communication University of China 100024;The Application of Chinese Quantitative Characteristics in Comparison of Language Style and Author Judgment—Triple Gates of Han Han and Never Flowers in Never Dreams of Guo Jingming as Examples[A];[C];2010
【Secondary References】
Chinese Journal Full-text Database 10 Hits
1 HU Xi-heng (Anshan Normal University,Anshan Liaoning 114007,China);Research and Design of Spam Filtering System Model[J];Journal of Anshan Normal University;2009-02
2 GENG Xin-qing,TAO Feng-mei,HUANG Hong-guang(Department of Mathematics,Anshan Normal University,Anshan Liaoning 114007,China);Jlppeccz:A New Word Segmentation Algorithm Based on Neiboring Match[J];Journal of Anshan Normal University;2010-04
3 WANG Yu-mei, RUAN Xiao-gang ( College of Electronic Information & Control Engineering. Beijing University of Technology, Beijing 100022, China );Expert System for the Syntax Analysis of Chinese Based on Man's Cognition Behavior[J];Journal of Beijing Polytechnic University;2003-01
4 MA Zhi-qiang,ZHOU Chang-sheng,DING Wei,YANG Na (Department of Computer Science & Automation,Beijing Institute of Machinery,Beijing 100085,China);The research and implement of campus net search engine[J];Journal of Beijing Institute of Machinery;2007-01
5 LI Guo-he1,LIU Guang-sheng1,WU Wei-jiang1,SUN Hong-jun2,3,TANG Xian-ming2,3,HAN Bao-dong2,3(1.College of Geophysics and Information Engineering,China University of Petroleum,Beijing 102249,China;2.The State Key Laboratory of Petroleum Resource and Prospecting3.Research Institute of Petroleum Exploration and Development,Sinopec,Beijing 100083,China);Method of Chinese word rough segmentation based on maximum match and ambiguity detection[J];Journal of Beijing Information Science & Technology University;2010-S2
6 YANG Shu-Lin (Beijing Institute of Graphic Communication, Beijing 102600,China);Design and Implementation of Open Answering Question System Based on Web[J];Journal of Beijing Institute of Graphic Communacation;2005-01
7 FENG Zhea,b,SUN Ji-guia,b,ZHANG Chang-shenga,b,WANG Yana,b(a.College of Computer Science and Technology;b.Key Laboratory of Symbolic Computation and Knowledge Engineering for Ministry of Education,Jilin University,Changchun 130012,China);Research Advance of Chinese Speech Synthesis[J];Journal of Jilin University(Information Science Edition);2007-02
8 GE Yu1,Liang Jing2,CHEN Xiaomin2(1.College of Fundamental Education,Sichuan Normal University,Chengdu 610068,China;2.Information and Calculation Science Department,Chengdu Electromechanical College,Chengdu 610031,China);A Probe Into the Hot Issues in the Search Engine System[J];Journal of Chengdu Electromechanical College;2009-04
9 ZHANG Lin-man,WU Sheng(Spatial Information Research Center,Fujian Province;Key Laboratory of Spatial Data Mining and Information Sharing,Ministry of Education,Fuzhou University,Fuzhou 350002,China);Research on place names and address segmentation in geocoding system[J];Science of Surveying and Mapping;2010-02
10 GUAN Li-he~1,YANGGang~2,LI Yong-li~2 ( 1.Department of Computera and Information, Chongqing Jiaotong University, Chongqing 400074,China; 2.School of Information Science and Engineering, Lanzhou University, Gansu Lanzhou 730000,China);Developing of the law-case automatic categorizing system based on law lexicons[J];Journal of Chongqing Jiaotong University;2004-01
China Proceedings of conference Full-text Database 10 Hits
1 H. Gao, D. G. Huang, Y. S. Yang Department of Computer Science and Engineering, Dalian University of Technology, Dalian, 116024 China;Foreign Person Name Recognition in Chinese Texts[A];[C];2006
2 Wang Huihui, Yang Guowei ( College of Computer Science and Engineering, University of Electronic Science and Technology of China, ChengDu, 610054);Research of Qestion Answering System Based On The Example[A];[C];2005
3 Cao Hong Yuan Jinsheng(Collage of Information, Beijing Forestry University, Beijing 100083);On Research of Multi-Topic Focused Search Engine[A];[C];2004
4 Liu Haiyan He Jing Wang Ziqiang (Department of Information Engineering, Armed Force Enginerring Institute, Beijing 100072);The Design and Implementation of a Secure Web Proxy[A];[C];2004
5 Gua Junjie~1,Wu Shuguo~1,Yi Shengwei~2 1.College of Computer Science and Technology, Beijing University of Technology,Beijing 100124, China. 2.National Laboratory of Software Development Environments,Beihang University,Beijing 100191, China;A Method of Extracting Chinese Term based on Term Effect[A];[C];2009
6 Su Liang Sun Bin School of Telecommunication & Network Technology Beijing University of Posts and Telecommunications,100876;Implementation of an Improved Hash Chinese Words Dictionary Segmentation Algorithm Base on Lucene[A];[C];2007
7 Xiaojun Tong1 , Minggen Cui 1 , Guolong Song 2 1 School of Computer Science & Technology, Harbin Institute of Technology at Weihai, Weihai 264209, CHINA 2 School of Information Science & Engineering, Northeastern University, Shenyang 110004, CHINA;Research on the Model of Integrating Chinese Word Segmentation with Part-of-speech Tagging[A];[C];2007
8 Xiaodan Zhu, Qian Diao & Zhou Joe FIntel China Research Center;A Two-character Hash Function For Chinese Words[A];[C];2001
9 Zhao Tiejun, Li Shen, Meng Yao, Huang Yu, Yang MuyunSchool of Computer Science & Technology Harbin Institute of Technology, Harbin, 15001;Research on Parsing Technology in Machine Translation System[A];[C];2001
10 Xu Chao Chen Xiaohe School of Literature, Nanjing Normal University, Nanjing, 210097;Comment on the Chinese Analysis Ability of Two Commercial Machine Translation Software[A];[C];2002
©2006 Tsinghua Tongfang Knowledge Network Technology Co., Ltd.(Beijing)(TTKN) All rights reserved