Full-Text Search:
Home|Journal Papers|About CNKI|User Service|FAQ|Contact Us|中文
《Journal of Chinese Information Processing》 1990-01
Add to Favorite Get Latest Update

A Rule-Based Chinese Automatic Segmenting System

Yao Tian-Shun, Zhang Gui-Ping & Wu Ying-Ming(Northeast University of Technology)  
By means of analysis to the difficulty of the Chinese Automatic segmenting words, this paper discussed the relation between the word frequency and combinational ability. Put forward a set of the Chinese automatic segmenting method, machine segmenting and semantic correction.The system has been set up the list of absolute segmenting marks; changable length maximum matching method;2-3-1 priority rule set; intrinsic ambiguous correction and combinational ambiguous correction, etc.Some examples used the rules are given. This system is a part of CETRAN.A and programmed in C language at SUN 3-280 workstation.
Download(CAJ format) Download(PDF format)
CAJViewer7.0 supports all the CNKI file formats; AdobeReader only supports the PDF format.
【References】
Chinese Journal Full-text Database 10 Hits
1 Zhu Jingbo,Yao Tianshun;Chinese Information Automatic Extraction[J];JOURNAL OF NORTHEASTERN UNIVERSITY;1998-01
2 ZHAO Chun-hong 1,GAO Xi-long 2,WANG Ning 1,3,ZHAO Wei 1,LIU Guo-hua 1(1.College of Information Science and Engineering,Yanshan University,Qinhuangdao,Hebei 066004,China;2.Hebei Vocation Institute of Architecture and Material Technolgy,Qinhuangdao,Hebei 066004,China;3.Department of Computer,Qiqihar University,Qiqihar,Heilongjiang 161006,China);A Chinese segmention method by applying divide-and-conquer strategy[J];Journal of Yanshan University;2009-05
3 Sun, Maosong, and Zou Jiayan;A critical appraisal of the research on Chinese word segmentation[J];Contemporary Linguistics;2001-01
4 Wang Xiukun, Li Zheng, Jian Youliang, Liu Jian ( Research Institute of Computer Technology, DUT );Machine translation dictionary based on Hash method[J];JOURNAL OF DALIAN UNIVERSITY OF TECHNOLOGY;1996-03
5 LIU Li-dong (Department of Computer Dezhou University, Dezhou Shandong 253023,China);Method to pick-up professional phrases in information sources[J];;2002-02
6 LIU Li-dong(Department of Computer Dezhou University, Dezhou Shandong 253023,China);Chinese word segmentation arithmetic based on the degree of combination[J];Journal of Dezhou University;2003-02
7 SU Fang-zhong, LIN Shi-ping(College of Mathematics and Computer Science, Fuzhou University, Fuzhou, Fujian 350002, China);The research and implementation on a Chinese automatic word-segment algorithm in Web text mining[J];Journal of Fuzhou University(Natural Sciences Edtion);2004-S1
8 FU Yan-mei(Electronical & Informational School,Anyan 455000,China);Chinese Word Segmentation Of Intelligent Question Answering Systems[J];Journal of Hubei University of Technology;2009-01
9 CHEN Ming-hua,YIN Jing-hua,SHU Chang,WANG Ming-jiang(School of Applied Sciences,Harbin University of Science and Technology,Harbin 150080,China);A Chinese word segmentation system design based on forward-backward maximum matching algorithm[J];Information Technology;2009-06
10 XU Ai-ping1,OUYANG Hong-tao2(1.School of Computer,Wuhan University,Wuhan 430079,China;2.Shenzhen SED ARC Co.,Ltd,Shengzhen 518057,China);An algorithm to identify semanteme of GIS chinese inquiry sentences in surface layer[J];Journal of Harbin Institute of Technology;2009-01
China Proceedings of conference Full-text Database 2 Hits
1 SuiYan ZhangPuLanguage Information Processing Institution, Beijing Language and Culture University, Beijing 100083;A Preparatory Study on Distilling “Valid Character Strings” Based on “Dynamic Corpus”[A];[C];2001
2 ;A Revised BMM and RMM Algorithm of Chinese Automatic Words Segmentation[A];[C];2006
【Co-references】
Chinese Journal Full-text Database 10 Hits
1 SU Pei\|cheng (Department of Chinese, Peking University, Beijing, 100871, China);Toward Modernization of Chinese Language in the 21th Century[J];Journal of Peking University(Humanities and Social Sciences);2001-01
2 Ding Feng Dong Na Lin Biqin Yuan Baozong (College of Electronics and Information Engineering, Northern Jiaotong University, Beijing 100044);Automatic Segment in Natural Language Processing System[J];JOURNAL OF NORTHERN JIAOTONG UNIVERSITY;1999-06
3 GONG Han-ming,ZHOU Chang-sheng (Department of Computer Science & Automation,Beijing Institute of Machinery, Beijing 100085, China);Chinese word segmentation system research[J];Journal of Beijing Institute of Machinery;2004-03
4 Liang Nanyuan;WRITTEN CHINESE AUTOMATIC DISTINGUISHING WORDS & A AUTOMATIC DISTINGUISHING WORDS SYSTEM-CDWS[J];;1984-04
5 Liang Nanyuan;THE KNOWLEDGE OF CHINESE WORDS AUTOMATIC SEGMENTATION[J];;1988-04
6 SONG Li-zhe~1,NIU Zhen-dong~(2,3),SONG Han-tao~1,YU Zheng-tao~1,SHI Xue-lin~1 (1.Department of Computer Science and Engineering, School of Information Science and Technology, Beijing Institute of Technology, Beijing100081,China; 2.School of Computer Software, Beijing Institute of Technology, Beijing 100081, China; 3.Beijing National Library Digital Technology Corp Ltd, Beijing100083,China);Study on the User Profile of Personalized Service in Digital Library[J];Journal of Beijing Institute of Technology;2005-01
7 ZHANG Feng, FAN Xiao-zhong(Department of Computer Science and Engineering, School of Information Science and Technology, Beijing Institute of Technology, Beijing100081, China);Resolution of Overlapping Ambiguity Strings Based on Maximum Entropy Model[J];Journal of Beijing Institute of Technology;2005-07
8 XUE Wei-min~(1,2),LU Yu-chang~2(1.Automation College of Beijing Union University,Beijing 100101,China;2.Department of Computer Science and Technology,Tsinghua University,Beijing 100084,China);Research On Text Data Mining[J];Journal of Beijing Union University;2005-04
9 LI Lei, SUN Chun kui, YANG Xiao lan, ZHONG Yi xin (Department of Information Engineering, Beijing University of Posts and Telecommunications, Beijing 100876, China)Abstract: A understanding based Chinese automatic;Understanding Based Chinese Automatic Abstracting System in Special Domain[J];JOURNAL OF BEIJING UNIVERSITY OF POSTS AND TELECOMMUNICATIONS;2000-01
10 BAO Hai-long LI Jin-lin (Beijing Institute of Technology, Beijing: 100081);A Study of the Identify Method about IPC and Theme Terms on Patent Retrieval[J];Journal of Beijing Institute of Technology(Social Sciences Edition);2003-05
China Proceedings of conference Full-text Database 2 Hits
1 Shan GAO, Yan ZHANG, Bo XU, ChengQing ZONG, ZhaoBing HAN,National Lab of Pattern Recognition Institute of Automation,chinese Academy of Sciences,Beijing 100080;The Research on Integrated Chinese Words Segmentation and Labeling based on Trigram Statistical Model[A];[C];2001
2 PU YuDa, GUAN Yi,WANG Qiang (Dept. of Computer Science and Technology, Harbin Institute of Technology;Heilongjiang,Harbin);Information Extraction from Web Page Based on Data Mining Thought[A];[C];2006
【Secondary References】
Chinese Journal Full-text Database 10 Hits
1 MA Zhi-qiang,ZHOU Chang-sheng,DING Wei,YANG Na (Department of Computer Science & Automation,Beijing Institute of Machinery,Beijing 100085,China);The research and implement of campus net search engine[J];Journal of Beijing Institute of Machinery;2007-01
2 LI Guo-he1,LIU Guang-sheng1,WU Wei-jiang1,SUN Hong-jun2,3,TANG Xian-ming2,3,HAN Bao-dong2,3(1.College of Geophysics and Information Engineering,China University of Petroleum,Beijing 102249,China;2.The State Key Laboratory of Petroleum Resource and Prospecting3.Research Institute of Petroleum Exploration and Development,Sinopec,Beijing 100083,China);Method of Chinese word rough segmentation based on maximum match and ambiguity detection[J];Journal of Beijing Information Science & Technology University;2010-S2
3 Zhang Fan Lin Jian(Department of Information Management,HuaZhong Normal University,Wuhan,Hubei,430079);Research on Filtering Mechanism in Intelligent Search Engine[J];Library and Information;2007-04
4 GE Yu1,Liang Jing2,CHEN Xiaomin2(1.College of Fundamental Education,Sichuan Normal University,Chengdu 610068,China;2.Information and Calculation Science Department,Chengdu Electromechanical College,Chengdu 610031,China);A Probe Into the Hot Issues in the Search Engine System[J];Journal of Chengdu Electromechanical College;2009-04
5 LIAO Wei-li,CHEN Lin,YANG Xin(School of Software Engineering,Chongqing University,Chongqing 400044,China);Strategy of Order Relation Scheduling on Mobile Value-added Service Platform[J];Journal of Chongqing Institute of Technology(Natural Science);2009-09
6 GUAN Li-he~1,YANGGang~2,LI Yong-li~2 ( 1.Department of Computera and Information, Chongqing Jiaotong University, Chongqing 400074,China; 2.School of Information Science and Engineering, Lanzhou University, Gansu Lanzhou 730000,China);Developing of the law-case automatic categorizing system based on law lexicons[J];Journal of Chongqing Jiaotong University;2004-01
7 GAO Bo(Yanling School,Changzhou Institute of Technology,Changzhou 213002);Generation Algorithm of Professional Information Database Based on Corpus Statistics Tree[J];Journal of Changzhou Institute of Technology;2009-Z1
8 ZHAO Chun-hong 1,GAO Xi-long 2,WANG Ning 1,3,ZHAO Wei 1,LIU Guo-hua 1(1.College of Information Science and Engineering,Yanshan University,Qinhuangdao,Hebei 066004,China;2.Hebei Vocation Institute of Architecture and Material Technolgy,Qinhuangdao,Hebei 066004,China;3.Department of Computer,Qiqihar University,Qiqihar,Heilongjiang 161006,China);A Chinese segmention method by applying divide-and-conquer strategy[J];Journal of Yanshan University;2009-05
9 Sun, Maosong, and Zou Jiayan;A critical appraisal of the research on Chinese word segmentation[J];Contemporary Linguistics;2001-01
10 ZHANG Li~(*1),ZHANG Li-yong~1,ZHANG Xiao-miao~1,GENG Tie-suo~2,YUE Zong-ge~3(1.School of Electr.and Inf.Eng.,Dalian Univ.of Technol.,Dalian 116024,China;2.Nat.Assets Adm.Office,Dalian Univ.of Technol.,Dalian 116024, China;3.Hosp.of Dalian Univ.of Technol.,Dalian 116024,China);Research on ambiguous words segmentation algorithm based on improved BP neural network[J];Journal of Dalian University of Technology;2007-01
China Proceedings of conference Full-text Database 10 Hits
1 Wang Huihui, Yang Guowei ( College of Computer Science and Engineering, University of Electronic Science and Technology of China, ChengDu, 610054);Research of Qestion Answering System Based On The Example[A];[C];2005
2 WU Hong-ping ZHOU Guo-xiang College of Computer and Information ,HeFei University of Technology, HeFei, 230009, China;Research on Web Text Mining[A];[C];2007
3 Xiaodan Zhu, Qian Diao & Zhou Joe FIntel China Research Center;A Two-character Hash Function For Chinese Words[A];[C];2001
4 SuiYan ZhangPuLanguage Information Processing Institution, Beijing Language and Culture University, Beijing 100083;A Preparatory Study on Distilling “Valid Character Strings” Based on “Dynamic Corpus”[A];[C];2001
5 Xu Chao Chen Xiaohe School of Literature, Nanjing Normal University, Nanjing, 210097;Comment on the Chinese Analysis Ability of Two Commercial Machine Translation Software[A];[C];2002
6 Changning Huang, Jianfeng Gao and Mu Li Microsoft Research Asia;Reconsidering on Chinese Word Segmentation[A];[C];2003
7 Wang Houfeng Institute of Computational Linguistics. Peking University, Beijing, 100871;Evaluation on Chinese Tokenization in Machine Translation[A];[C];2003
8 Yang ErHong Fang Ying Qiao Yu(Beijing Language University, Beijing 100083, China); ~2(ShanXi University, TaiYuan 030006, China);;The Evoluation of Modern Chinese Segmentation and POS Tagging[A];[C];2004
9 ZhENG Min, CAI Lianhong (Department of Computer Science and Technology of Tsinghua University, Beijing, 100084);A New Rule-based Method of automatic phonetic notation on polyphones[A];[C];2004
10 LI Jiang-bo ZHOU Qiang CHEN Zu-shun The State Key Laboratory of Intelligent Technology and Systems. Department of Computer Science and Technology. Tsinghua University. Beijing 100084;An Study on Rapid Algorithm for Chinese Dictionary Query[A];[C];2004
©2006 Tsinghua Tongfang Knowledge Network Technology Co., Ltd.(Beijing)(TTKN) All rights reserved