Full-Text Search:
Home|Journal Papers|About CNKI|User Service|FAQ|Contact Us|中文
《Computer Knowledge and Technology(Academic Exchange)》 2007-08
Add to Favorite Get Latest Update

Research of Chinese Word Segmentation in Search Engine

LI Yan-xin (department of computer,Shijiazhuang Railway Institute,Shijiazhuang 050043,China)  
The Maximum Matching Method is one of the most used segmentation algorithm, which is low efficiency and has the length is limited. Based on the study of the architecture of Chinese coding and Chinese word segmentation algorithm, a new data structure for Chinese thesaurus is introduced, which supports standard binary search and hashing operation by means of the first Chinese character in a string. Then a faster segmentation algorithm is explained. And the process is also presented.
【CateGory Index】: TP391.3
Download(CAJ format) Download(PDF format)
CAJViewer7.0 supports all the CNKI file formats; AdobeReader only supports the PDF format.
Chinese Journal Full-text Database 2 Hits
1 Jie Chunyu, Liu Yuan, Liang Nanyuan;On Methods of Chinese Automatic Segmentation[J];Journal of Chinese Information Processing;1989-01
2 Guo Xianghao and Zhong Yixin (AI Lab,Beijing University of Post and Telecommunication,Beijing 100876) Yang Li (CECE Center,Northern Jiaotong University,Beijing 100044);A Fast Algorithm for Chinese Words Automatic Segment Based on Two letters word family Structure[J];JOURNAL OF THE CHINA SOCIETY FOR SCIENTIFIC AND TECHNICAL INFORMATION;1998-05
Chinese Journal Full-text Database 10 Hits
1 HU Xi-heng (Department of Mathematics,Anshan Normal University,Anshan Liaoning 114007,China);Application of Maximum Matching Method in Chinese Segmentation Technology[J];Journal of Anshan Normal University;2008-02
2 ZHAO Xiao-fan,HU Shun-yi(School of Computer and Information Engineering,Anyang Normal University,Anyang 455000,China);The Chinese word Segmentation based on Forward Maximum Match Method[J];Journal of Anyang Normal University;2010-05
3 GONG Han-ming,ZHOU Chang-sheng (Department of Computer Science & Automation,Beijing Institute of Machinery, Beijing 100085, China);Chinese word segmentation system research[J];Journal of Beijing Institute of Machinery;2004-03
4 LI Guo-he1,LIU Guang-sheng1,WU Wei-jiang1,SUN Hong-jun2,3,TANG Xian-ming2,3,HAN Bao-dong2,3(1.College of Geophysics and Information Engineering,China University of Petroleum,Beijing 102249,China;2.The State Key Laboratory of Petroleum Resource and Prospecting3.Research Institute of Petroleum Exploration and Development,Sinopec,Beijing 100083,China);Method of Chinese word rough segmentation based on maximum match and ambiguity detection[J];Journal of Beijing Information Science & Technology University;2010-S2
5 GUAN Li-he~1,YANGGang~2,LI Yong-li~2 ( 1.Department of Computera and Information, Chongqing Jiaotong University, Chongqing 400074,China; 2.School of Information Science and Engineering, Lanzhou University, Gansu Lanzhou 730000,China);Developing of the law-case automatic categorizing system based on law lexicons[J];Journal of Chongqing Jiaotong University;2004-01
6 LIU Chun-hui, JIN Shun-fu, LIU Guo-hua, LI Ying (College of Information Science and Engineering, Yanshan University, Qinhuangdao, Hebei 066004, China);A Chinese segmentation method based on optimization maximum matching and statistics[J];Journal of Yanshan University;2009-02
7 ZHAO Chun-hong 1,GAO Xi-long 2,WANG Ning 1,3,ZHAO Wei 1,LIU Guo-hua 1(1.College of Information Science and Engineering,Yanshan University,Qinhuangdao,Hebei 066004,China;2.Hebei Vocation Institute of Architecture and Material Technolgy,Qinhuangdao,Hebei 066004,China;3.Department of Computer,Qiqihar University,Qiqihar,Heilongjiang 161006,China);A Chinese segmention method by applying divide-and-conquer strategy[J];Journal of Yanshan University;2009-05
8 WU Deng-tang (Chinese Language Department, Dandong Vocational Technical College, Dandong 118003, China);On Words Composed of Latin Letters——Also my Proposition on the Processing of Chinese Information and Automatic Words Segment[J];Journal of Dandong Teachers College;2003-02
9 Sun, Maosong, and Zou Jiayan;A critical appraisal of the research on Chinese word segmentation[J];Contemporary Linguistics;2001-01
10 HUANG De\|gen 1,2 , ZHU He\|he 2, WANG Kun\|lun 2, YANG Yuan\|sheng 2, ZHONG Wan\|xie 1 ( 1. Res. Inst. of Eng. Mech., Dalian Univ. of Technol., Dalian 116024, China; 2. Dept. of Comput. Sci. and Eng., Dalian Univ. of Technol., Dalian;Chinese automatic words segmentation based on maximum matching and second\|maximum matching[J];JOURNAL OF DALIAN UNIVERSITY OF TECHNOLOGY;1999-06
China Proceedings of conference Full-text Database 9 Hits
1 Qing-ping HU University of Jiangsu, China;Controlled Language and Its Prospective Application in Chinese-English Machine Translation[A];[C];2005
2 Xiaodan Zhu, Qian Diao & Zhou Joe FIntel China Research Center;A Two-character Hash Function For Chinese Words[A];[C];2001
3 Tao Jianhua Cai Lianhong Zhao Sheng HCI&MI, Dep. of CS, Tsinghua University, Beijing, 100084;Text Analysis and Prosody Processing for Chinese SpeechSynthesis[A];[C];2001
4 Changning Huang, Jianfeng Gao and Mu Li Microsoft Research Asia;Reconsidering on Chinese Word Segmentation[A];[C];2003
5 Yang ErHong Fang Ying Qiao Yu(Beijing Language University, Beijing 100083, China); ~2(ShanXi University, TaiYuan 030006, China);;The Evoluation of Modern Chinese Segmentation and POS Tagging[A];[C];2004
6 Li Shoushan and Huang Chu-Ren The Hong Kong Polytechnic University,Department of Chinese & Bilingual Studies.Hong Kong.;Chinese Word Segmentation Based on Word Boundary Decision[A];[C];2009
7 ;Key Technology Analyzing and Localization Designing of SiteSearch[A];[C];2001
8 FENG Xia College of Computer Science & Technology Civil Aviation University of China Tianjin,300300,China TANG Xian-chao College of Computer Science & Technology Civil Aviation University of China Tianjin,300300,China;An Improved Dictionary-based Chinese Word Segmentation Approach in Lucene[A];[C];2010
9 Guo Jing (Beijing Publishing House of Electronics Industry 100036);A Prototype of Search Engine Based on Automatic Chinese Phrase Segmentation[A];[C];2001
Chinese Journal Full-text Database 10 Hits
1 Ding Feng Dong Na Lin Biqin Yuan Baozong (College of Electronics and Information Engineering, Northern Jiaotong University, Beijing 100044);Automatic Segment in Natural Language Processing System[J];JOURNAL OF NORTHERN JIAOTONG UNIVERSITY;1999-06
2 YANG Xiang-liang YANG Jun-shun CUI Yan-lin (Shaanxi University of Science & Technology,Xi'an 710021,China);Application Research of UI Design in Shaping Product Image[J];Packaging Engineering;2007-09
3 Sun, Maosong, and Zou Jiayan;A critical appraisal of the research on Chinese word segmentation[J];Contemporary Linguistics;2001-01
4 Zhang Guoxuan Wang Xiaohua Zhou Bishui Hangzhou Institute of Electronic Engineering,310037;A Fast Automatic Word Segmentation System for Chinese Characters and Its Algorithm Design[J];Journal of Computer Research and Development;1993-01
5 Zhan Yan Chen Hao Yuan Fang Wang Xizhao(Faculty of Mathematics and Computer Science,Hebei University,Baoding071002);Word Segmentation Method Research Based on Chinese Text Classification[J];Computer Engineering and Applications;2003-23
6 HE Sheng,QU Wei-guang,XU Chao. 1.School of Chinese Language and Literature, Nanjing Normal University, Nanjing 210097, China 2.Deptartment of Computer Science, Nanjing Normal University, Nanjing 210097, China;Extendable digital dictionary for automatic Chinese word segmentation[J];Computer Engineering and Applications;2008-21
7 by Duan Xiaobin;One Study on Chinese Word Segmentation Based on Key-word Library Having Three Level Index[J];Computer & Digital Engineering;2007-07
8 ZHANG Jun-ying1,HU Xia2,BU Jia-jun1(1.College of Computer Science & Technology,Zhejiang University,Hangzhou 310027,China;2.Hangzhou Science & Technology Information Institute,Hangzhou 310000,China);Survey on text information extraction from Web page[J];Application Research of Computers;2009-08
9 Li Guochen, Liu Kaiying, and Zhang Yongkui (Computer Science Dept. ,ShanXi Univ.);Segmentating Chinese Word and Processing Different Meanings Structure.[J];Journal of Chinese Information Processing;1988-03
10 Yao Tian-Shun, Zhang Gui-Ping & Wu Ying-Ming(Northeast University of Technology);A Rule-Based Chinese Automatic Segmenting System[J];Journal of Chinese Information Processing;1990-01
【Secondary Citations】
Chinese Journal Full-text Database 7 Hits
2 LIANG NANYUAN (Beijing Institute of Aeronautics and Astronautics);AN INTRODUCTION TO AUTOMATIC DISTINGUISHING OF WRITTEN CHINESE WORDS[J];Computer Applications and Software;1987-03
3 Liu Yuan Liang NanyuanDept. of Computer Science. Beijing Instituue ofAeronautics and Astronautics;Basic Engineemig for Chinese Processing——Modern Chinese Words Frequency Count[J];Journal of Chinese Information Processing;1986-01
4 Yao Tian-Shun, Zhang Gui-Ping & Wu Ying-Ming(Northeast University of Technology);A Rule-Based Chinese Automatic Segmenting System[J];Journal of Chinese Information Processing;1990-01
5 Luo Zhengqing, Chen Zengwu,Hu Shangxw(Zhejiang University, HangZhou 3l0037);A Revised MM Algorithm of Chinese Automatic Words Segmentation[J];Journal of Chinese Information Processing;1996-03
6 Zhang Min; Li Sheng;Wang Haifeng and Zhao Tiejun(Computer Department,Harbin Institute of Technology,Harbin 150001)Wang Tiezhi(Forestry School of Qiqihar,Qiqihar 161006);Evaluation-based Chinese Automatic Word Segmentation System[J];JOURNAL OF THE CHINA SOCIETY FOR SCIENTIFIC AND TECHNICAL INFORMATION;1996-02
7 Su Xinning (Department of Information Management,Nanjing University,Nanjing 210093);Improvement of Automatic Indexing in Chinese[J];JOURNAL OF THE CHINA SOCIETY FOR SCIENTIFIC AND TECHNICAL INFORMATION;1996-06
©2006 Tsinghua Tongfang Knowledge Network Technology Co., Ltd.(Beijing)(TTKN) All rights reserved