《New Century Library》 2017-08

A Semi-automatic Data Cleaning Method for Extracting Secondary Institutions' Data from WOS Address Field

He Chunjian
Chinese higher education institutions need to count the articles indexed in Web of Science (WOS) that were published by each of their secondary institutions, such as faculties. This paper proposes a semi-automatic data cleaning method, based on regular expressions, that extracts three items from WOS address fields: the rank of the publishing institution in the address list, the name of the secondary institution, and the corresponding authors. Finally, taking the WOS-indexed articles of Nanjing Normal University in 2015 as an example, it conducts an empirical study and analyzes the publication output of the university's faculties and authors.
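The abstract does not reproduce the paper's regular expressions, so the following is only a minimal sketch of the general idea. It assumes the WOS address (C1) field has the layout "[Author; Author] Institution, Sub-unit, City, Country; [Author] ...", and the names `ADDRESS_RE`, `parse_c1`, and `target` are hypothetical. It covers the institution rank and the secondary institution only; corresponding-author extraction is not shown.

```python
import re

# Assumed C1 layout: "[Author; Author] Institution, Sub-unit, City, Country; [Author] ..."
# One match per bracketed author group and the address text that follows it.
ADDRESS_RE = re.compile(r"\[(?P<authors>[^\]]+)\]\s*(?P<address>[^\[]+)")


def parse_c1(c1, target="Nanjing Normal Univ"):
    """Return one record per address whose first component equals `target`."""
    records = []
    for rank, match in enumerate(ADDRESS_RE.finditer(c1), start=1):
        # Drop the trailing separator, then split the address into components.
        parts = [p.strip() for p in match.group("address").strip(" ;.").split(",")]
        if parts[0] != target:
            continue
        records.append({
            "rank": rank,  # position of this address in the field
            "secondary": parts[1] if len(parts) > 1 else None,
            "authors": [a.strip() for a in match.group("authors").split(";")],
        })
    return records


# Illustrative input written for this sketch, not taken from the paper.
sample = (
    "[Li, Wei; Zhang, Hua] Nanjing Normal Univ, Sch Chem & Mat Sci, "
    "Nanjing 210023, Jiangsu, Peoples R China; "
    "[Wang, Fang] Nanjing Univ, Dept Phys, Nanjing 210093, Jiangsu, Peoples R China"
)
print(parse_c1(sample))
```

The method is semi-automatic because real address fields vary: records where the second component is not a recognizable sub-unit, or where abbreviations differ between articles, would still need manual review.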
【Fund】: One of the research outputs of the 2015 Jiangsu Provincial Social Science Fund project “历史文化古迹高保真全自动数字化平台建设研” (Project No. 15TQB005)
【Category Index】: G353.1