Full-Text Search:
Home|Journal Papers|About CNKI|User Service|FAQ|Contact Us|中文
《Journal of Chongqing University of Technology(Natural Science)》 2017-08
Add to Favorite Get Latest Update

Research on Predicate-Based Sampling Technology of Big Data

JIANG Qun;FU Yu;LI Wensheng;LIANG Ruishi;YANG Wu;College of Computer Science,Zhongshan Institute,University of Electronic Science and Technology of China;Media Department,Guangzhou Huali Science and Technology Vocational College;College of Computer Science and Engineering,Chongqing University of Technology;  
To solve big data sampling problem,this paper uses MapReduce to sample big data and produce a sample whose content satisfy a given predicate. Since the default Hadoop execution depends on the size of the input and is wasteful of cluster resources. The paper has extended the default Hadoop to support job-demand dynamic management of its resource consumption on cluster. Experiments results show that the implementation of the proposed policy performance is better than the default Hadoop policy. Therefore,it was proved that sampling big by using MapReduce is feasible and effective.
【Fund】: 国家自然科学基金青年科学基金资助项目(61300095);; 留学人员科技活动择优资助项目“商业智能应用软件研究与开发”(2009CR02)
【CateGory Index】: TP311.13
Download(CAJ format) Download(PDF format)
CAJViewer7.0 supports all the CNKI file formats; AdobeReader only supports the PDF format.
©2006 Tsinghua Tongfang Knowledge Network Technology Co., Ltd.(Beijing)(TTKN) All rights reserved