Full-Text Search:
Home|About CNKI|User Service|中文
Add to Favorite Get Latest Update

ProGen: Provenance database generator for large-scale data set

ZHANG Xiao1,2,WANG Shan1,2,LIAN Na1,2(1.Key Laboratory of Data Engineering and Knowledge Engineering of the Ministry of Education,Renmin University of China,Beijing 100872,China;2.School of Information,Renmin University of China,Beijing 100872,China)  
It is crucially important for researchers especially scientists to judge the correctness and timeliness of data and experiments according to provenance.Regarding the technologies about view materialization and data annotation,provenance has emerged to be a new research topic.Appropriate provenance data set is the foundation for verifying the accuracy and functionality of new techniques and/or algorithms on provenance management,meanwhile,the synthetic provenance data set is also of importance for verification and improvement of algorithms before gleaning the real provenance data to some expected extent.In this paper,one novel provenance database generator,ProGen was proposed,which was able to generate a provenance database,according to the input data schema and provenance annotation,with the specific data volume.The evaluation indicates that our design and implementation is efficient and scalable.
Download(CAJ format) Download(PDF format)
CAJViewer7.0 supports all the CNKI file formats; AdobeReader only supports the PDF format.
©CNKI All Rights Reserved