Publications (4)0 Total impact
- Scientific Programming. 01/2011; 19:27-43.
Conference Proceeding: Toward a Reliable Distributed Data Management System.[show abstract] [hide abstract]
ABSTRACT: Modern collaborative science has placed increasing burden on data management infrastructure to handle the increasingly large data archives generated. Beside functionality, reliability and availability are also key factors in delivering a data management system that can efficiently and effectively meet the challenges posed and compounded by the unbounded increase in the size of data archive generated by scientific applications. In this paper, we present our work on increasing and improving reliability and availability in the data management system we designed for the PetaShare project, we also discuss our work on benchmarking the performance and scalability of metadata management system in PetaShare project.Ninth International Symposium on Parallel and Distributed Computing, ISPDC 2010, Istanbul, Turkey, July 7-9, 2010; 01/2010
- [show abstract] [hide abstract]
ABSTRACT: We designed a semantic enabled metadata framework using ontology for multi-disciplinary and multi-institutional large scale scientific data sets in a Data Grid setting. Two main issues are addressed: data integration for semantically and phys-ically heterogeneous distributed knowledge stores, and semantic reasoning for data verification and inference in such a setting. This framework enables data interoper-ability between otherwise semantically incompatible data sources, cross-domain query capabilities and multi-source knowledge extraction. In this paper, we present the basic system architecture for this framework, as well as an initial implementation. We also analyze a real-life scenario and show integration of our framework into the PetaShare Data Grid where multi-disciplinary data archives are geographically distributed across six research institutions in Louisiana.. Since January, 2007, he has been involved in Petashare project and his research mainly focuses on ontology metadata in grid and distributed computing environment.International Journal of Grid and Utility Computing. 01/2009; 1(4).
Conference Proceeding: Cross-domain metadata management in data intensive distributed computing environment.[show abstract] [hide abstract]
ABSTRACT: As the the size of scientific datasets grows, it becomes imperative that cross-domain metadata management system needs to be developed to facilitate interdisciplinary scientific research. Three key issues need to be addressed: the development of a cross-domain metadata schema; the implementation of a metadata management system based on this schema; the integration of the metadata system into existing infrastructure with reasonable performance and scalability. In this paper, we give an overview of the research we have done as part of the PetaShare project to address the above mentioned problems.Proceedings of the 2009 IEEE International Conference on Cluster Computing, August 31 - September 4, 2009, New Orleans, Louisiana, USA; 01/2009
Louisiana State UniversityBaton Rouge, Louisiana, United States