Analyzing design choices for distributed multidimensional indexing
Scientific datasets are often stored on distributed archival storage systems, both because geographically distributed sensor devices
store the datasets on their local machines and because the size of scientific datasets demands a large amount of disk space.
Multidimensional indexing techniques have been shown to greatly improve the performance of range queries on large scientific datasets.
In this paper, we discuss several ways of distributing a multidimensional index in order to speed up access to large distributed
scientific datasets. We compare the designs, challenges, and problems of distributed multidimensional indexing schemes,
and present a comprehensive performance study of distributed indexing that yields guidelines for choosing a distributed multidimensional
index for a specific data analysis application.
Keywords: Multidimensional indexing, Distributed indexing, Decentralized indexing, Data intensive computing