Update methods for maintainability of a multidimensional index on non-ordered discrete vector data
dc.contributor.author | Cherniak, R | |
dc.contributor.author | Zhu, Q | |
dc.contributor.author | Pramanik, S | |
dc.date.accessioned | 2024-09-27T01:21:32Z | |
dc.date.available | 2024-09-27T01:21:32Z | |
dc.date.issued | 2020-09-01 | |
dc.identifier.issn | 1076-5204 | |
dc.identifier.uri | https://hdl.handle.net/2027.42/195086 | |
dc.description.abstract | There are numerous applications nowadays such as bioinformatics, cybersecurity, and social media that demand to efficiently process various types of queries on multidimensional (vector) data with values coming from a non-ordered discrete (categorical) domain for each dimension. The BoND-tree index scheme was recently developed to efficiently process so-called box queries on a large dataset in disk from such a vector data space. The index construction (insertion) and query algorithms were introduced in the original work. To maintain such an efficient index structure for a large dynamic dataset, one has to develop efficient and effective methods to support other operations including deletions, updates, and bulk loading. Although studies on deletions and bulk loading for the BoND-tree have been reported in early work, how to efficiently and effectively update the BoND-tree remains an open problem. In this paper, we first present a general update procedure which covers all scenarios including special cases for insertions and deletions. We then examine two approaches to updating the BoND-tree. The relevant algorithms and experimental evaluations are presented. Our study shows that using the bottom-up update method can provide improved efficiency, comparing to the traditional top-down update method, especially when the number of dimensions for a vector that need to be updated is small. On the other hand, our study also shows that the two update methods have a comparable effectiveness, which indicates that the bottom-up update method is generally more advantageous. | |
dc.title | Update methods for maintainability of a multidimensional index on non-ordered discrete vector data | |
dc.type | Article | |
dc.description.bitstreamurl | http://deepblue.lib.umich.edu/bitstream/2027.42/195086/2/ijca2020_a1.pdf | |
dc.identifier.doi | https://dx.doi.org/10.7302/24325 | |
dc.identifier.source | International Journal of Computers and their Applications | |
dc.description.version | Published version | |
dc.date.updated | 2024-09-27T01:21:24Z | |
dc.identifier.volume | 27 | |
dc.identifier.issue | 3 | |
dc.identifier.startpage | 94 | |
dc.identifier.endpage | 106 | |
dc.identifier.name-orcid | Cherniak, R | |
dc.identifier.name-orcid | Zhu, Q | |
dc.identifier.name-orcid | Pramanik, S | |
dc.working.doi | 10.7302/24325 | en |
dc.owningcollname | Computer and Information Science, Department of (UM-Dearborn) |
Files in this item
Remediation of Harmful Language
The University of Michigan Library aims to describe its collections in a way that respects the people and communities who create, use, and are represented in them. We encourage you to Contact Us anonymously if you encounter harmful or problematic language in catalog records or finding aids. More information about our policies and practices is available at Remediation of Harmful Language.
Accessibility
If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.