A Pareto Model for OLAP View Size Estimation
dc.contributor.author | Nadeau, Thomas P. | en_US |
dc.date.accessioned | 2006-09-11T17:18:32Z | |
dc.date.available | 2006-09-11T17:18:32Z | |
dc.date.issued | 2003-04 | en_US |
dc.identifier.citation | Nadeau, Thomas P.; (2003). "A Pareto Model for OLAP View Size Estimation." Information Systems Frontiers 5(2): 137-147. <http://hdl.handle.net/2027.42/46042> | en_US |
dc.identifier.issn | 1387-3326 | en_US |
dc.identifier.issn | 1572-9419 | en_US |
dc.identifier.uri | https://hdl.handle.net/2027.42/46042 | |
dc.description.abstract | On-Line Analytical Processing (OLAP) aims at gaining useful information quickly from large amounts of data residing in a data warehouse. To improve the quickness of response to queries, pre-aggregation is a useful strategy. However, it is usually impossible to pre-aggregate along all combinations of the dimensions. The multi-dimensional aspects of the data lead to combinatorial explosion in the number and potential storage size of the aggregates. We must selectively pre-aggregate. Cost/benefit analysis involves estimating the storage requirements of the aggregates in question. We present an original algorithm for estimating the number of rows in an aggregate based on the Pareto distribution model. We test the Pareto Model Algorithm empirically against four published algorithms, and conclude the Pareto Model Algorithm is consistently the best of these algorithms for estimating view size. | en_US |
dc.format.extent | 293027 bytes | |
dc.format.extent | 3115 bytes | |
dc.format.mimetype | application/pdf | |
dc.format.mimetype | text/plain | |
dc.language.iso | en_US | |
dc.publisher | Kluwer Academic Publishers; Springer Science+Business Media | en_US |
dc.subject.other | Economics / Management Science | en_US |
dc.subject.other | Management of Computing and Information Systems | en_US |
dc.subject.other | Systems Theory, Control | en_US |
dc.subject.other | Operation Research/Decision Theory | en_US |
dc.subject.other | Business Information Systems | en_US |
dc.subject.other | Pareto Distribution | en_US |
dc.subject.other | OLAP | en_US |
dc.subject.other | View Size Estimation | en_US |
dc.subject.other | Materialized View Selection | en_US |
dc.title | A Pareto Model for OLAP View Size Estimation | en_US |
dc.type | Article | en_US |
dc.subject.hlbsecondlevel | Mathematics | en_US |
dc.subject.hlbsecondlevel | Management | en_US |
dc.subject.hlbsecondlevel | Industrial and Operations Engineering | en_US |
dc.subject.hlbsecondlevel | Economics | en_US |
dc.subject.hlbtoplevel | Science | en_US |
dc.subject.hlbtoplevel | Business | en_US |
dc.subject.hlbtoplevel | Engineering | en_US |
dc.description.peerreviewed | Peer Reviewed | en_US |
dc.contributor.affiliationum | Computer Science and Engineering Division (CSE), Department of Electrical Engineering and Computer Science (EECS), The University of Michigan, 1301 Beal Avenue, Ann Arbor, MI, 48109-2122, USA | en_US |
dc.contributor.affiliationumcampus | Ann Arbor | en_US |
dc.description.bitstreamurl | http://deepblue.lib.umich.edu/bitstream/2027.42/46042/1/10796_2004_Article_5115802.pdf | en_US |
dc.identifier.doi | http://dx.doi.org/10.1023/A:1022693305401 | en_US |
dc.identifier.source | Information Systems Frontiers | en_US |
dc.owningcollname | Interdisciplinary and Peer-Reviewed |
Files in this item
Remediation of Harmful Language
The University of Michigan Library aims to describe library materials in a way that respects the people and communities who create, use, and are represented in our collections. Report harmful or offensive language in catalog records, finding aids, or elsewhere in our collections anonymously through our metadata feedback form. More information at Remediation of Harmful Language.
Accessibility
If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.