Piggyback Query Optimization with Statistics Collection for Database Management Systems

Supported by the Centre for Advanced Studies at the IBM Toronto Laboratory
                 and The University of Michigan (Rackham, UMD Research, and CEEP)
 

Principal Investigator

Dr. Qiang Zhu
Department of Computer and Information Science,
The University of Michigan, Dearborn, MI 48128
qzhu@umich.edu
 

Co-Investigator

Dr. Nandit Soparkar
Department of Electrical Engineering and Computer Science
The University of Michigan, Ann Arbor, MI 48109
soparkar@umich.edu
 

Industrial Collaborators

Dr. Suyun Chen & Berni Schiefer
Database Technology
IBM Toronto Laboratory
suyun@ca.ibm.com
 

Graduate Students

Brian Dunkel, Ph.D. candidate, IBM CAS fellowship
Wing Lau, M.Sc. student
Wahyudi Gunawan, M.Sc. student
Jung-uk Kim, M.Sc. student
 

Undergraduate Student

Nikola Markovic
 

Project Overview

Most database management systems (DBMS) perform query optimization based on statistical information about the underlying database. Out-of-date statistics may lead to inefficient query processing in the system. Existing solutions to this problem have serious drawbacks such as heavy administrative burden, high system load, and tardy updates. To overcome these drawbacks, this project investigates a new approach, called the piggyback method, to solve the problem. The key idea is to piggyback some additional retrievals during the processing of a user query in order to collect more up-to-date statistics. The collected statistics are used to optimize the processing of subsequent queries.

There are several types of piggybacking including vertical piggybacking, which fetches additional columns; horizontal piggybacking, which retrieves additional rows; mixed vertical and horizontal piggybacking, which mixes the previous types; and multi-query piggybacking, which utilizes data from multiple queries. Basic piggybacking operators are defined to easily convert user queries into different types of piggybacked queries. Issues such as obtainable statistics, collecting levels, piggyback timing, parallel piggybacking, and a starvation problem are being investigated. The application of the piggyback technique to other database areas such as data mining is also being explored.

Project References

Related Links

  • Database Research Group at UM - D
  • EECS at the UM Ann Arbor campus

  • Number of visitors to this page is: Counter Image since 01/01/2001