A Decision Model for Choosing the Optimal Level of Storage in Temporal Databases
March/April 1998 (vol. 10 no. 2)
pp. 297-309

Abstract: A database allows its users to reduce uncertainty about the world. However, not all properties of all objects can always be stored in a database. As a result, users may have to apply probabilistic inference rules to estimate the data required for their decisions. A decision based on such estimated data may not be optimal. We call the costs associated with such suboptimal decisions the cost of incomplete information. This cost can be reduced by expanding the database to contain more information, but such expansion increases the data-related costs because more data must be collected, manipulated, stored, and retrieved. A database designer must therefore weigh the cost of incomplete information against the data-related costs and choose a design that minimizes the overall cost to the organization. In temporal databases, the sheer volume of data involved makes this design-time trade-off all the more important. In this paper, we develop probabilistic inference rules that allow us to infer missing values along the spatial as well as the temporal dimension. We then use this framework to develop guidelines for designing and reorganizing temporal databases that explicitly include the trade-off between the cost of incomplete information and the data-related costs.
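The trade-off described in the abstract can be illustrated with a toy calculation. The sketch below is not the paper's model: it assumes, purely for illustration, that only every k-th time point is stored, that each stored record has a fixed cost, and that the cost of estimating an unstored value grows linearly with its distance from the most recent stored point. The function names and all parameters are hypothetical.

```python
def total_cost(k, horizon, storage_cost, error_cost):
    """Toy total cost when only every k-th time point is stored.

    Assumptions (illustrative only, not taken from the paper):
      - time points are 0, 1, ..., horizon - 1
      - points 0, k, 2k, ... are stored at `storage_cost` each
      - an unstored point at distance d from the latest stored point
        incurs an incomplete-information cost of `error_cost * d`
    """
    stored_points = len(range(0, horizon, k))
    data_related_cost = storage_cost * stored_points
    incomplete_info_cost = error_cost * sum(t % k for t in range(horizon))
    return data_related_cost + incomplete_info_cost


def optimal_interval(horizon, storage_cost, error_cost):
    """Return (k, cost) for the storage interval k with the lowest total cost."""
    candidates = range(1, horizon + 1)
    best_k = min(
        candidates,
        key=lambda k: total_cost(k, horizon, storage_cost, error_cost),
    )
    return best_k, total_cost(best_k, horizon, storage_cost, error_cost)


if __name__ == "__main__":
    k, cost = optimal_interval(horizon=12, storage_cost=4.0, error_cost=1.0)
    print(f"store every {k} time point(s); total cost = {cost}")
```

Storing every point (k = 1) removes the incomplete-information cost but maximizes the data-related cost, while storing a single point does the opposite. Under these assumed cost functions the minimum lies somewhere in between.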

Index Terms:
Temporal database, data warehousing, data incompleteness, logical design, storage cost, cost of incompleteness.
Citation:
Debabrata Dey, Terence M. Barron, Aditya N. Saharia, "A Decision Model for Choosing the Optimal Level of Storage in Temporal Databases," IEEE Transactions on Knowledge and Data Engineering, vol. 10, no. 2, pp. 297-309, March-April 1998, doi:10.1109/69.683758