The Management of Probabilistic Data
October 1992 (vol. 4 no. 5)
pp. 487-502

It is often desirable to represent in a database entities whose properties cannot be deterministically classified. The authors develop a data model that includes probabilities associated with the values of the attributes. The notion of missing probabilities is introduced for partially specified probability distributions. This model offers a richer descriptive language, allowing the database to reflect the uncertain real world more accurately. Probabilistic analogs to the basic relational operators are defined and their correctness is studied. A set of operators that have no counterpart in conventional relational systems is also presented.
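
As a rough illustration of what a partially specified distribution might look like, the sketch below stores an uncertain attribute as a mapping from values to probabilities that may sum to less than 1, and treats the remainder as the missing probability. The names `missing_probability`, `probability_bounds`, and the `employee` record are hypothetical, and reading the missing mass as an upper bound that could fall on any value is an assumption made for this example, not a definition taken from the paper.

```python
from fractions import Fraction


def missing_probability(dist):
    """Return the probability mass not assigned to any listed value."""
    total = sum(dist.values())
    if any(p < 0 for p in dist.values()) or total > 1:
        raise ValueError("probabilities must be non-negative and sum to at most 1")
    return 1 - total


def probability_bounds(dist, value):
    """Lower and upper bounds on P(attribute == value).

    Assumption for this sketch: the unassigned mass could belong to any
    value, so it widens the upper bound but not the lower one.
    """
    known = dist.get(value, Fraction(0))
    return known, known + missing_probability(dist)


# Hypothetical record: 'name' is deterministic, 'dept' is uncertain.
employee = {
    "name": "Jon",
    "dept": {"toy": Fraction(1, 2), "shoe": Fraction(3, 10)},  # 1/5 left unspecified
}

low, high = probability_bounds(employee["dept"], "toy")
print(f"P(dept = toy) lies between {low} and {high}")
```

Exact fractions are used instead of floats so that the unassigned mass is computed without rounding error.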

probabilistic data management; database representation; data model; missing probabilities; partially specified probability distributions; descriptive language; relational operators; database theory; relational algebra; relational databases
D. Barbará, H. Garcia-Molina, D. Porter, "The Management of Probabilistic Data," IEEE Transactions on Knowledge and Data Engineering, vol. 4, no. 5, pp. 487-502, Oct. 1992, doi:10.1109/69.166990
