CSE599T: Topics in Probabilistic and Statistical Databases

Description: Concepts, algorithms, and systems used for processing probabilistic data and for applying statistical techniques to data management. Applications include management of uncertain data, data anonymization, approximate query processing, and query size estimation. We will discuss the probabilistic data model, several approaches to query evaluation, data lineage/provenance, the random graph data model, sketches from data, and sampling techniques.

Prerequisites: (none listed)

Portions of the CSE599T Web may be reprinted or adapted for academic nonprofit purposes, provided the source is accurately quoted and duly credited. The CSE599T Web: © 1993-2024, Department of Computer Science and Engineering, University of Washington. Administrative information on CSE599T (authentication required).