The MauveDB Project
The MauveDB project is motivated by the tremendous increase in
the number of distributed measurement infrastructures such as
wireless sensor networks that continuously generate invaluable
data about our everyday world. However, the potential of this
data has been hard to realize mainly because of the typically
incomplete, imprecise, erroneous, and uncertain nature of the
data generated. The MauveDB project aims to develop abstractions
that make it easy for users and application developers to
continuously apply statistical modeling tools to streaming
sensor data. Such statistical
models can be used for data cleaning, prediction, interpolation, anomaly
detection and for inferring hidden variables from the data, thus addressing
many of the challenges in managing sensor data.
MauveDB supports a new abstraction called "model-based views" to achieve the
above goal. A model-based view is analogous to a traditional database view
in that it can be used to present a consistent "view" of the underlying data
to the user. However, as opposed to a traditional database view, a model-based view
is defined using a statistical model instead of an SQL query. This not only
significantly enriches the user interaction with the sensed data, but also
results in more efficient processing of data.
A brief overview of the MauveDB project and the underlying technology
Publications
- Exploiting Shared Correlations in Probabilistic Databases;
Prithviraj Sen, Amol Deshpande, and Lise Getoor;
To appear in VLDB 2008.
[abstract]
- Predictive Modeling-based Data Collection in Sensor Networks;
Lidan Wang and Amol Deshpande;
EWSN 2008.
[pdf] [abstract]
Selected as one of the best papers.
- Online Filtering, Smoothing and Probabilistic Modeling of Streaming data;
Bhargav Kanagal, Amol Deshpande;
ICDE 2008.
[pdf] [abstract]
(Extended version: UMD CS Tech. Report, CS-TR-4867, May 2007.)
- Data Management in the Worldwide Sensor Web;
Magdalena Balazinska, Amol Deshpande, Michael Franklin, Phil Gibbons, Jim Gray, Mark Hansen, Michael Liebhold, Suman Nath, Alex Szalay, and Vincent Tao;
IEEE Pervasive Computing, Volume 6(2), 2007.
[abstract]
- Representing and Querying Correlated Tuples in Probabilistic Databases;
Prithviraj Sen, Amol Deshpande;
ICDE 2007.
[pdf] [abstract]
- MauveDB: Supporting Model-based User Views in Database Systems;
Amol Deshpande, Sam Madden;
SIGMOD 2006.
[pdf] [talk] [abstract]
- Model-based Approximate Querying in Sensor Networks ;
Amol Deshpande, Carlos Guestrin, Sam Madden, Joseph M. Hellerstein, Wei Hong;
International Journal on Very Large Data Bases (VLDB Journal), 2005.
[pdf] [abstract]
Project Participants
External Collaborators
Acknowledgments
This material is based upon work supported by the National Science Foundation under Grant No. 0546136.
Any opinions, findings, and conclusions or recommendations expressed in this material are those of the
author(s) and do not necessarily reflect the views of the National Science Foundation.