Format
A paper review should summarize concisely the main contributions of the paper. It should also identify the limitations of the paper, if any. Please feel free to include any questions, for example, on an assumption made in the paper that you found unrealistic, or any technique that is unclear to you.Submission
Please email paper reviews to the instructor by 10 am on the day of class. Please send reviews in the body of the message. Do not send them as attachments. The paper reviews cover 25% of the course grade.Reading list (subject to change)
1. Data Warehouses
[CD97] Surajit Chaudhuri and Umeshwar Dayal. An Overview of Data Warehousing and OLAP Technology. SIGMOD Record, 26(1), 1997, 65-74.
[GCB+97] Jim Gray, Surajit Chaudhuri, Adam Bosworth, Andrew Layman, Don Reichart, Murali Venkatrao, Frank Pellow, and Hamid Pirahesh. Data Cube: A Relational Aggregation Operator Generalizing Group-by, Cross-Tab, and Sub Totals. Data Min. Knowl. Discov., 1(1), 1997, 29-53.
[ZDN97] Yihong Zhao, Prasad Deshpande, and Jeffrey F. Naughton. An Array-Based Algorithm for Simultaneous Multidimensional Aggregates. Proc. SIGMOD Conference, 1997, 159-170.
2. Data Mining
[AS94] Rakesh Agrawal and Ramakrishnan Srikant. Fast Algorithms for Mining Association Rules in Large Databases. Proc. VLDB, 1994, 487-499.
[ZRL96] Tian Zhang, Raghu Ramakrishnan, and Miron Livny. BIRCH: An Efficient Data Clustering Method for Very Large Databases. Proc. SIGMOD Conference, 1996, 103-114.
3. Column-based Databases
[SAB+05] Mike Stonebraker, Daniel Abadi, Adam Batkin, Xuedong Chen, Mitch Cherniack, Miguel Ferreira, Edmond Lau, Amerson Lin, Sam Madden, Elizabeth O'Neil, Pat O'Neil, Alex Rasin, Nga Tran and Stan Zdonik. C-Store: A Column Oriented DBMS. VLDB, 2005, 553-564.
4. Views and Lineage
[Hal01] A.Y. Halevy. Answering Queries Using Views: A Survey. VLDB Journal, 10(4), 2001.
[KR99] Yannis Kotidis and Nick Roussopoulos. DynaMat: A Dynamic View Management System for Data Warehouses. Proc. of SIGMOD Conference, 1999, 371-382.
5. Parallel Databases
[DGS+90] David J. DeWitt, Shahram Ghandeharizadeh, Donovan A. Schneider, Allan Bricker, Hui-I Hsiao, and Rick Rasmussen. The Gamma Database Machine Project. IEEE Trans. Knowl. Data Eng., 2(1), 1990, 44-62.
[DG04] Jeffrey Dean and Sanjay Ghemawat. MapReduce: Simplified Data Processing on Large Clusters. Proc. of OSDI (Symposium on Operating System Design and Implementation) 2004.
[CDG+06] Fay Chang, Jeffrey Dean, Sanjay Ghemawat, Wilson C. Hsieh, Deborah A. Wallach, Mike Burrows, Tushar Chandra, Andrew Fikes, and Robert E. Gruber. Bigtable: A Distributed Storage System for Structured Data. Proc. OSDI (Symposium on Operating System Design and Implementation) 2006.
6. Distributed Databases
[SAL+96] Michael Stonebraker, Paul M. Aoki, Witold Litwin, Avi Pfeffer, Adam Sah, Jeff Sidell, Carl Staelin, and Andrew Yu. Mariposa: A Wide-Area Distributed Database System. VLDB J., 5(1), 1996, 48-63.
7. Sequential, Temporal, Stream Databases
[BGJ] Michael H. Bohlen, Johann Gamper, and Christian S. Jensen.
Temporal Databases. Chapter of a book edited by Hammer and Schneider.
[SLR96]
Praveen Seshadri, Miron Livny, Raghu Ramakrishnan. The Design and Implementation of a Sequence Database System. Proc. VLDB, 1996, 99-110.
[HCH+99]
Eric N. Hanson, Chris Carnes, Lan Huang, Mohan Konyala, Lloyd Noronha, Sashi
Parthasarathy, J. B. Park, and Albert Vernon. Scalable Trigger Processing. Proc.
ICDE, 1999, 266-275.
[ABW06]
Arvind Arasu, Shivnath Babu, and Jennifer Widom.
The CQL continuous query language: semantic foundations and query execution.
VLDB J. 15(2): 121-142 (2006)
[AH00]
Ron Avnur and Joseph M. Hellerstein. Eddies: Continuously Adaptive Query Processing. Proc. SIGMOD Conference, 2000, 261-272.
8. Probabilistic Data Management
[DS07]
Nilesh Dalvi and Dan Suciu. Efficient Query Evaluation on Probabilistic Databases. VLDB Journal, 16(4), 2007.
[BSHW06]
O. Benjelloun, A. Das Sarma, A. Halevy, and J. Widom. ULDBs: Databases with Uncertainty and Lineage. Proc. of VLDB, 2006.
[CM05]
Graham Cormode, S. Muthukrishnan. An improved data stream summary: the count-min sketch and its applications. J. Algorithms 55(1): 58-75 (2005)
[LD10]
Jian Li, Amol Deshpande.
Ranking Continuous Probabilistic Datasets.
PVLDB 3(1): 638-649 (2010)