CMPSCI 745: Advanced Topics in Database Systems

This graduate course covers advanced data and information management systems. The first part of the course addresses the design and implementation of advanced database systems including data warehouses, data mining, column-based databases, parallel databases, and distributed databases. The second part of the course explores advanced research topics including data streams, sensor data management, data provenance and lineage, and probabilistic databases.

This is a three-credit graduate database course. The prerequisite is a graduate course on the principles and implementations of traditional database systems, an equivalent of CMPSCI645. Students with other backgrounds should contact the instructor for approval for enrollment.

Professor: Yanlei Diao

Course Time:

TuTh 11:15-12:30 in the Computer Science Building Room 303

Office Hours:

Please email the instructor for appointments.

Course requirements:

Paper Reviews 25%
Class Presentation 15%
Class Participation 10%
Project 50%

Students are required to read 1 paper for each class and email paper reviews to the instructor by 10 am on the day of class. Failture to submit a paper review on time will result in 1 point deducted. Students are asked to select one paper from the reading list and give an in-class presentation. The presentation covers 15% of the final grade. Students are also expected to participate in class discussion, which covers another 10% of the grade. Finally, there is a course project, covering 50% of the course grade. Students can form teams of 2 people or work individually. They can propose projects on any topic related to the course material. To select an appropriate topic for the project, please contact the intructor at the beginning of the semester.

The course can be taken for pass/fail. Please contact the instructor for approval.

Text:

The course readings are primarily drawn from the 4th Edition of "Readings in Database Systems" (a.k.a. the "Red Book") edited by Hellerstein and Stonebraker. Most readings for this course will be posted on the course web site.

For an excellent introduction to the basics of database systems, students are referred to the textbook "Database Management Systems" by Ramakrishnan and Gehrke. The textbook is available from Amazon.