CMPSCI 745: Advanced Database Systems

This course covers advanced data management systems, including high-performance database systems and recent big data systems. The course first covers the design and implementation of high-performance database systems, including data warehouses, decision support systems, column stores, and parallel databases. It also covers advanced databases that support new data models, including temporal databases, sequence databases, and data stream systems. The course then gives special focus to recent big data systems, including MapReduce-based parallel processing, Bigtable-style storage systems, and many new processing algorithms and techniques developed in industry and academia in this setting.

This graduate database course can be taken for 3 credits with a letter grade or pass/fail. The prerequisite is a graduate course on the principles and implementation of traditional database systems, equivalent to CMPSCI 645. Students with other backgrounds are asked to contact the instructor for approval to enroll.

Professor: Yanlei Diao

Course Time:

Tue/Thu 11:15 am - 12:30 pm in LGRC A310

Office Hours:

Tue 3:00 - 4:00 pm in CS Building Room 232

Course requirements:

Paper Reviews 25%
Class Presentation 15%
Midterm Exam 20%
Research Project 40%

Students are required to read one or two papers for each class and email a paper review to the instructor by 10 am on the day of class. Please use the email subject "745 PAPER REVIEW", as messages with this subject will be automatically collected into an email folder for this class. Failure to submit a paper review on time will result in a 1-point deduction.

Students are asked to select one paper from the reading list and give an in-class presentation. The presentation counts for 15% of the final grade. There will be a take-home midterm exam, which includes both material covered in the course and open-ended questions. Finally, there is a course project, which counts for 40% of the final grade. Students can form teams of 2 people or work individually. They can propose projects on any topic related to the course material. To select an appropriate topic for the project, please contact the instructor at the beginning of the semester.

Text:

The course readings are drawn from the 4th Edition of "Readings in Database Systems" (a.k.a. the "Red Book"), edited by Hellerstein and Stonebraker, as well as from more recent publications. Most readings for this course will be posted on the course web site.

To review the basics of database systems, students are referred to the textbook "Database Management Systems" by Ramakrishnan and Gehrke. The textbook is available from Amazon.