This graduate course covers advanced data and information management systems. It first covers the design and implementation of advanced database systems including data warehouses, decision support systems, and data mining. It then examines several generations of parallel and distributed databases, including the most recent MapReduce-based parallel systems that are widely used in information industry. This course further includes temporal databases, sequence databases, and data stream systems. Finally, it presents advanced topics in probabilistic data management.
This graduate database course can be taken for 3 credits, 1 credit, or pass/fail. The prerequisite is a graduate course on the principles and implementations of traditional database systems, an equivalent of CMPSCI645. Students with other backgrounds are asked to contact the instructor for approval for enrollment.
This course can be re-taken since it includes new materials each time that it is offered.
Professor: Yanlei Diao
Course Time:TuTh 1:00-2:15 pm in the Computer Science Building Room 140
Office Hours:Please email the instructor for appointments.
Students are required to read 1 paper for each class and email paper reviews to the instructor by NOON on the day of class. Failure to submit a paper review on time will result in 2 points deducted. Students are asked to select one paper from the reading list and give an in-class presentation. The presentation covers 15% of the final grade. Students are also expected to participate in class discussion, which covers 5% of the grade. Finally, there is a course project, covering 40% of the course grade. Students can form teams of 2 people or work individually. They can propose projects on any topic related to the course material. To select an appropriate topic for the project, please contact the instructor at the beginning of the semester.
Text:The course readings are primarily drawn from the 4th Edition of "Readings in Database Systems" (a.k.a. the "Red Book") edited by Hellerstein and Stonebraker. Most readings for this course will be posted on the course web site.
For an excellent introduction to the basics of database systems, students are referred to the textbook "Database Management Systems" by Ramakrishnan and Gehrke. The textbook is available from Amazon.