Class Time and Place: Tue 12:00-2:50; IS 501
Instructor: Vladimir Zadorozhny
Office Hours: by appointment
Tel: (412) 624 9411
e-mail: vladimir@sis.pitt.edu
GSA: Andrii Cherniak
Office Hours: Friday, 2:00 - 3:00 pm, SIS 2B10
Tel: (412) 624 7378
e-mail: aic3@pitt.edu
This is an advanced graduate course on database systems. The course focuses on distributed and unstructured database management, as well as on the relationship between database management, information retrieval, and semantic technologies.
INFSCI 2710, or permission of the instructor.
Introduction (centralized vs distributed databases; homogeneous vs heterogeneous databases; OLAP and data warehousing; wide area networks and databases; structured vs unstructured data management; Web and databases; wireless networks and databases).
Architectures of modern database systems (client/server, advanced networked systems, OLAP and data warehousing systems).
Principles of distributed data management (distributed data storage, query processing, transaction management).
Unstructured data management (limitations of structured data models, data management and information retrieval).
Web-based data management (semi-structured data and querying XML, semantic Web data management, cloud computing and data management).
Advanced networked information systems (pervasive and mobile data management, stream data management, sensor data management).
This course is being offered for three credits. The coursework consists of Homeworks, Exams, and a team-oriented Project. The grading is as follows:
| Homeworks | 20% |
| Midterm Exam | 20% |
| Final Exam | 30% |
| Project | 30% |
Late Policy: Homeworks are due at the beginning of class on the due date. Homework may be turned in at the following class with a 25% penalty. No assignments will be accepted after that time.
| Lesslet Topic | Contributors | Lesslet Post Due Day | Lesslet Take&Like Due Day |
| Data Warehousing | Andrew Barnett; Patric Cameron; Jie Chen; Yu-Ming Chen | Sat, 01/28 | Tue, 01/31 |
| Naming Service | Cruz Micheal, Ezhil Manoj; Entin, Andrew Bruce; Jin, Xiao; Junker, Thomas Parke | Sat, 02/04 | Tue, 02/07 |
| Distributed Lock Management | Karataev, Evgeny Pavlovich; Kelaiya, Ravi K.; Kraisangka, Jidapa; Lei, Lei | Sat, 02/11 | Tue, 02/14 |
| Query Processing | Liao, Han; Lin, Xiaolin; Liu, Chang; McVey, Joseph Patrick | Sat, 02/18 | Tue, 02/21 |
| Query Optimisation | Miller, Eric Scott; Narayana Swamy, Poornima; Peng, Ye; Pulgino, Evan Matthew | Sat, 02/25 | Tue, 02/28 |
| Object Databases | Quattrone, Henry Joseph; Salunke, Aditya Deepak; Subramoniapillai Ajeetha, Nikilesh; Swartz, Mark T. | Sat, 03/17 | Tue, 03/20 |
| XPath | Tan, Miao; Thomas, Kenneth Paul; Wang, Guanda; Yan, Jiahua | Sat, 03/24 | Tue, 03/27 |
| HADOOP | Yang, Hsiang-Ting; Zhang, Jiong; Zhang, Lihong; Zhang, Ming | Sat, 04/14 | Tue, 04/17 |
| HADOOP | Yuchen Liu | Mon, 04/02 | Tue, 04/03 |
There will be a Hadoop lab on 03/27. Please bring your laptop to class.
The project grade will be distributed in proportion to the effort each group member put into the project. By midnight on Apr 17, each of you should email your group evaluation to aic3@pitt.edu. The evaluation should list your group members together with the percentage of the work each member contributed to the project.
The project is done in groups of 2-3 students and is documented by a written group report. A demo will be scheduled for each group. The groups are "self-policed".
Special Projects: This class also offers students an opportunity to participate in special projects. Special projects give you a chance to work on more advanced topics related to current research projects conducted by the instructor and his doctoral students. If you are interested in participating in a special project, please send an email to the instructor (cc the GSA) with a brief description of your skills as soon as possible.
| Topic | Documents |
| History of databases | What Goes Around Comes Around |
| Parallel Databases | Parallel DB, Gamma, Bubba, Volcano, Shared-nothing architecture, Parallel join |
Silberschatz, Korth, and Sudarshan. Database System Concepts, 6th edition, McGraw-Hill, 2010, ISBN 0-07-352332-1 (selected chapters)
Christopher D. Manning, Prabhakar Raghavan, and Hinrich Schütze. Introduction to Information Retrieval, Cambridge University Press, 2008, ISBN 0521865719 (selected chapters)
Grigoris Antoniou and Frank van Harmelen. A Semantic Web Primer, 2nd edition, The MIT Press, 2008, ISBN-10: 0262012421, ISBN-13: 978-0262012423 (selected chapters)
Additional references will be posted here or provided in class.
| Week | Day | Topics | Special Events |
| 1 | Tu 1/10 | Class Introduction | |
| 2 | Tu 1/17 | Advanced Database Architectures | |
| 3 | Tu 1/24 | OLAP and Data Warehousing | |
| 4 | Tu 1/31 | OLAP and Data Warehousing | Project Assigned |
| 5 | Tu 2/7 | Distributed DBMS: Transactions, Concurrency, Recovery | Proj. Group Lists Due |
| 6 | Tu 2/14 | Distributed DBMS: Transaction, Concurrency, Recovery | HW#1 Due |
| 7 | Tu 2/21 | Distributed DBMS: Query Processing and Optimization | |
| 8 | Tu 2/28 | MIDTERM | Exam |
| 9 | Tu 3/6 | Spring Recess | NO CLASSES |
| 10 | Tu 3/13 | Querying Unstructured Data | |
| 11 | Tu 3/20 | Querying Unstructured Data | |
| 12 | Tu 3/27 | Intro to hadoop | HW#2 Due |
| 13 | Tu 4/3 | Web Data Management | |
| 14 | Tu 4/10 | Pervasive Data Management | |
| 15 | Tu 4/17 | Pervasive Data Management | Project Due |
| 16 | Tu 4/24 | FINAL EXAM | Exam |
Last Modified: January 8, 2012.