CS4400X: Introduction to Database (Data Management)
(Spring 2020)
Attention:
- Major course announcements will be
made on GT Canvas so that you all get automatic email notifications
- Please check the Schedule page regularly for links to slides, links to related materials, reminders for due dates, etc..
- Please check the Workload page for your expected workload throughout this semester
- Please check the Resources page for useful resources related to this course
Important Q&As about this course
Q1: How is CS4400-X different from other sections of CS4400?
A1: CS4400-X will cover the relational database technologies, just like the rest of CS4400, in about half of the semester. In the other half, we will cover other important and modern aspects of data management and data science, including data profiling/mining, practical machine learning, and data quality and data cleaning.
Q2: Is there any programming assignments?
A2: Yes. There will be an assignment about writing SQL queries. There will also be a Kaggle competition style assignment using Python.
Q3: Are there mid-terms and finals?
A3: There will be a closed-book mid-term exam. There will be no final exam, which is replaced by a programming competition to resolve a data science task using Python.
Q4: What is the grade breakdown?
A4: Please see the Workload Page.
Topics
The course will be cover the following topics in data management and data science:
- Relational database technologies (~ 7-8 weeks)
- Data profiling/data mining (~ 3-4 weeks)
- Introduction to practical machine learning (~ 1-2 weeks)
- Data quality and data cleaning (~ 3-4 weeks)
Logistics
We will be using
Canvas
for course announcements, uploading materials that should not be
made public, submission of assignments, and release of grades, etc.
- Instructor: Xu Chu
- Email: xu.chu@cc.gatech.edu
- Office: Klaus 3322
- TA: Peng Li
- Email: pli88@gatech.edu
- Office: Klaus 3319
- Time: Tue and Thur, 4:30 - 5:15pm
- Location: College of Computing 102
- TA Office Hour: Every Wed 1:30pm to 2:30pm at Klaus 3319
- Instructor Office Hour: Every Tue 3:00pm to 4:00pm at Klaus 3322
- Appointments: E-mail me to book a
slot. The subject of the email should contain "CS4400X".
Prerequisites
- Proficiency in Python programming
- Knowledge about data structures and algorithms
- Experience in machine learning
Academic Honesty: