CS4400X: Introduction to Database (Data Management)
(Spring 2021)
Attention:
- This will be a fully online offering due to COVID.
- Please check the Schedule page regularly for links to slides, links to related materials, reminders for due dates, etc..
- Please check the Workload page for your expected workload throughout this semester
- Please check the Resources page for useful resources related to this course
Important Q&As about this course
Q1: How is CS4400-X different from other sections of CS4400?
A1: CS4400-X has two parts. Part I is the same as CS4400, which discuss relational database technologies. Part II is additional materials that talk about data management issues in data science, including data profiling/mining, practical machine learning, and data quality and data cleaning. Please refer to the Schedule page for detailed topic breakdown. Part I is about 2/3 of the course, and Part II is about 1/3 of the course.
Q2: Is there any programming assignments?
A2: Yes. There will be an assignment about writing SQL queries. There will also be a Kaggle competition style assignment using Python.
Q3: Are there mid-terms and finals?
A3: There will be only one exam, which covers the first part of the course. The final project covers the second part of the course.
Q4: What is the grade breakdown?
A4: Please see the Workload Page.
Logistics
We will be using
Canvas
for course announcements, uploading materials that should not be
made public, submission of assignments, and release of grades, etc.
- Instructor: Xu Chu
- Email: xu.chu@cc.gatech.edu
- Office: Klaus 3322
- TAs:
- Renzhi Wu (renzhiwu@gatech.edu)
- Zhiyi Chen (zchen798@gatech.edu)
- Time: anytime!! Please reach us any time on the course slack workspace, link can be found on Canvas.
Prerequisites
- Proficiency in Python programming
- Knowledge about data structures and algorithms
- Experience in machine learning
Academic Honesty: