Using real data sets from open source data repositories such as data.gov, the DC Open Data Catalog, and Kaggle.com, students create a data product to address a real-world problem. Students work in a data science team to apply the data science pipeline (data ingestion, data munging and wrangling, computation and analysis, modeling and application, and reporting and visualization) to a real-world problem or issue. The course involves the practical application and presentation of concepts and tools learned during the core courses. All completed pieces of the project will be hosted online to help students build a data science project portfolio.
Upon successful completion of the course, students will:
- Apply the knowledge, skills, and abilities applicable to the data science pipeline to a real world problem and data set
- Work in a data science team to create a data product
- Present a completed project and product to faculty and peers
- Build a data science project portfolio
Enrollment in this course is restricted. Students must submit an application and be accepted into the Certificate in Data Science in order to register for this course.
Current Georgetown students must create an application using their Georgetown NetID and password. New students will be prompted to create an account.
Course prerequisites include:
- A bachelor's degree or equivalent
- Completion of at least two college-level math courses (e.g. statistics, calculus, etc.)
- Successful completion of Data Story Telling (XBUS-506)
- Basic familiarity with programming or a programming language
- A laptop for class meetings and coursework
Applies Towards the Following Certificates
- Certificate in Data Science : Required