Course Description

Cleaning and formatting data, also known as “data wrangling,” are the most under appreciated yet time-consuming steps in the data science pipeline. In real world analyses, data wrangling can consume up to 80% of project time.  During this course, students will learn and apply the Extract/ Transform/ Load (ETL) process used by professional data scientists to clean and prep data sets for analysis.

Course Objectives

Upon successful completion of the course, students will:

  • Understand the time commitment needed for data wrangling 
  • Identify data sets that may be time-intensive to clean
  • Efficiently clean data sets of both structured and unstructured data to prepare for analysis
  • Apply the Extract/ Transform/ Load (ETL) process to a data set
  • Better estimate the time required for data wrangling tasks


Enrollment in this course is restricted. Students must submit an application and be accepted into the Certificate in Data Science in order to register for this course.

Current Georgetown students must create an application using their Georgetown NetID and password. New students will be prompted to create an account.

Course Prerequisites

Course prerequisites include:

  • A bachelor's degree or equivalent
  • Completion of at least two college-level math courses (e.g. statistics, calculus, etc.)
  • Successful completion of Data Sources (XBUS-502)
  • Basic familiarity with programming or a programming language
  • A laptop for class meetings and coursework

Applies Towards the Following Certificates

Enroll Now - Select a section to enroll in
Live Online
9:00AM to 4:00PM
Oct 01, 2022 to Oct 15, 2022
Schedule and Location
Contact Hours
Course Tuition
Tuition non-credit $833.00 Click here to get more information
Section Notes


Welcome to Live Online!

Using the Live Online platform, Georgetown faculty deliver real-time online exceptional educational experiences based on a human-centered approach that integrates the needs of professional learners and the possibilities of technology.

Live online classes are small to support highly interactive engaged learning and collaboration, deliver the same learning objectives as classroom courses, and count towards the completion of a Georgetown certificate.

You will experience:

  • live lectures, discussions, activities, and the dynamic exploration of topics and concepts
  • the immediacy of live connections with the instructor and your peers to challenge and encourage innovative thinking and the meaningful exploration of ideas
  • scheduled lectures, discussions, and presentations with a comparable level of interaction as classroom attendance
  • networking and the creation of social connections with other professional learners in a global classroom



Required fields are indicated by .