M O N A T A R

InfoTech Unit Avatar

FIT5196 Data Wrangling

Chief Examiner

This field records the Chief Examiner for unit approval purposes. It is not published, and can only be edited by Faculty Office staff.

To update the published Chief Examiner, you will need to update the Faculty Information/Contact Person field below.

Lan Du

NB: This view restricted to entries modified on or after 19990401000000

Unit Code, Name, Abbreviation

FIT5196 Data Wrangling (27 Feb 2015, 4:29pm) [Data Wrangling (27 Feb 2015, 4:29pm)]

Reasons for Introduction

Reasons for Introduction (27 Feb 2015, 4:30pm)

Data Science is a rapidly expanding field in industry and many leading universities in the USA and the UK are starting data science degrees and units. Monash FIT trialed a data science unit for the undergraduate business IT course, FIT3152, in 2013, which is continuing, mostly focusing on data analysis. This Data Wrangling unit will start in 2016. It is a core elective unit in Master of Data Science (on-campus) and a core unit in Graduate Diploma in Data Science (Monash Online).

Reasons for Change (21 Sep 2020, 1:52pm)

2/3/2015 - reworded pre-requisite for admin clarification.

28/7/2015 - Removed FIT5145 as a prerequisite as agreed at GDDS Steering Committee meeting 27/7/2015. Effective immediately.

07/08/2015 - Admin: submitted changes to synopsis made by CE. Also numbered learning outcomes.

10/08/2015 - Admin: update to format of Synopsis removing dot points and making it a paragraph as required for the handbook.

19/02/2016 - Refined unit synopsis and objectives for S2 and TP3; heavy SQL material was removed because of inadequate teaching resources.

4/05/2016 - added justification for the 100% in-semester assessment.

31-Aug-2016- upgraded LOs and prerequisites according to the latest discussion on refreshing the unit

01/09/2016 - Admin: numbered learning outcomes.

07/09/2016 - Admin: Update learning outcomes as per GPC 4/16 (Item 7.1).

09/01/2019- Lan: changed labs to tutorials

18/10/2019: Updating prerequisites to include FIT9136. Effective S1, 2020.

11/11/2019: FEC 5/19 approved prereq amendments entered 18/10/2019. Also approved was a prereq waiver clause for students studying C6007 Master of AI. Effective 2020.

21/09/2020 - Admin: Update to include new assessment and teaching approach fields as per Handbook requirements.

Role, Relationship and Relevance of Unit (27 Feb 2015, 4:32pm)

No existing unit is similar to this one. It is complementary to FIT5148 and FIT5149, and will be taught using Python as the programming environment.

Objectives

Objectives (08 Sep 2016, 09:04am)

Upon successful completion of this unit, it is expected that students will be able to:

  1. Parse data in the required format
  2. Assess the quality of data for problem identification
  3. Resolve data quality issues ready for the data analysis process
  4. Integrate data sources for data enrichment
  5. Document the wrangling process for professional reporting
  6. Write program scripts for data wrangling processes
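
As a rough illustration of objectives 1 and 2, the sketch below parses a small CSV sample and lists its data quality problems using only the Python standard library. The sample data, the column names and the helper functions `parse_rows` and `assess_quality` are hypothetical and are not taken from the unit materials.

```python
import csv
import io
import re

# Hypothetical raw input: one non-ISO date and two missing values.
RAW = """id,name,joined,score
1,Alice,2016-03-01,7.5
2,Bob,01/03/2016,
3,,2016-03-05,9.0
"""

ISO_DATE = re.compile(r"^\d{4}-\d{2}-\d{2}$")


def parse_rows(text):
    """Parse CSV text into a list of dicts, one per record."""
    return list(csv.DictReader(io.StringIO(text)))


def assess_quality(rows):
    """Return (row id, column, problem) tuples for each issue found."""
    issues = []
    for row in rows:
        # Flag every empty field in the record.
        for column, value in row.items():
            if value == "":
                issues.append((row["id"], column, "missing"))
        # Flag dates that are present but not in YYYY-MM-DD form.
        if row["joined"] and not ISO_DATE.match(row["joined"]):
            issues.append((row["id"], "joined", "non-ISO date"))
    return issues


rows = parse_rows(RAW)
issues = assess_quality(rows)
for issue in issues:
    print(issue)
```

Keeping the assessment step separate from any repair step means the list of issues can be reported (objective 5) before the data is changed.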

Unit Content

ASCED Discipline Group Classification (27 Feb 2015, 4:34pm)

020399

Synopsis (19 Feb 2016, 1:03pm)

This unit introduces tools and techniques for data wrangling. It will cover the problems that prevent raw data from being effectively used in analysis and the data cleansing and pre-processing tasks that prepare it for analytics. These tasks include the handling of bad and missing data, data integration and initial feature selection. It will also introduce text mining and web analytics. Python and the Pandas environment will be used for implementation.
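
A minimal sketch of two of the tasks named in the synopsis, handling bad and missing data and data integration, is given here using pandas. The tables, column names and the choice of median imputation are assumptions made for illustration only; the unit itself does not prescribe them.

```python
import pandas as pd

# Hypothetical tables: orders with one missing and one invalid amount,
# and a customer lookup that lacks an entry for customer 3.
orders = pd.DataFrame({
    "customer_id": [1, 2, 2, 3],
    "amount": [20.0, None, 35.0, -5.0],
})
customers = pd.DataFrame({
    "customer_id": [1, 2],
    "city": ["Melbourne", "Sydney"],
})

# Bad data: treat negative amounts as invalid and mark them as missing.
cleaned = orders.copy()
cleaned.loc[cleaned["amount"] < 0, "amount"] = float("nan")

# Missing data: impute with the median of the remaining valid amounts.
median_amount = cleaned["amount"].median()
cleaned["amount"] = cleaned["amount"].fillna(median_amount)

# Integration: a left join keeps every order; the indicator column
# records which orders found no matching customer.
merged = cleaned.merge(customers, on="customer_id", how="left", indicator=True)
unmatched = merged[merged["_merge"] == "left_only"]

print(merged)
```

Median imputation is only one option; dropping the affected rows or flagging them for review may be more appropriate depending on the downstream analysis.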

Prescribed Reading (for new units) (21 Sep 2020, 1:43pm)

Technological requirements

Students will need Python 3 and Jupyter Notebook. Download both Python 3.7 and Jupyter Notebook from https://www.anaconda.com/distribution/

Teaching Methods

Mode (27 Feb 2015, 4:39pm)

on-campus, Monash Online

Assessment

Assessment Summary (21 Sep 2020, 1:48pm)

In-semester assessment: 100%

  1. Assessment 1: Parsing Raw Data and Text Preprocessing - 35% (part 1: 20%, part 2: 15%) - ULO: 1, 2, 3
  2. Assessment 2: Data Cleansing and Integration - 35% - ULO: 2, 3, 4, 5, 6
  3. Assessment 3: End-of-term Quiz - 30% - ULO: 1, 2, 3, 4

Workloads

Workload Requirements (09 Jan 2019, 12:52pm)

Minimum total expected workload equals 144 hours per semester comprising:

  1. Contact hours for on-campus students:
    • Two hours/week lectures.
    • Two hours/week tutorials.
  2. Contact hours for Monash Online students:
    • Two hours/week online group sessions.
    • Online students generally do not attend lecture, tutorial and laboratory sessions, but should plan to spend equivalent time working through resources and participating in discussions.
  3. Additional requirements:
    • A minimum of 8 hours per week of personal study (22 hours per week for Monash Online students) for completing lab/tutorial activities, assignments, private study and revision, and, for online students, participating in discussions.

Resource Requirements

Teaching Responsibility (Callista Entry) (27 Feb 2015, 4:41pm)

FIT

Prerequisites

Prerequisite Units (11 Nov 2019, 12:40pm)

FIT9133 or FIT9136 or equivalent; or entry into C6007.

Proposed year of Introduction (for new units) (27 Feb 2015, 4:43pm)

2016

Location of Offering (27 Feb 2015, 4:43pm)

Caulfield

Faculty Information

Proposer

Wray Buntine

Approvals

School: 11 Nov 2019 (Emma Nash)
Faculty Education Committee: 11 Nov 2019 (Emma Nash)
Faculty Board: 11 Nov 2019 (Emma Nash)
ADT:
Faculty Manager:
Dean's Advisory Council:
Other:

Version History

27 Feb 2015 Wray Buntine Initial Draft; modified UnitName; modified Abbreviation; modified ReasonsForIntroduction/RIntro; modified ReasonsForIntroduction/RoleRelationshipRelevance; modified UnitObjectives/ObjText; modified UnitObjectives/ObjCognitive; modified UnitObjectives/ObjPsychomotor; modified UnitObjectives/ObjSocial; modified UnitObjectives/ObjAffective; modified UnitContent/ASCED; modified UnitContent/Synopsis; modified UnitContent/PrescribedReading; modified Teaching/Mode; modified Assessment/Summary; modified Workload/ContactHours; modified ResourceReqs/SchoolReqs; modified Prerequisites/PreReqUnits; modified Prerequisites/PreReqKnowledge; modified DateOfIntroduction; modified LocationOfOffering; modified UnitObjectives/Objectives
27 Feb 2015 Jeanette Niehus FIT5196 Chief Examiner Approval, ( proxy school approval )
27 Feb 2015 Jeanette Niehus FEC Approval
27 Feb 2015 Jeanette Niehus FacultyBoard Approval - FEC Executive Approval given 27/2/2015
02 Mar 2015 Trudi Robinson Reworded pre-requisite for admin clarification.
03 Mar 2015 Jeanette Niehus FIT5196 Chief Examiner Approval, ( proxy school approval )
25 Mar 2015 Jeanette Niehus FIT5196 Chief Examiner Approval, ( proxy school approval )
25 Mar 2015 Jeanette Niehus FEC Approval
25 Mar 2015 Jeanette Niehus FacultyBoard Approval - GPC executive approval given 16/03/2015
28 Jul 2015 Trudi Robinson Removed FIT5145 as a prerequisite as agreed at GDDS Steering Committee meeting 27/7/2015.
06 Aug 2015 Wray Buntine Initial Draft; modified UnitObjectives/Objectives; modified UnitContent/Synopsis
07 Aug 2015 Jeanette Niehus Admin: modified ReasonsForIntroduction/RChange; modified UnitObjectives/Objectives
10 Aug 2015 Jeanette Niehus Admin: modified ReasonsForIntroduction/RChange; modified UnitContent/Synopsis
10 Aug 2015 Jeanette Niehus FIT5196 Chief Examiner Approval, ( proxy school approval )
10 Aug 2015 Jeanette Niehus FEC Approval
10 Aug 2015 Jeanette Niehus FacultyBoard Approval - GPC executive approval given 10/08/2015
19 Feb 2016 Lan Du modified UnitObjectives/Objectives; modified UnitContent/Synopsis; modified UnitObjectives/Objectives; modified FacultyInformation/FIContact
23 Feb 2016 Lan Du modified ReasonsForIntroduction/RChange
17 Mar 2016 Jeanette Niehus FIT5196 Chief Examiner Approval, ( proxy school approval )
17 Mar 2016 Jeanette Niehus FEC Approval
17 Mar 2016 Jeanette Niehus FacultyBoard Approval - GPC executive approval given 16/03/2016
03 May 2016 Jeanette Niehus Admin: modified Chief Examiner
04 May 2016 Lan Du modified Assessment/Summary; modified ReasonsForIntroduction/RChange
11 May 2016 Jeanette Niehus FIT5196 Chief Examiner Approval, ( proxy school approval )
11 May 2016 Jeanette Niehus FEC Approval
11 May 2016 Jeanette Niehus FacultyBoard Approval - Approved at GPC 2/16 on 5/5/2016
31 Aug 2016 Lan Du modified UnitObjectives/Objectives; modified Prerequisites/PreReqUnits; modified ReasonsForIntroduction/RChange
01 Sep 2016 Jeanette Niehus Admin: modified ReasonsForIntroduction/RChange; modified UnitObjectives/Objectives
08 Sep 2016 Jeanette Niehus Admin: modified ReasonsForIntroduction/RChange; modified UnitObjectives/Objectives
08 Sep 2016 Jeanette Niehus FIT5196 Chief Examiner Approval, ( proxy school approval )
08 Sep 2016 Jeanette Niehus FEC Approval
08 Sep 2016 Jeanette Niehus FacultyBoard Approval - Approved at GPC 4/16, Item 7.1
09 Jan 2019 Lan Du modified Workload/ContactHours; modified ReasonsForIntroduction/RChange
20 Feb 2019 Emma Nash FIT5196 Chief Examiner Approval, ( proxy school approval )
20 Feb 2019 Emma Nash FIT5196 Chief Examiner Approval, ( proxy school approval )
20 Feb 2019 Emma Nash FEC Approval
20 Feb 2019 Emma Nash FacultyBoard Approval - Executively approved by DGP 20/02/2018
18 Oct 2019 Emma Nash modified ReasonsForIntroduction/RChange; modified Prerequisites/PreReqUnits
28 Oct 2019 Emma Nash FIT5196 Chief Examiner Approval, ( proxy school approval )
28 Oct 2019 Emma Nash FEC Approval
28 Oct 2019 Emma Nash FacultyBoard Approval - Minor amendment approved at GPC 5/19.
11 Nov 2019 Emma Nash modified ReasonsForIntroduction/RChange; modified Prerequisites/PreReqUnits
11 Nov 2019 Emma Nash FIT5196 Chief Examiner Approval, ( proxy school approval )
11 Nov 2019 Emma Nash FEC Approval
11 Nov 2019 Emma Nash FacultyBoard Approval - Approved at FEC 5/19.
21 Sep 2020 Joshua Daniel modified UnitContent/PrescribedReading; modified Assessment/Summary
21 Sep 2020 Joshua Daniel modified ReasonsForIntroduction/RChange

This version: