DSCI 5180 (Spring 2020) Project Instructions and Guidelines
Data Analytics is a subject that is best appreciated when applied to a dataset you are
familiar with. The aim of this project is to give you that experience. Do not view this project as
a hurdle in the course, but rather as a bridge connecting the topics you learnt to your work or
subject domain.
There are five main modules in this course:
Module 1 : Normal Distribution (Percentiles, distribution of means, and chance of occurrence if
we assume a normal distribution)
Module 2 : Confidence Interval Estimation (Including Sample Size determination)
Module 3 : Inferences from data (Hypothesis testing, i.e., checking whether a claim made
about the data holds. In this module, we dealt with only one sample)
Module 4 : More Inferences from data (Multiple samples)
Module 5 : Regression analysis (Both simple and multiple, along with basic ANOVA)
Objective : The purpose of the project is for you to apply what you learnt in at least 4
of these modules to your dataset and draw inferences or make estimates from it.
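As a rough illustration of what applying several modules to one dataset might look like, here is a minimal Python sketch using only the standard library. The data values, the threshold of 12.5, and the claimed mean of 12.0 are all made up for illustration and are not part of the course material. The interval and test below use the normal (z) approximation for simplicity; with a small sample, a t-based version would usually be more appropriate.

```python
from math import sqrt
from statistics import NormalDist, mean, stdev

# Hypothetical measurements, invented purely for illustration.
sample = [12.1, 11.8, 12.4, 12.0, 11.9, 12.3, 12.2, 11.7, 12.5, 12.1]
n = len(sample)
x_bar = mean(sample)
s = stdev(sample)  # sample standard deviation

# Module 1: chance of observing a value above 12.5,
# assuming the data follow a normal distribution.
fitted = NormalDist(mu=x_bar, sigma=s)
p_above = 1 - fitted.cdf(12.5)

# Module 2: 95% confidence interval for the mean (z approximation).
z = NormalDist().inv_cdf(0.975)
margin = z * s / sqrt(n)
ci = (x_bar - margin, x_bar + margin)

# Module 3: one-sample, two-sided z test of the claim "mean = 12.0".
claimed_mean = 12.0  # hypothetical claim
z_stat = (x_bar - claimed_mean) / (s / sqrt(n))
p_value = 2 * (1 - NormalDist().cdf(abs(z_stat)))

# Module 5: simple linear regression by least squares
# on a second hypothetical pair of variables.
x = [1, 2, 3, 4, 5]
y = [2.1, 3.9, 6.2, 7.8, 10.1]
x_mean, y_mean = mean(x), mean(y)
s_xy = sum((xi - x_mean) * (yi - y_mean) for xi, yi in zip(x, y))
s_xx = sum((xi - x_mean) ** 2 for xi in x)
slope = s_xy / s_xx
intercept = y_mean - slope * x_mean

print(f"P(value > 12.5)  = {p_above:.3f}")
print(f"95% CI for mean  = ({ci[0]:.3f}, {ci[1]:.3f})")
print(f"z = {z_stat:.3f}, p-value = {p_value:.3f}")
print(f"fit: y = {intercept:.3f} + {slope:.3f} x")
```

In an actual project, each of these numbers would be accompanied by a sentence interpreting it in the context of your own dataset.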