Postgraduate taught 

Data Analytics MSc/PgDip/PgCert: Online distance learning

Predictive Modelling (ODL) STATS5076

  • Academic Session: 2024-25
  • School: School of Mathematics and Statistics
  • Credits: 10
  • Level: Level 5 (SCQF level 11)
  • Typically Offered: Semester 2
  • Available to Visiting Students: No
  • Taught Wholly by Distance Learning: Yes
  • Collaborative Online International Learning: No

Short Description

This course introduces students to predictive models for regression and classification.

Timetable

The course mostly consists of asynchronous teaching material.

Excluded Courses

Linear Models 3

Statistics 3L: Linear Models

Regression Models (Level M)

Co-requisites

-/-

Assessment

100% Continuous Assessment

The continuous assessment will typically be made up of one class test, a report, and three homework exercises, including online quizzes. Full details are provided in the programme handbook.

Course Aims

The aims of this course are:

■ to introduce students to predictive modelling using multiple linear regression as a showcase;

■ to present some of the distributional theory underpinning the normal linear models and the associated methods for testing and interval estimation;

■ to explain how the design matrix of a linear model can be constructed to accommodate categorical covariates or, through basis expansions, non-linear effects;

■ to describe and contrast several common methods for model assessment as well as variable and model selection;

■ to show students how to implement these statistical methods using the R computer package.

Intended Learning Outcomes of Course

By the end of this course students will be able to:

■ formulate normal linear models in vector-matrix notation and apply general results to derive ordinary least squares estimators in particular contexts;

■ construct a design matrix incorporating categorical covariates or covariates with a nonlinear effect;

■ derive, evaluate and interpret point and interval estimates of model parameters;

■ conduct and interpret hypothesis tests in the context of the Normal Linear Model;

■ derive, evaluate and interpret confidence and prediction intervals for the response at particular values of the explanatory variables;

■ assess the assumptions of a normal linear model using residual plots and diagnostics;

■ make use of and critique different methods for assessing the performance of a predictive model such R2 or AIC/BIC and use these for model or variable selection;

■ identify scenarios where data may be considered to be smooth functions and apply suitable data analysis techniques;

■ implement these statistical methods using the R computer package;

■ frame statistical conclusions clearly.

Minimum Requirement for Award of Credits

Students must submit at least 75% by weight of the components (including examinations) of the course's summative assessment.