Undergraduate 

Computing Science (faster route) BSc/MSci

Data Product Engineering H COMPSCI4107P

  • Academic Session: 2024-25
  • School: School of Computing Science
  • Credits: 10
  • Level: Level 4 (SCQF level 10)
  • Typically Offered: Semester 1
  • Available to Visiting Students: Yes
  • Collaborative Online International Learning: No

Short Description

In this course, students will learn the process for how to design, build, test, deploy, maintain, and monitor scalable and robust data products using the Data Product Life Cycle (DPLC). Students will gain hands-on experience working with datasets and use cases, collaborating in teams, and applying agile methodologies to deliver data products that meet the needs of real world stakeholders. The course will cover the entire DPLC process, including experimentation and productization, with a focus on reliability, fault tolerance, scalability, deployment, and meeting regulatory requirements. The course will prepare students for careers in data & digital technology, equipping them with the knowledge and skills required to work in cross-functional teams and navigate complex regulatory requirements.

Timetable

1 hour lecture per week and 2 hours of practical labs per week

Excluded Courses

NA

Co-requisites

Machine Learning H (or equivalent) is recommended but not required

Assessment

■ 10% Weekly in class quiz to cover knowledge.

■ 50% Repeated single slide presentation on different aspects of the product journey based on weekly topics. *** Each person delivers one slide to be assessed on (developed slide collaboratively) *** 

■ Development of a hypothesis driven experiment

■ Presentation of the rationale for the exploring the hypothesis of business benefits (quality improvement, risk reduction, cost savings/avoidance, increased revenue/margin, customer satisfaction)

■ Link to importance of non-functional criteria

■ Assess both vision/roadmap and execution towards this in implementation and link to non-functional considerations (fault tolerance, data privacy, security.)

■ 40% On the product itself. 

■ Assume a dashboard to be developed

■ Power BI/Tableaux/Cloud deployment

Are reassessment opportunities available for all summative assessments? No

Reassessments are normally available for all courses, except those which contribute to the Honours classification. Where, exceptionally, reassessment on Honours courses is required to satisfy professional/accreditation requirements, only the overall course grade achieved at the first attempt will contribute to the Honours classification. For non-Honours courses, students are offered reassessment in all or any of the components of assessment if the satisfactory (threshold) grade for the overall course is not achieved at the first attempt. This is normally grade D3 for undergraduate students and grade C3 for postgraduate students. Exceptionally it may not be possible to offer reassessment of some coursework items, in which case the mark achieved at the first attempt will be counted towards the final course grade. Any such exceptions for this course are described below. 

Course Aims

To ensure the students learn about building large scale collaborative data products in an enterprise environment. The course aims to focus on machine learning and large language model development at scale.

Intended Learning Outcomes of Course

By the end of this course students will be able to:

 

1. Articulate the phases, workflow and key outputs of the data product life-cycle

2. Identify when and how to experiment to prove or disprove a hypothesis

3. Explain how to productionise a successful hypothesis as part of a team across the enterprise

4. Identify, describe and perform the key roles in the Data Product Life-cycle / Data Product Engineering

5. Justify data product design choices aligned with business and regulatory requirements

 

Minimum Requirement for Award of Credits

Students must submit at least 75% by weight of the components (including examinations) of the course's summative assessment.