Areas of Expertise
- Population Health Research
- SAIL Databank
- Data Linkage
- Spatial Data Analysis
The module aims at raising the awareness of students about scientific computing. It provides a brief overview of computation and focuses on the computational needs and workflows that health data scientists most often employ. Students will also learn about the professional context within which health data scientists operate.
Health data scientists are expected to work with diverse data sources. However, due to the abstraction offered by modern database management systems, these data sources can be treated similarly through a set of standardised operations. The objective of this module is to raise the awareness of students about the process of data modelling and the key operations involved in the data processing of large and diverse datasets.
This module introduces the topic of linked health data analysis at an introductory to intermediate level. It fills a gap in research training opportunities by combining the principles of health care epidemiology with hands-on practical exercises in the implementation of computing solutions. The module provides students with a theoretical grounding in the classroom on each topic, followed by a training session on the corresponding computing solutions. Students use de-identified linked data files in the hands-on exercises. The computing component of the module assumes a basic familiarity with computing syntax used in programs such as SPSS, SAS or STATA and methods of basic statistical analysis of fixed-format data files.