Methods of analysis of data from longitudinal studies allow us to make use of their rich data and to explore the temporal relationships between measures collected across different life stages. Regression analysis is an important and widely-used technique for exploring the relationship between an outcome (e.g. later-life health) and possible explanatory variables (e.g. early-life circumstances). We can gain important insights in social science, biomedical and health research by studying a range of factors throughout the life course, including physical and mental health, and socioeconomic and behavioural factors
In this module you will learn about:
Suggested citation: Moulton, V., O’Neill, D., Park, A. & Ploubidis, G.B. (2020). Analysing longitudinal data. CLOSER Learning Hub, London, UK: CLOSER
This section introduces some of the important fundamentals of analysing data from longitudinal studies and describes how regression techniques can be used to explore variables relating to different points in an individual’s life course.
Longitudinal data analysis can be used to explore how characteristics and experiences from early life can influence later outcomes, while taking account of other childhood factors. In this module, we will use an extract of data from the NCDS CLOSER Training Dataset (CTD) to examine the relationship between intelligence test scores at the age of 11 years and BMI at age 42 years. This section will provide you with guidance on accessing relevant data, undertaking exploratory data analysis and preparing the data for the more advanced statistical modelling covered in subsequent sections.
This section introduces a method, known as general linear regression, that can be used to examine how an outcome that has been measured on a continuous scale is associated with potentially explanatory variables. We offer a step-by-step illustration of how we can use this important statistical analysis approach to explore such associations in longitudinal data.
This section discusses a method that can be used to analyse the association between a dichotomous (two-category) outcome measure and potentially explanatory variables. This method is a widely used approach and the following guide provides a detailed illustration of how we can use this logistic regression method to answer research questions with longitudinal data.
This section provides guidance on a method that can be used to explore the association between a multiple-category outcome measure and other potentially explanatory variables. Multinomial logistic regression can offer us useful insights when we are working with longitudinal data and this section breaks down and discusses each of the key steps involved.