Social and Economic Statistics

A crowd of people

The Social and Economic Statistics group specialises in interdisciplinary social research, psychometrics, quantitative criminology and economic time series methods.

Interdisciplinary Social Research

Our social research focuses on developing methods for the analysis of social data and applying those methods to research problems in the health and social sciences. Application areas focus on foetal development and movement; psychology (child development), sociology (social care and looked after children) and crime (modern slavery and human trafficking). Members of the group often work in close collaboration with other departments at Lancaster, and with police and government organisations.

Statistical areas of development include missing data, new latent class analysis methods, statistical disclosure control, high-frequency count data in surveys; score building through log-linear modelling; estimation of hidden populations; longitudinal trajectory methods and synthetic data.

Psychometrics

Research in the social and behavioural sciences often hinges upon objective measurement of latent variables representing unobservable latent constructs. Associated statistical techniques include item response modelling, structural equation modelling, and more general latent variable models.

Methodological work in this area is focussed on topics such as the structural learning of latent variable models, multivariate outlier detection, change-point detection and algorithmic fairness. Recent or ongoing practical applications of such work include detection of cheating or item bias in standardised education tests, validation of indicators of infant and child development (in collaboration with the WHO) and evaluation of the psychometric properties of tools to assess patients' psychosocial needs.

Quantitative Criminology

The focus of this research area has been the analysis of criminal career data. This has involved the development of group-based trajectory models and latent growth mixture models to develop methods for the analysis of concepts such as escalation in offending , patterns of offending over time, precursors to homicide and serious sexual assault, and sexual and general recidivism, using police offending data. More recently, focus has shifted to victimisation surveys. examining issues of measurement of interpersonal violence, domestic violence and threats of violence and developing measures of harm. Cross-national comparative work has also been undertaken.

Economic Time Series Methods

Governments and institutions such as central banks are constantly looking to obtain more frequent, and timely information on the economy. However, relying on traditional survey-centric economic statistics presents challenges as they often lack granularity and are published with significant delays. Our research in collaboration with the Office for National Statistics, considers how modern model-selection techniques can be used to increase the resolution of survey-based measurements by using information from so-called “fast-indicators”, such as traffic flow, supermarket scanner data, or VAT returns. Extending these approaches within a Dynamic Factor model framework enables the construction of real-time estimates of economic activity whilst accounting for missing data.

A further area of methodological interest is the analysis and computation of parameter estimators associated with the volatility models of financial time series, such as the GARCH model. In particular, estimators are proposed that are robust and perform well even under various deviations from the underlying model assumptions. Applications of robust estimators are applied to the analysis of financial risk measures. Both theoretical and empirical properties of these estimators and the bootstrap approximations of their distributions are of interest.

Case study: James Jackson

James Jackson started as a PhD student in 2019 and researched the development of synthetic data techniques for administrative data sources, a topic of great interest to official statisticians. He proposed a novel approach using saturated count models, exploring both under-dispersed and over-dispersed distributions, including the little-used discretised gamma family of distributions.

He graduated in 2022 and was jointly supervised by Robin Mitra and Brian Francis, with collaboration and further supervision from the Office for National Statistics. Funded by an ESRC NWSSDTP CASE studentship, he was a joint winner of the departmental Nick Smith prize for the best-year PhD student. He amassed four publications during his time in Lancaster, including a prestigious paper in the Journal of the Royal Statistical Society, Series A. He is now working as a research associate at Turing-Roche on advanced missing data problems.

Case study: Hang Liu

Hang Liu graduated in 2021 with his PhD thesis title `Robust Estimation for GARCH Models and VARMA Models’ and is now an Associate Professor at University of Science and Technology of China in Hefei, China.

Hang started his PhD in October 2017 after being awarded an ESRC North West Social Science Doctoral Training Partnership Studentship; prior to starting his PhD, he completed his Master's in Quantitative Finance at Lancaster University. His thesis contributes to the theory and methodology of robust estimation for two time series models: the generalized autoregressive conditional heteroscedasticity (GARCH) model and the vector autoregressive moving-average (VARMA) model. By the time of his graduation, his papers on his thesis were both accepted in JASA and The Econometrics Journal.