TL;DR
This paper introduces a novel model-based clustering method for longitudinal life-course sequences, using mixtures of exponential-distance models to identify typical career trajectories and their predictors from categorical sequence data.
Contribution
It develops a new clustering approach that directly models sequences with exponential-distance mixtures, incorporating covariates and sampling weights for improved analysis.
Findings
School examination performance is the key predictor of career trajectory clusters.
The method provides closed-form parameter estimation for sequence dissimilarities.
Identifies distinct career trajectory patterns in Northern Irish youth data.
Abstract
Sequence analysis is an increasingly popular approach for analysing life courses represented by ordered collections of activities experienced by subjects over time. Here, we analyse a survey data set containing information on the career trajectories of a cohort of Northern Irish youths tracked between the ages of 16 and 22. We propose a novel, model-based clustering approach suited to the analysis of such data from a holistic perspective, with the aims of estimating the number of typical career trajectories, identifying the relevant features of these patterns, and assessing the extent to which such patterns are shaped by background characteristics. Several criteria exist for measuring pairwise dissimilarities among categorical sequences. Typically, dissimilarity matrices are employed as input to heuristic clustering algorithms. The family of methods we develop instead clusters…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
