Self-Expressive Subspace Clustering to Recognize Motion Dynamics of a Multi-Joint Coordination for Chronic Ankle Instability
Shaodi Qian, Sheng-Che Yen, Eric Folmar, Chun-An Chou

TL;DR
This study introduces a nonlinear subspace clustering framework to analyze multi-joint motion patterns for diagnosing chronic ankle instability, achieving over 70% accuracy and revealing significant joint interactions.
Contribution
It develops a novel analytic method combining subspace clustering and SVM to objectively identify CAI from biosensor data, improving diagnosis over traditional methods.
Findings
Achieved >70% classification accuracy in distinguishing CAI from healthy controls.
Identified significant motion pattern differences in hips and other joints.
Demonstrated potential for supporting clinical diagnosis and rehabilitation decisions.
Abstract
Ankle sprains and instability are major public health concerns. Up to 70% of individuals do not fully recover from a single ankle sprain and eventually develop chronic ankle instability (CAI). The diagnosis of CAI has been mainly based on self-report rather than objective biomechanical measures. The goal of this study is to quantitatively recognize the motion pattern of a multi-joint coordination using biosensor data from bilateral hip, knee, and ankle joints, and further distinguish between CAI and healthy cohorts. We propose an analytic framework, where a nonlinear subspace clustering method is developed to learn the motion dynamic patterns from an inter-connected network of multiply joints. A support vector machine model is trained with a leave-one-subject-out cross validation to validate the learned measures compared to traditional statistical measures. The computational results…
| Feature Learning | Statistical | Correlation | PCA | SSC | KSSC |
| Testing Hit Rate | 57.49% | 56.72% | 52.25% | 68.50% | 72.16% |
| Training Hit Rate | 85.23% | 77.73% | 70.53% | 76.13% | 83.13% |
| Feature Learning | Statistic | Correlation | PCA | SSC | KSSC |
| Testing Voting Accuracy | 61.70% | 57.45% | 51.06% | 74.47% | 80.85% |
| Training Voting Accuracy | 91.91% | 79.56% | 74.42% | 75.53% | 86.36% |
| Baseline | Physical joints | Correlation based | SSC based | |||||
| Hips | Knees | Ankles | Cluster I | Cluster II | Cluster I | Cluster II | ||
| Hit rate | 50.69% | 60.53% | 43.40% | 46.09% | 50.21% | 60.15% | 67.37% | 50.04% |
| 55.32% | 65.96% | 44.68% | 46.81% | 48.94% | 65.96% | 68.09% | 51.06% | |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Self-Expressive Subspace Clustering to Recognize Motion Dynamics for Chronic Ankle Instability
Shaodi Qian
Department of Mechanical & Industrial Engineering, Northeastern University, Boston, MA 02115
Sheng-Che Yen
Departments of Physical Therapy, Movement & Rehabilitation Science, Northeastern University, Boston, MA 02115
Eric Folmar
Departments of Physical Therapy, Movement & Rehabilitation Science, Northeastern University, Boston, MA 02115
Chun-An Chou Corresponding author. Department of Mechanical & Industrial Engineering, Northeastern University, Boston, MA 02115
Abstract
Ankle sprains and instability are major public health concerns. Up to 70% of individuals do not fully recover from single ankle sprains and eventually develop chronic ankle instability (CAI). The diagnosis of CAI has been mainly based on self-report rather than objective biomechanical measures. The goal of this study is to quantitatively recognize the motion patterns of a multi-joint coordinate system using gait data of bilateral hip, knee, and ankle joints, and further distinguish CAI from control cohorts. We propose an analytic framework, where the concept of subspace clustering is applied to characterize the dynamic gait patterns in a lower dimensional subspace from an inter-dependent network of multiply joints. A support vector machine model is built to validate the learned measures compared to traditional statistical measures in a leave-one-subject-out cross validation. The experimental results showed 70% classification accuracy on average for the dataset of 47 subjects (24 with CAI and 23 controls) recruited to examine in our designed experiment. It is found that CAI can be observed from other joints (e.g., hips) significantly, which reflects the fact that there exists inter-dependency in the multi-joint coordinate system. The proposed framework presents a potential to support clinical decisions using quantitative measures during diagnosis, treatment, rehabilitation of gait abnormality caused by physical injuries (e.g., ankle sprains in this study) or even central nervous system disorders.
Keywords: Chronic Ankle Instability, Gait Data Analysis, Subspace Learning, Pattern Recognition, Decision Model
1 Introduction
The most common type of ankle injuries is lateral ankle sprains where individuals roll the foot inward and damage the lateral ankle structures Garrick (\APACyear1977). It is reported that up to 70% of individuals may eventually develop chronic ankle instability (CAI) following a significant ankle sprain Anandacoomarasamy \BBA Barnsley (\APACyear2005). CAI is characterized by persistent pain and swelling, episodes of ankle giving way, and recurrent ankle sprains Hertel (\APACyear2002). Individuals with CAI have been reported to have diminished physical activity Hubbard-Turner \BBA Turner (\APACyear2015) and lower quality of life Houston \BOthers. (\APACyear2015). More problematically, emerging evidence has linked CAI to future development of irreversible, posttraumatic ankle osteoarthritis Golditz \BOthers. (\APACyear2014). The high prevalence and associated medical costs make CAI a significant health concern Soboroff \BOthers. (\APACyear1984).
Despite the clinical significance of CAI, the etiology of this medical problem remains unclear. It has been hypothesized that CAI may be caused by peripheral issues such as reduced ankle proprioception Garn \BBA Newton (\APACyear1988); Forkin \BOthers. (\APACyear1996) and weak peroneal muscles Bosien \BOthers. (\APACyear1955); Tropp (\APACyear1986). However, some counter-evidence has shown that individuals with CAI do not necessarily have reduced proprioception Refshauge \BOthers. (\APACyear2000) and ankle strength deficits are not highly correlated with chronic ankle instability Kaminski \BBA Hartsell (\APACyear2002). More recently, researchers started questioning if central control issues also contribute to CAI Hass \BOthers. (\APACyear2010). A main reason was that individuals with CAI often show deviations in walking and running gaits Monaghan \BOthers. (\APACyear2006); Delahunt \BOthers. (\APACyear2006).
Human locomotion requires the control from the central level. Theories such as the central pattern generator have been proposed to explain how the central nervous system controls gait Duysens \BBA Van de Crommert (\APACyear1998); Dimitrijevic \BOthers. (\APACyear1998). Previous studies have found increased ankle inversion Monaghan \BOthers. (\APACyear2006); Delahunt \BOthers. (\APACyear2006), increased ankle plantarflexion Chinn \BOthers. (\APACyear2013); Drewes, McKeon, Kerrigan\BCBL \BBA Hertel (\APACyear2009), and altered spatial and temporal parameters Gigi \BOthers. (\APACyear2015) during walking and/or running in individuals with CAI. These findings suggest that the central control of gait is altered in these patients, and support that CAI is not only a peripheral issue but also a central issue.
Gait represents a complex control problem in which redundant degrees of freedom must be constrained and coordinated to create a smooth pattern Van Emmerik \BOthers. (\APACyear2005). Thus, examining inter-joint or inter-segment coordination may be able to generate further insight into the central control issue in individuals with CAI. A review of literature revealed that most gait studies on CAI focused on examining individual joints rather than coordination Moisan \BOthers. (\APACyear2017). Few studies examined if CAI is associated with coordination change during gait, but they only looked at coordination between two segments Herb \BOthers. (\APACyear2014); Drewes, McKeon, Paolini\BCBL \BOthers. (\APACyear2009) or two joints Yen \BOthers. (\APACyear2017), e.g., ankle-ankle or hip-ankle. A possible reason may be that the current measurements for coordination, such as vector coding and continuous relative phase, can only quantify the coordination between two body components Hamill \BOthers. (\APACyear2000).
Owing to a significant advancement of sensor technologies, modeling and analysis of high-volume, high-resolution time series data demonstrate successes in various applied domains of healthcare, security, and system monitoring in data-rich environments. Phinyomark \BOthers. (\APACyear2018); Gowsikhaa \BOthers. (\APACyear2014). In particular, gait data analysis is focused on pattern recognition of gait abnormality caused by physical injuries (e.g., ankle sprains Punt \BOthers. (\APACyear2015)) or central nervous system disorders (e.g., Parkinson’s disease (PD) Pirker \BBA Katzenschlager (\APACyear2017); Tucker \BOthers. (\APACyear2015); Duhamel \BOthers. (\APACyear2006) and stroke Marrocco \BOthers. (\APACyear2016)). The essential idea of these studies attempted to extract statistical measures from motion time series, and examine the significance of extracted features using statistical tests (e.g., -test or ANOVA) or statistical learning methods. However, a challenging task is to find discriminant inter-dependent network patterns, instead of independent features, from a multi-variate, high-dimensional gait data space. Most recent researches investigated new quantitative methods for gait pattern recognition of central nervous system disorders, but not of disabilities caused by physical injuries. In this paper, we present the research to specifically model a chronic motion disability problem (i.e., CAI) and identify discriminant patterns based on the self-expression of multi-variate gait data.
In this study, the overarching goal is to characterize critical information (measurements) that can summarize and/or represent the motion behaviors of individuals with CAI using gait kinematic data for a multi-joint coordinate system of bilateral hip, knee, and ankle joints. During our designed running experiments, biosensors were placed to record the locomotions of sagittal, frontal, and transverse planes. We present the multi-joint coordinate system as an inter-connected network as it is hypothesized that there exists inter-dependency between hip, knee, and ankle joints. We then propose an analytic framework that first extracts the network patterns of subspaces learned from multi-variate kinematic data during running, and then develop a decision model using support vector machine (SVM) to validate and distinguish individuals with CAI from control subjects.
The organization of this paper is as follows. In Section 2, we review relevant research for clinical ankle sprains diagnosis and assessment, followed by a review of quantitative methods for functional data analysis. In Section 3, we describe data acquisition of our designed experiment and then present the proposed analytic framework that involves both unsupervised learning and supervised learning of mutli-variate gait data. In Section 4, computational results are demonstrated and compared between conventional data extraction and our method. Finally, the conclusion and future work are addressed in Section 5.
2 Related Work
2.1 Clinical Diagnosis and Assessment of Ankle Sprains
Traditional clinical assessment for CAI is mainly based on self report. For example, Hiller et al. developed Cumberland Ankle Instability Tool (CAIT) to assess subjective symptoms of CAI Hiller \BOthers. (\APACyear2006). Martin et al. developed a self-report survey called The Foot and Ankle Ability Measure (FAAM) that has been widely used in CAI assessment Martin \BOthers. (\APACyear2005). Another self-report assessment for CAI is the Ankle Instability Instrument developed by Docherty et al. Docherty \BOthers. (\APACyear2006). These tools have been recommended by he International Ankle Consortium to determine if an individual has CAI Gribble \BOthers. (\APACyear2014). However, self-report answers may be subject to a number of biases (e.g., social desirability bias) that could affect the reliability and accuracy of the measures. To address this problem, these self-reported tools should be used in conjunction with objective measures.
Previous studies have used lab tools to objectively measure ankle proprioception, balance control, ankle muscle strength, and the reaction time to ankle inversion perturbation in individuals with CAI, but the results were often inconsistent Holmes \BBA Delahunt (\APACyear2009); Gutierrez \BOthers. (\APACyear2009). For example, some studies reported that individuals with CAI have delayed reaction time to inversion perturbation Konradsen \BBA Ravn (\APACyear1990); Karlsson \BBA Andreasson (\APACyear1992), but others did not find similar results Ebig \BOthers. (\APACyear1997). Recently, more studies have examined walking and/or running patterns in individuals with CAI, and the results have suggested that CAI may be associated with deviations in these patterns Monaghan \BOthers. (\APACyear2006); Delahunt \BOthers. (\APACyear2006); Chinn \BOthers. (\APACyear2013); Drewes, McKeon, Kerrigan\BCBL \BBA Hertel (\APACyear2009); Gigi \BOthers. (\APACyear2015); Yen \BOthers. (\APACyear2017). Walking and/or running could be used as a motor task to objectively differentiate control subjects and those with CAI.
Existing gait research on individuals with CAI often focus on examining the ankle kinematics Monaghan \BOthers. (\APACyear2006); Delahunt \BOthers. (\APACyear2006); Chinn \BOthers. (\APACyear2013). However, gait is a complex motor task that involves all leg joints. Inter-joint coordination could be a parameter used to determine if an individual develops CAI. For example, Yen et al. used a vector coding method to examine hip-ankle coupling during walking, and found that the coupling pattern was different between control subjects and those with CAI Yen \BOthers. (\APACyear2017). A weakness of this study was that the coordination measure was limited to hip-ankle coupling in the affected side in the frontal plane, and deviations may exist in other couplings that were not measured. This weakness was due to limitation in current methods to quantify a coordination pattern. Predominate methods such as vector coding and continuous relative phase can only quantify the coupling between two kinematic trajectories Hamill \BOthers. (\APACyear2000).
2.2 Pattern Recognition and Analysis of Human Motion Data
Pattern recognition and analysis of kinematic/gait data aim to discover the insights into human motion behaviors through statistical and machine learning methods; recent research studies were surveyed in Phinyomark \BOthers. (\APACyear2018). While statistical methods including -test, ANOVA and covariance analysis were used to analyze and test the coupling or grouping across subjects based on statistic measurements for various applications of kinematic and/or physiological data Park \BOthers. (\APACyear2017), advanced techniques of functional data analysis and machine learning demonstrated more promising results in regard to high-dimensional multi-variate gait data Park \BOthers. (\APACyear2017). In various health related studies, supervised learning methods, such as SVM, -nearest neighbors (KNN), linear discriminative analysis (LDA), neural network (NN), were employed for predicting or classifying in between of target and control cohorts, and usually combined with dimensionality reduction approaches, such as principle component analysis (PCA) to discover information from a high-dimensional space Deluzio \BBA Astephen (\APACyear2007); Coffey \BOthers. (\APACyear2011); Fukuchi \BOthers. (\APACyear2011); Eskofier \BOthers. (\APACyear2012); Andrade \BOthers. (\APACyear2013); Phinyomark \BOthers. (\APACyear2014); Janidarmian \BOthers. (\APACyear2015); Derlatka \BBA Bogdan (\APACyear2015); Tucker \BOthers. (\APACyear2015); Watari \BOthers. (\APACyear2016); Phinyomark \BOthers. (\APACyear2016); Rida \BOthers. (\APACyear2016). In addition, some researches were focused on modeling motion dynamics of multi-variate kinematic data using hidden Markov model Mannini \BBA Sabatini (\APACyear2012) and Bayesian network Moon \BBA Pavlović (\APACyear2008) for gait pattern recognition.
For dimensionality reduction of multi-variate time series data, sparse subspace learning (SSL) or sparse subspace clustering (SSC) is particularly advantageous for reducing the effect of noise while segmenting data in the original space into (multiple) low-dimensional subspaces associated with different groups or functions Wang \BBA Xu (\APACyear2016); Bahadori \BOthers. (\APACyear2015). The idea is to learn an affinity matrix from the original data using sparse or low-rank minimization techniques, and subsequently apply spectral clustering to this affinity matrix to find independent subspaces Elhamifar \BBA Vidal (\APACyear2013). This subspace clustering method is centered around a self-expressive assumption that a sample point (or times series in a channel) can be linearly represented by other sample points (or channels) in a certain subspace. Furthermore, more efficient algorithms were proposed to solve for the affinity matrix Li \BOthers. (\APACyear2017). In our study, we adopt the same self-expressive assumption as the previous works for dynamic data modeling, and solve for the affinity matrix using an efficient algorithm, called alternating direction method of multipliers algorithm (ADMM) Boyd \BOthers. (\APACyear2011). It can compute matrix factorization for multiple time series simultaneously. In another work, the self-expression model was solved and further analyzed by other algorithms, e.g., SEED method Dyer \BOthers. (\APACyear2015) and fast iterative shrinkage-thresholding algorithm Zhang \BOthers. (\APACyear2018). The latter research developed a dynamic functional subspace learning method to generate a time-varying correlation matrix for estimating motion dynamics, applied to change point detection Zhang \BOthers. (\APACyear2018). A major advantage is that large fluctuations are prevented in estimating over time. However, their algorithm is not computationally efficient and computes individual time series one at a time. In a previous study Elhamifar \BBA Vidal (\APACyear2013), they considered SSC results of geometric interpretation, i.e., principal angles between subspaces, for object detection and clustering in video footage, and face recognition (photo clustering). Meanwhile, SSC concept is found in many applications e.g., image compression Hong \BOthers. (\APACyear2006), motion segmentation Rao \BOthers. (\APACyear2010), video segmentation Elhamifar \BBA Vidal (\APACyear2013), social community clustering Chen \BOthers. (\APACyear2014), and gene expression profiles clustering McWilliams \BBA Montana (\APACyear2014). In this paper, we consider the SSC concept in investigating chronic motion disability of multi-variate kinematic data. In our analytical framework, with the self-expressive assumption, the SSC result (sparse subspace matrix) is obtained by optimizing the objective function using the ADMM algorithm and used as discriminative features for CAI pattern classification.
3 Materials and Methods
3.1 Data Acquisition and Processing
In this study, we acquired human motion data in running mode from 47 subjects in total, among which 24 subjects with CAI and 23 control subjects. There are 36 females (18 subjects with CAI and 18 controls) and 11 males (6 subjects with CAI and 5 controls). Subjects were recruited from the Northeastern University community. Screening exams were conducted for all potential subjects to determine their suitability for the study based on the inclusion and exclusion criteria. We followed the selection standards endorsed by the International Ankle Consortium to set the inclusion criteria for CAI Gribble \BOthers. (\APACyear2014). All subjects with CAI had a significant ankle sprain at least one year before being enrolled in this study. In addition, they scored 24 or lower in CAIT and had more than one episode of ankle giving ways in the past 6 months.
All qualified subjects were asked to participate in a single session, in which they were asked to run on a treadmill for one minute. Their running performances and behaviors were captured by a 7-camera 3D motion capture system (Qualisys AB, Sweden). Reflective markers were placed on the major bony landmarks of the pelvis and left and right legs in order to create biomechanical models of bilateral hip, knee, and ankle. The recorded marker data were then analyzed using Visual3D (C-Motion, MD). The resultant variables are bilateral hip, knee, and ankle position changes in each of the sagittal, frontal, and transverse planes over time. Limited by the number of subjects, we consider running cycles (unit cycle) in each subject, instead of individual subjects, as samples. Each joint trajectory was segmented into cycles (defined as the interval between two consecutive initial contacts of the same foot) for further analysis. On average, each session contains around forty cycles varied by subject’s running posture. Figure 1 shows the kinematic data collected from the motion capture system for both CAI and control subjects. For each joint, there are three motion time series in x-axis, y-axis, and z-axis presenting sagittal, frontal, and transverse planes. There are eighteen channels in each motion dataset. We define channels 1-3 representing the sagittal, frontal, and transverse planes on right hip, channels 4-6 representing the sagittal, frontal, and transverse planes for left hip, and so on. Since the total number of time points in running cycles varies in different subjects, we applied a linear interpolation method to ensure all cycles’ length to 84 time points, which is the longest running cycle in the dataset. Then, motion time series data were detrended and z-scored to avoid variability between subject postures and potential human errors during data collection. As a result, for a subject, there are 40 matrices, denoted by where and .
3.2 Proposed Method
In this section, we describe our proposed analytic framework composed of two phases to recognize the dynamic motion patterns across channels in the multi-joint coordinate system for discriminating and predicting subjects with CAI from a control cohort. Note that in this study, we use a term ‘channel’ to express sagittal, frontal, or transverse plane of a joint. Figure 2 illustrates the concept of our proposed method. In the first phase, we learn and extract significant clustering (or inter-connection) patterns of segmented motion signals. For each running cycle of individual subjects, motion signals undergo the detrending and z-scoring pre-processing steps. Then, a sparse subspace clustering Elhamifar \BBA Vidal (\APACyear2013) is adopted to obtain clustering coefficients (CCs) to represent the dynamical inter-relationships among channels. In the second phase, we consider a linear SVM-based classification model using the estimated CCs as input features for performing a binary classification task between CAI and control cohorts. Both channel-based and network-based modeling strategies (CM1 and CM2) are proposed. First, for a channel, we use its clustering coefficients (i.e., a column vector in the CC matrix) as input features, highlighted in red rectangular in Figure 2, which present the correlation of other channels to it due to the self-expression assumption. Second, we compute a symmetric CC matrix. Then a clustering analysis is further applied to find a more representative network, and use all CCs (in the triangular CC matrix), highlighted in yellow rectangular, as input features. The detailed methodology is presented as follows.
3.2.1 Self-Expressive Sparse Subspace Learning
Let us consider a set of multi-variate motion time series , where = 18, in a multi-joint coordinate system. It is considered as a dynamic system of multiple units where each unit linearly or nonlinearly interact with each other at different connectivity degree and can be presented in certain subspaces , where is the number of subspaces. In this study, it is assumed that each motion time series is self-expressive and presented by a linear combination of some other motion time series and they share the same subspace. In other words, each motion time series can be represented by the basis functions in a subspace shared with other motion time series, which are highly correlated with each other while those in different subspaces have no or weak correlations. Note that the number of subspaces () need not to be pre-determined and will be determined by solving a self-expressive sparse subspace clustering problem.
The mathematical formulation of self-expressive model is presented as follows:
[TABLE]
where Y is the original time series in channels and is a coefficient vector. To find a lower-dimensional subspace shared by original channels, we aim to minimize the coefficient values. The optimization problem for SSC is then formulated as follows:
[TABLE]
After solving this minimization problem, a sparse solution of is obtained, where the non-zero elements express the coefficients of channel to channel . Furthermore, we extend to solve the whole system by introducing a coefficient matrix C = . The extended optimization problem can be written as
[TABLE]
Since noises usually appear in real data, we assume where represents the real signals and is independent and identically distributed noise with zero mean. We rewrite the model in Equation (3) as follows:
[TABLE]
In the objective function, we still minimize the coefficients and add a penalty term to minimize the fitting errors, where is the penalty parameter, subject to the same constraints. By solving the model Equation (4), we reveal corresponding subspaces and obtain a sparse representation of CC matrix C, in which the nonzero elements represent the channels from the same subspace. Ideally, if both coefficients and are nonzero, then we consider channel and share the same subspace.
In real-life cases, however, inter-dependency and/or interactions are not usually linear explicitly. We herein introduce a kernel trick, a commonly used technique in pattern recognition, to map the original motion time series in a non-linear dimensional space, where subspace can be linearly expressed. Then, we can rewrite the optimization problem for kernel sparse subspace clustering (KSSC) as follows Patel \BBA Vidal (\APACyear2014):
[TABLE]
where is a mapping function and . A positive semi-definite kernel matrix is defined as and Gaussian kernel is used in our study. It is noted that the inter-dependency/interactions are still linear in the mapped subspace.
3.2.2 ADMM Solution Approach
To solve both SSC and KSSC optimization problems, we adopt a well-developed alternating direction method of multipliers (ADMM) algorithm Boyd \BOthers. (\APACyear2011). We present the solution process for KSSC optimization problem and it can be done for SSC optimization problem in a similar way. We first rewrite the model in Equation (5) by introducing an auxiliary matrix as follows Elhamifar \BBA Vidal (\APACyear2013):
[TABLE]
Then, we add two penalty terms to make the problem convex in terms of variables C and A, where is the trade-off parameter between the two terms. The convex optimization problem is expressed as follows:
[TABLE]
Note that the models in Equation (LABEL:aff) and Equation (LABEL:aff2) share the same solution since all feasible solutions that satisfy all the constraints ensure two penalty terms equal to zero. Furthermore, we derive a new formulation for KSSC optimization problem by applying Lagrange multipliers a vector \textbf{\delta}\in R^{N} and a matrix \textbf{\Delta}\in R^{N\times N} as follows:
[TABLE]
The ADMM is aimed to blend the decomposability and solving this convex optimization problem by decomposing it into small sub-problems in sequence and solving individual variables one at a time Boyd \BOthers. (\APACyear2011). The solving procedure iterates until C is converged within an acceptable error or a pre-set maximum iteration is reached. For any iteration :
Update by optimizing Equation (8) with respect to A with fixed , and . We obtain by calculating the following.
[TABLE] 2. 2.
Update by optimizing Equation (8) with respect to C with updated and fixed and . We obtain
[TABLE]
Note that is the shrinkage-thresholding operator applied on each element in matrix and is defined as . Operator returns its argument if non-negative, and returns zero otherwise. 3. 3.
Update by optimizing Equation (8) with respect to E with updated and , and fixed and .
[TABLE] 4. 4.
Updating and with updated .
[TABLE]
[TABLE]
The pseudo-code is summarized in Algorithm 1.
For optimizing hyper-parameters and , we used grid search method to find the best settings in terms of the highest cross-validation accuracy: and = 800 for SSC problem, and and for KSSC problem.
3.2.3 Feature Extraction and Representation
For input features used in a pattern classification task in the next section, we propose two strategies such that motion pattern is recognized and distinguished in the learnt subspace. First, to examine the discriminative power, we consider individual channel-based vector in the clustering coefficient matrix (see CM1 in Figure 2). Each channel-based vector contains a set of non-zero coefficients of other channels correlating to channel in the same subspace. Channel-based vectors from all cycles of all subjects are aggregated to form an input feature set. There are sets of input features to be performed in the classification task separately.
Second, we calculate an affinity symmetric matrix to present the network (or connectivity) pattern across all channels (see CM2 in Figure 2). We employ a state-of-the-art clustering technique, called density-based spatial clustering of applications with noise (DBSCAN) Ester \BOthers. (\APACyear1996), to find clustering subspaces , in which each cluster is formed by channels that interact with each other closely and dynamically; that is, connectivities in between are strong. As DBSCAN is aimed to discover clusters of input channels, two parameters are required to set up: is a distance threshold to determine the closeness for the neighborhood of a given sample and is a threshold to limit the number of neighbors to form a cluster. In our study, we consider the Euclidean distance in the feature space. To partition all channels into two parts for all subjects, we set the threshold equal to the median distance since we like to include as many channels as possible in the neighborhood and = 4 for which we generate consistent clustering result across all subjects. DBSCAN starts with an arbitrary channel and includes all other channels that are directly density-reachable as its neighborhood is satisfied with the two parameter thresholds. This process iterates until no new channels being added to this neighborhood. Note that this is computationally fast since there are only 18 channels for clustering. As a result, two major clusters () are formed. We then perform sparse subspace learning, described in Section 3.2.1, on those channels included in individual clusters and , respectively. For each cluster, we obtain an updated clustering matrix (see FR2 in Figure 2), where the elements in the upper-triangle part in are referred to as network coefficients in the new subspace.
3.2.4 Pattern Classification
With the learned sparse clustering matrices and in Section 3.2.2, we propose to train a SVM-based classification model for performing binary classification tasks between CAI and control cohorts. The objective is to validate the effectiveness of the learned motion patterns for distinguishing CAI subjects from control subjects. SVM is considered as one of robust supervised learning methods in various applied problems of high-dimensional data Hearst \BOthers. (\APACyear1998). The idea of SVM is to find a hyperplane with a maximum margin that allows to separate the samples in two classes (i.e., CAI versus control). Let us have training data as samples presented by feature set and binary class associated with the samples. A hyperplane is defined as
[TABLE]
where is a weight vector and is a scalar. A sample is decided to belong to class if ; otherwise, it belongs to class if . To find an optimal hyperplane , one can solve a constrained optimization problem to optimize the in-between margin, shown as follows:
[TABLE]
We solve this classification problem using a widely used library LIBSVM Chang \BBA Lin (\APACyear2011) with the default settings. For the sake of interpretation, a linear kernel is used throughout the computational experiments in this study. The optimized weights tell the features importance; the larger positive or negative value is, the more discriminative the feature is. See the resultant example in Figure 7. The detail will be discussed in the latter section.
4 Computational Results
In this section, for CAI dynamic motion pattern recognition, we perform our proposed subspace clustering (SSC and KSSC) methods compared to traditional methods including statistic features and Pearson correlation, and PCA. We train a linear SVM model with these extracted features for classification in a leave-one-subject-out cross validation schame; each subject of 40 sample cycles is left for testing iteratively. We also examine the goodness-of-fit of our methods, and obtain R-squared equal to 0.98 on avereage for all subjects. In Figure 3, we illustrate the fitted curves (in red) give a good approximation to the original running curves (in blue) for one cycle of all channels in Subject 1.
4.1 Results for Channel-based Subspace
To examine the discriminative power of learnt motion patterns in different subspaces, we perform a classification task for individual learnt subspaces using channel-based vectors in the matrix . We evaluate using two measures: (1) hit rate HR = for individual subjects, where true positive () is the number of correctly classified sample cycles (matching true class) and is the total number of sample cycles, and (2) majority voting accuracy (). Subject is correctly classified as the percentage of correctly classified sample cycles exceeds 50%. The results based on the two evaluation metrics are presented in Figures 4 and 5, respectively. In Figure 4, for both SSC and KSSC features, we see that the only subspace based on Channel 1 (sagittal plane of right hip) results in 70% testing hit rate, followed by the other subspace based on Channel 18 (transverse plane of left ankle). The rest of subspaces based on other channels results in testing hit rate. In Figure 5, with SSC features, there is the only one subspace based on Channel 1 (sagittal plane of right hip) resulting in testing . The subspaces based on Channels 3, 11, 13, 14, and 15 result in testing , and there exists a relatively larger gap between training and testing accuracies. With KSSC features, there is the only one subspace based on Channel 1 (sagittal plane of right hip) resulting in testing and the rest of subspaces based on other channels results in testing .
It is interesting that most CAI subjects can be distinguished from the hip position (Channel 1) rather than ankle position (Channels 13-18). The motion abnormality on knees along has little influence or impact on the running patterns of subjects. It is noted that there a big gap (around 20-40%) between testing and training accuracies except Channel 1, but is commonly seen in cross-subject validation due to high variation between samples in different subjects. Limited by the number of subjects, it is difficult to eliminate the bias between subjects with different body conditions, such as BMI, gender, dominant and affected leg. It may be nothing related to over-fitting in the classification model building.
In Figure 6, we illustrate the classification results of both SSC and KSSC methods for all subjects are reported based on the best subspace of Channel 1 (sagittal plane of right hip). Correct prediction is colored in blue, wrong prediction is colored in white, and no data entry is colored in black. Subjects with CAI are framed in red and control subjects are framed in green. As observed, with SSC features, we classified running cycles correctly for most subjects except CAI Subjects 2, 4, 6, 9, 10, 11, and 16, and control Subjects 28, 29, 43, 44, and 45. With KSSC features, we classified running cycles correctly for most subjects except for CAI Subjects 2, 6, 9, 16, and 19, and control Subjects 9, 35, 43, and 44.
We further compare classification performance of our proposed SSC and KSSC methods to other statistical and PCA based extraction methods. We compute statistic features: mean and variance of motion time series. Pearson correlation is calculated to measure the linear relationships for all pairs of two motion time series and a correlation matrix is formed. PCA, a classical dimensionality reduction technique, is used to map a set of motion time series into a set of uncorrelated time series (called principal components) in a new space. We consider the top three principal components that account for 99.9 percent of variance in the original data space. Then we calculate the mean and variance of principal components to form a feature vector. Table 1 presents all experimental results and shows that our proposed SSC and KSSC methods outperforms other methods by .
Our experiments also show that KSSC method outperforms SSC method. It means motion patterns are more discriminative on in a higher dimensional (non-linear) space, but not the original space. However, direct inter-dependency between joints /channels in motion patterns becomes not easily interpretable since all channels are tangle with each other.
4.2 Results for Network-based Subspace
Consider the multi-joint coordinate system as a network of 18 nodes (representing channels). We use the network coefficients , computed in Section 3.2.3, to describe the connectivity pattern among channels. We preform DBSCAN on every cycle for all subjects and obtained resulting clusters for every cycle. Then, to obtain the aggregating cluster information for all subjects, we consider two channels in the same subspace only if more than 50% of subjects cluster these two channels into the same subspace. It turns out that two representative SSC-based clusters are formed in our experiment. Cluster I, resulting in higher classification accuracy, includes Channels 1, 2, 5, 7, 10, 11, 16, 17 and 18, for which Channels 1, 2 and 6 locate at hip joints, Channels 7, 10 and 11 locate at knee joints, and Channels 16, 17 and 18 locate at left ankle joint. The remaining channels are included in Cluster II. We then solve the SSC problem to obtain new network coefficients for Clusters I and II, which are used for pattern classification task. Note that we do not consider KSSC method because of burdensome interpretation of inter-dependency among channels.
Table 2 presents the testing results (hit rate and majority voting accuracy) for network-based subspace. We compare our method to other clustering methods based on human physical joints (hips, knees, and ankles) and Pearson correlation, as well as the baseline (including all channels in a cluster). Using the same DBSCAN setting, two Pearson correlation-based clusters are found. For all resultant clusters, we employ the SSC method for the channels in the clusters to obtain network coefficients . The subspace (Cluster I) learnt by our SSC method results in the highest accuracy (68%). Between two correlation-based clusters, Cluster II results in a higher accuracy (65%) and includes Channels 3, 4, 5, 6, 8, 11, 13, 15 and 16, for which most channels locate on hips and ankles. If we subjectively select the joints for testing, we find that hip joint is the best cluster compared to knees and ankles.
In addition, it is noticed that the testing accuracy (68.09%) from network-based algorithm is lower than one (74.47%) from channel-based algorithm. It might be because of some information loss from considering the only channels in the sub-cluster. However, for network-based subspace learning, we intended to observe the inter-dependency between channels in the representative subspace. In Figure 7, we present the weights of SVM model using all subjects as training dataset for both correlation-based Cluster II and SSC-based Cluster I. As known, in SVM, features with higher weights (regardless of sign) are more important for classification. Inter-dependency corresponding to higher weights are found between channels in hip and ankle joints. In summary, recognizing motion (running) abnormality for CAI may be diagnosed with hip joints rather than ankle joints. It is suspected that subjects with CAI in our study may use muscles around hips to adjust their posture to protect their ankle from sprains.
5 Conclusion
In this work, we studied a physical injury problem caused by ankle sprains, which develops a long-term and chronic instability in human daily movements. While most of recent studies were focused on the application of traditional statistic methods to biomechanical gait data, in this research we proposed a new analytic method to quantitatively learn the functional subspace that represents the original multivariate gait motion data acquired from the 3D motion capture system. We expected that there are joint effects as a network system, i.e., a multi-joint coordinate system of bilateral ankles, knees, and hips. Our proposed sparse subspace clustering algorithms attempted to learn significant subspaces in a lower dimensional space such that original motion dynamics can be easily characterized. A SVM classification model was trained with the extracted network-based features and validated for subjects with CAI compared to control subjects. We obtained higher classification performances by 10% compared to the statistic-based features (that resulted in 55-60% accuracy). Moreover, we found that there indeed exist joint effects among these channels from our analysis. The motion instability caused by ankle sprains are found significantly on the hip joints in the studied subjects. Subjects with CAI might use hip to stabilize and balance during movement. We have shown a potential that this proposed model can be applied to support the decisions in diagnosis, treatment, and rehabilitation. In this study, we extracted features from individual cycles, which may contain only local segment information. For future work, we will investigate global motion behaviors by extracting dynamic features from multiple running cycles in a sequence in the experiment, which may improve the pattern classification performance. We will further investigate abnormal detection problem - when motion behaviors begin to deteriorate during movement in individual subjects with CAI. Further, a dynamic modeling approach is suggested to obtain a functional clustering tensor. Given that our current study does not consider the geometry structure of multi-varate gait data, incorporating geometry information may improve the understanding of the motion patterns in the human body system.
Acknowledgement
This work is supported by Northeastern TIER 1 Seed Grant Program.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Anandacoomarasamy \BBA Barnsley ( \APA Cyear 2005) \APA Cinsertmetastar anandacoomarasamy 2005 long {APA Crefauthors} Anandacoomarasamy, A. \BCBT \BBA Barnsley, L. \APA Cref Year Month Day 2005. \BBOQ \APA Crefatitle Long term outcomes of inversion ankle injuries Long term outcomes of inversion ankle injuries. \BBCQ \APA Cjournal Vol Num Pages British Journal of Sports Medicine 393e 14–e 14. \Print Back Refs \Current Bib
- 2Andrade \B Others . ( \APA Cyear 2013) \APA Cinsertmetastar Andrade 2013 {APA Crefauthors} Andrade, A., Polese, J \BPBI C., Paolucci, L., Menzel, H \BHBI J. \BCBL \BBA Teixeira-Salmela, L. \APA Cref Year Month Day 201310. \BBOQ \APA Crefatitle Functional Data Analyses for the Assessment of Joint Power Profiles During Gait of Stroke Subjects Functional data analyses for the assessment of joint power profiles during gait of stroke subjects. \BBCQ \APA Cjournal Vol Num Pages Journal of appli
- 3Bahadori \B Others . ( \APA Cyear 2015) \APA Cinsertmetastar bahadori 2015 functional {APA Crefauthors} Bahadori, M \BPBI T., Kale, D., Fan, Y. \BCBL \BBA Liu, Y. \APA Cref Year Month Day 2015. \BBOQ \APA Crefatitle Functional subspace clustering with application to time series Functional subspace clustering with application to time series. \BBCQ \B In \APA Crefbtitle International Conference on Machine Learning International conference on machine learning ( \BPGS 228–237). \Print Bac
- 4Bosien \B Others . ( \APA Cyear 1955) \APA Cinsertmetastar bosien 1955 residual {APA Crefauthors} Bosien, W \BPBI R., Staples, O \BPBI S. \BCBL \BBA Russell, S \BPBI W. \APA Cref Year Month Day 1955. \BBOQ \APA Crefatitle Residual disability following acute ankle sprains Residual disability following acute ankle sprains. \BBCQ \APA Cjournal Vol Num Pages JBJS 3761237–1243. \Print Back Refs \Current Bib
- 5Boyd \B Others . ( \APA Cyear 2011) \APA Cinsertmetastar boyd 2011 distributed {APA Crefauthors} Boyd, S., Parikh, N., Chu, E., Peleato, B. \BCBL \BBA Eckstein, J. \APA Cref Year Month Day 2011. \BBOQ \APA Crefatitle Distributed optimization and statistical learning via the alternating direction method of multipliers Distributed optimization and statistical learning via the alternating direction method of multipliers. \BBCQ \APA Cjournal Vol Num Pages Foundations and Trends® in Machine lea
- 6Chang \BBA Lin ( \APA Cyear 2011) \APA Cinsertmetastar chang 2011 libsvm {APA Crefauthors} Chang, C \BHBI C. \BCBT \BBA Lin, C \BHBI J. \APA Cref Year Month Day 2011. \BBOQ \APA Crefatitle LIBSVM: a library for support vector machines Libsvm: a library for support vector machines. \BBCQ \APA Cjournal Vol Num Pages ACM Transactions on Intelligent Systems and Technology 2327. \Print Back Refs \Current Bib
- 7Chen \B Others . ( \APA Cyear 2014) \APA Cinsertmetastar Chen 2014 {APA Crefauthors} Chen, Y., Jalali, A., Sanghavi, S. \BCBL \BBA Xu, H. \APA Cref Year Month Day 2014 \APA Cmonth 01. \BBOQ \APA Crefatitle Clustering Partially Observed Graphs via Convex Optimization Clustering partially observed graphs via convex optimization. \BBCQ \APA Cjournal Vol Num Pages J. Mach. Learn. Res.1512213–2238. \Print Back Refs \Current Bib
- 8Chinn \B Others . ( \APA Cyear 2013) \APA Cinsertmetastar chinn 2013 ankle {APA Crefauthors} Chinn, L., Dicharry, J. \BCBL \BBA Hertel, J. \APA Cref Year Month Day 2013. \BBOQ \APA Crefatitle Ankle kinematics of individuals with chronic ankle instability while walking and jogging on a treadmill in shoes Ankle kinematics of individuals with chronic ankle instability while walking and jogging on a treadmill in shoes. \BBCQ \APA Cjournal Vol Num Pages Physical Therapy in Sport 144232–239. \Pr
