General Movements Assessment and Amiel-Tison Neurologic Examination in Neonates and Infants: Correlations and Prognostic Values Regarding Neuromotor Outcomes
Adrian Ioan Toma, Vlad Dima, Lidia Rusu, Andreea Necula, Roxana Pavalache Stoiciu, Larisa Andrășoaie, Andrada Mirea, Anca Roxana Bivoleanu

TL;DR
This study explores how two neurological exams predict cerebral palsy and motor delays in infants, finding that combining them improves prediction accuracy.
Contribution
The study demonstrates that combining General Movements Assessment and Amiel-Tison exams improves predictive value for cerebral palsy and motor delays in infants.
Findings
Absence of fidgety movements at 12 weeks CA is the strongest predictor for CP and motor delays.
Combining GM and AT exam results improves prediction of CP risk and delayed sitting in preterm infants.
Logistic regression models using combined exam findings reached statistical significance for CP and motor outcomes.
Abstract
Background: Our study aimed to investigate whether the general movements assessment (GMA) and Amiel-Tison Neurologic Examination performed at term-equivalent age (TEA) and 12 weeks corrected age (CA) could predict the occurrence of cerebral palsy (CP) and delayed gross motor milestones in a sample of term and preterm infants and whether the predictive values could be increased by using the two examinations during the same visit. Methods: A total of 70 infants (62 preterm and 8 at term) were examined at TEA and 12 weeks CA using GMs (General Movements) and Amiel-Tison (AT) examinations. We determined the correlation between the results of the examinations and several selected items (scarf sign, popliteal angle, and axial tone) and neuromotor outcomes (presence of cerebral palsy (CP), independent sitting, and independent walking). We attempted to build binary logistic regression models…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8
Figure 9
Figure 10Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInfant Development and Preterm Care · Neonatal and fetal brain pathology · Cerebral Palsy and Movement Disorders
1. Introduction
Timely identification and referral for early intervention are an essential part of the management of NICU (Neonatal Intensive Care Unit) graduates at risk of cerebral palsy or motor impairment [1]. Even though an accurate diagnosis of cerebral palsy can usually be established around 18–24 months corrected age [2,3], studies have shown that early identification of infants at risk is possible using special clinical examination techniques [4].
Neurologic examinations of neonates and young infants can be performed using different tools and techniques [5], based on different concepts that will be briefly reviewed below.
Firstly, neurologic exams can be performed classically, as a formal neurologic examination similar to the neurologic examination of older children and adults but considering the maturational features of neonates and young infants, as suggested by Volpe [6]. They are based on the classic concepts of Sherrington regarding the reflex arc, excitation, and response—a normal response will imply the integrity of the pathway, while an abnormal or absent response will suggest an abnormality [7].
There are other examination techniques using the same theoretical basis of response to an external stimulus but specifically developed for neonates and infants using tone as the main feature: the Hammersmith Neonatal Neurologic Examination [8] and the updated Amiel-Tison Neurological Assessment at Term (ATNAT) [9,10]. The interpretation of these examinations is based on the optimality concept [11,12], i.e., it is easier to predict a normal outcome following a normal examination. To ensure consistent results, in the case of preterm neonates, examinations are performed at 40 weeks corrected age (Term Equivalent Age—TEA) [8,9,10]. The physiological basis of the Amiel-Tison neurologic examination is the concept of two control systems for motor function. The subcortical system, based in the cerebellum and brainstem, matures in a caudal-to-cranial manner and controls the flexor tone and anti-gravity maintenance of posture. The cortical (pyramidal) system, based in the hemispheres and basal ganglia, matures in a descending cranial-to-caudal pattern and is responsible for the progressive acquisition of motor control, such as head control, sitting, and walking, during the first year of life [9,10,12]. At 40 weeks corrected age (term-equivalent age), the superior system begins to take power (prise de pouvoir—in French) from the inferior subcortical system, at the level of the flexor and extensors of the neck. This allows the active tone—motricité dirigéé—to be tested by placing the body in a certain position and expecting a certain response in the tone. The pull to sit and reverse manoeuvres elicit contractions of the flexors and extensors of the neck, obtaining a balance that results in normal situations in the maintenance of the head control for a few seconds [12,13]. The exam includes items structured in several domains: cranial exam, neurosensory (ocular and hearing), spontaneous motor activity, passive tone (upper limbs, lower limbs and trunk)—the angle between two segments (popliteal angle), the displacement with a controlled force from a certain point (scarf sign) or recoil (arms or legs), active tone (see above), archaic reflexes, palate and tongue, toleration to manipulation, and feeding autonomy. There are also noted special circumstances for the exam and the medical status of the patient [10,12]. Each item receives a certain score; there is no final score, though. Based on the predominance of normal or abnormal signs, the exam is scored as normal or abnormal (moderately or severely abnormal) [10,12]. This examination technique has the advantage of having versions that can be used in infants and toddlers and has the same principles [13]. The performance of the exam does not require formal training, although it has specific administration guidelines [5,9,10] and has been noted to have a good inter-observer reliability for most of the items [14]. Regarding the predictive value for the incidence of CP and motor dysfunction, a study performed on term neonates showed that all infants with optimal ATNA (Amiel-Tison Neurological Assessment) scores had normal neurological examinations on follow-up. None of the neonates with a severely abnormal examination had a normal neurological exam at follow-up; thus, the agreement between ATNA and the exam at follow-up was excellent (K = 0.83) [15]. In the case of very preterm infants, the combination both results from ATNAT and head ultrasound had the best predictive value. When using the neurologic exam alone, the probability of a normal outcome at 12 months was 94% [16].
The general movements assessment (GMA) proposed by Precthl uses a different paradigm [17]. This technique consists in the evaluation of spontaneous movements in the neonates and small infants, by the use of observation and videotaping [17,18]. The physiological basis of the method is that the normal general movements are spontaneous movements of the whole body, generated by a group of subcortical neurons (central pattern generators, CPGs) [19] and modulated by the cortical structures [20,21]. The lack of modulation by the cortical structures due to different lesions and pathologies results in an abnormal GM pattern [17,18,19,20,21]. The movements involve the entire body, with a variable sequence of movements of the head, neck, trunk, arms, and legs. Those movements are defined by a gradual beginning and end and variable amplitude, intensity, and speed [17]. Certain types of GMs are observed at different postnatal ages. In the neonatal period, until the age of 6–9 weeks, the normal movement pattern, called writhing movements, consists of movements involving the whole body, waxing and waning in intensity, force and speed, with fluent and elegant rotations of the limbs and trunk that create an impression of variability [17]. The abnormal movements in the neonatal period are either poor repertoire movements (monotonous movements that do not occur in a complex way) or cramped–synchronized movements (the limbs and trunk muscles contract and relax almost simultaneously) [17]. At 6–9 weeks, the normal pattern of movements changes to fidgety movements (movements of small amplitude, moderate speed, and variable in acceleration, probably representing fine-tuning of the proprioceptive system that occurs in the limbs, trunk and neck) [17]. The absence of fidgety movements is considered an abnormal pattern [17]. The GM disappear around 20 weeks [17,18] post-term when the voluntary movements appear; thus, the GMA examination could be used until approximately 20 weeks corrected age. Performing and interpreting the exam requires formal training under the tutorship of GM Trust. Since the introduction of the examination technique, it has been observed that GM is a good predictor for the occurrence of CP [17], and is considered the most valuable test in the first 3 months of life [2]. In a cohort of preterm neonates, it has been shown that the observation of the GM pattern was 100% sensitive for the identification of the risk of CP, and a cramped–synchronized GM pattern had a sensitivity of 92.5% and a specificity of 100% when it came to identifying the risk of CP [22]. A study performed on 130 patients showed that a persistent CS pattern and the absence of fidgety movements were strong predictors for the appearance of CP [23]. Another later study on a large cohort of infants from different continents showed that 95% of the children who later developed CP did not show fidgety movements [24].
Two studies compared the concordance of the two methods (GMA and Amiel-Tison Examination) regarding the prediction of the neurologic and developmental outcomes. A study performed on 45 preterm infants, followed for 12–15 months corrected age, showed the agreement between the two methods was excellent in the case of the examinations performed at term (k = 0.87) and good at in the case of the exams performed at 3 months (k = 0.54); the sensitivity was 92–100% for the Amiel-Tison exam and 96–100% for the GMA at term and 3 months [25]. In another study involving 103 very-low-birthweight infants regarding the outcome at 20 months, the GM exam performed at TEA had a positive predictive value of 89% and a negative predictive value of 84% and the Amiel-Tison exam had a positive predictive value of 33% and a negative predictive value of 88% for the identification of the outcomes [26].
The idea to combine the findings of two neurologic examinations in the same patients in order to increase the accuracy and the predictive value was found to be successful. An Italian group combined an exam based on classical neurologic examination principles focused on tone (HINE—The Hammersmith Infant Neurological Examination) with GMA, and the result has been an improvement in the early prediction of the neurodevelopmental outcome [27]. To date, no studies have assessed the combined findings of the ATNAT [9,12] or Amiel-Tison Neurologic Examination of the Infant and Children [13] and GMA [17,18] in the case of the follow-up of the NICU graduates. Thus, the research objectives of our group were as follows:
- -To identify the correlations between the normal and abnormal results of the GMA and Amiel-Tison neurologic examinations performed in a population of NICU graduates at 40 weeks corrected age (Term Equivalent Age—TEA) and 12 weeks corrected age.
- -To identify the prognostic value of the two examinations and the different items used for detecting the risk of cerebral palsy or the risk of gross motor delay (independent sitting and independent walking).
- -To verify by, using logistic regression models, if the combination of results from the two examinations at term-equivalent age and 12 weeks corrected age results in a better prognostic value than using the result of each examination separately.
We aimed to test these hypotheses in a sample of term and preterm NICU graduates and in a subgroup of former preterm neonates from the whole population sample.
2. Materials and Methods
2.1. Population Studied
The study was performed within the follow-up program of Medlife Medical Park, Bucharest. Approval was obtained according to the regulations from the Ethics Committee of MedLife Medical System (Decision No 1097/15 March 2023 renewed with decision No 11/19 November 2025). Informed consent was obtained from the parents of the patients.
The sample of patients studied was represented by 70 neonates—62 preterm and 8 term neonates with different conditions (see the Results Section), who were consecutively admitted to our program at their visit at TEA and fulfilled the criteria of being examined at TEA, 12 weeks, 7, 12, 18 and 24 months CA. In the case of the term neonates, all the patients fulfilled the 5 mandatory visits. In the case of preterm infants, 62/69 fulfilled the 5 visits and were considered for the final analysis. Seven patients were lost to follow-up—six fulfilled the TEA and 3-month visit, and five out of seven also came back for the 7-month visit. At the visit at 12 months corrected age, 62/69 came for the visit—after 12 months, no other patient has been lost to follow-up (consort chart, Figure 1). The patients were born in different units in Bucharest and in the country and were referred to the follow-up program of our clinic by their attending physicians. We attempted to enroll all 77 infants that presented to our program within the study period, but we are not aware of the number of infants fulfilling the inclusion criteria in all the units that sent patients during that time interval, or of the number of infants sent by the attending physicians to be enrolled in our program during the same time period. The initial non-enrolment could represent an additional source of selection bias. The inclusion criteria were as follows:
- -The patient belongs to a category of neonates at risk that needed to be included in the follow-up program for high-risk neonates, according to the guidelines of Romanian Neonatology Association [28], consistent with the international guidelines in the field [29].
- -The informed consent of the family has been obtained.
- -The patient was followed until 24 months corrected age or until a diagnosis of cerebral palsy (CP) was established—whichever occurred sooner.
The recruitment for the study started in June 2023 and ended in October 2023. The patients were followed until October 2025, when the last patient attained the 2-year CA milestone.
2.2. Neurologic Examinations
The protocol of the neurologic examination can be followed in Table 1.
At TEA and 12 weeks corrected age, we performed GMA assessment [17,18] and Amiel-Tison examinations corresponding to the age of the infant (ATNAT at TEA [9] or Amiel-Tison Neurologic Examination for the infant and child aged 0–6 years at 12 weeks CA) [13]. In the case of the subsequent visits (7,12,18, and 24 months CA), an Amiel-Tison examination with the appropriate items for the age was performed [13]. To avoid an interpretation bias, the GM and Amiel-Tison examinations were performed by different persons. The person performing the GM examinations (ARB) received appropriate training from the GM Trust, while the person performing AT examinations received training in Port Royal Hospital Paris and had 20 years of experience in this examination (AIT). Notably, he was also trained in GMA.
In the case of the General Movements Assessment (GMA) at TEA, the movement pattern was noted, according to the classic guidelines:
- -Normal (writhing pattern): movements of small-to-moderate amplitude, slow-to-moderate speed, involving in a variable sequence the head, neck, trunk, arms, and legs, with an elliptical form [17,18];
- -Poor repertoire pattern (PR): movements that are monotonous in character and do not occur in a complex way [17,18];
- -Cramped–synchronized pattern (CS): all limbs and muscles contract and relax almost simultaneously; the movements lack the normal smooth and fluent character and are rigid [17,18].
In the case of the GMA exam performed at 12 weeks CA, the presence or absence of the fidgety movements was noted. Fidgety movements, the normal GMA pattern at that age, consist of movements of small amplitude, at moderate speed and variable acceleration of the neck, trunk, and limbs in all directions continually [17,18].
In the case of ATNAT and the Amiel-Tison neurologic examination of the infant and child, the exam has been scored as optimal and non-optimal based on the guidelines provided by the authors. An optimal result is represented by the absence of the neurologic signs, while a non-optimal result consists of the presence of neurologic signs (abnormalities of tone and excitability noted with 1 or 2 in the examination chart) [9,12,13]. The scores for certain individual items that were considered important were noted separately to be investigated for prognostic values (also graded as normal and abnormal according to the examination charts) [9,12,13]. Three items were selected as representative for the passive tone of the upper limbs (scarf sign), passive tone of the lower limbs (popliteal angle), and active tone (pull-to-sit and reverse maneuver) [12,13] (Table 2).
Regarding the actual procedure for scheduling the visits, all the appointments were organized according to the calendar for follow-up—see above (Table 1). The following visit was programmed at the end of each consult. The patients could reschedule the visit themselves, by phone call, within a two-week interval from the programmed visit. If they did not come to the programmed visit, we tried to reschedule by a phone call, and if they did not answer or did not come to the visit, they were considered lost to follow-up.
2.3. Outcome Measures
The outcome measures of the study were represented by the occurrence of independent sitting, independent walking and a diagnosis of cerebral palsy. In the case of the independent sitting, we noted both the presence of the gross motor acquisition and the age of occurrence/defined as normal according to the examination guidelines of Amiel-Tison and Gosselin [13].
Independent sitting has been defined as the capacity to sit without support for at least 15 s [13]. The optimal age of acquisition was considered to be before 9 months CA; if independent sitting occurs after that age, it is considered delayed [13].
Independent walking has been defined as the capacity to take a minimum of 5 unaided steps [30,31]. The optimal age of acquisition was considered before 18 months CA. If independent walking occurs after that age, it is considered delayed [13].
The ages of attainment of the motor milestones, including sitting and walking, were not based on parental recall, but on whether the milestone was present or absent at scheduled visits. Classification as “delayed sitting” (beyond 9 months CA) and “delayed walking” (beyond 18 months CA) was operationalized based on absence at the nearest scheduled visits (7/12 months for sitting, 12/18/24 months for walking).
Cerebral palsy is diagnosed according to the 2006 definition as a permanent disorder of movement and posture causing activity limitation that is attributed to non-progressive disturbances occurring in a developing brain [32]. The diagnosis is established by the neurologist as soon as possible, but usually not before 12 months of age. CP cases are classified according to the European CP initiative as spastic, dystonic, or ataxic, with unilateral or bilateral involvement, and with predominance in the upper or lower limbs, or all four limbs equally involved [33]. The GMFCS (Gross Motor Function Classification System) grade is also established as soon as possible, usually at the age of 2 years CA [2].
2.4. Statistical Analysis
In order to avoid interpretation bias [34], the statistical analysis was performed by a person from another center, with no medical background, who had not been part of the data collection team (LR). The statistician received the anonymized database together with the research questions.
The data were centralized in a SPSS 18.0 database and processed with the statistical functions for which they are suitable, at a significance threshold of 95%.
The variables investigated (independent and outcome variables) were of a binary type (usually having two possible values—normal and abnormal). In order to assess the association between the results of the examinations and the risk of CP, a delayed sitting and delayed/absent walking Chi-Square test was used [35].
The analysis of the prediction was performed for the entire study sample (70 patients) and for the subgroup of preterm neonates (62 patients).
In order to investigate the value of different signs and results of the examination in prediction of the outcome variables, we used the Odds Ratio and the ROC curve—a visual representation of model performance that allowed us to identify the strongest predictors. Collinearity diagnosis was performed for all the variables and found values of VIF (variance inflation factor) <5 for CP, sitting, and walking, showing that the multicollinearity does not influence the variance of the regression coefficients.
In order to predict the probability of the binary outcomes, we attempted to use binary logistic regression models [36]. The stepwise forward selection method was used to build the models. For each statistics model, there was calculated the Odds Ratio (Exp (B)) for each predictor. We considered a model for which the p-value was <0.05 to be statistically significant.
3. Results
3.1. General Characteristics of the Population
As noted above, the study included only the infants who fulfilled all the visits until 2 years of age. Thus, the study cohort comprised 70 infants, of whom 62 were preterm and 8 had other conditions not related to prematurity and were almost all born at term (see below). The gender ratio was 30 male/40 female. Forty-five of the infants were singletons, twenty two were twins (eleven pairs), and there was one set of triplets (3 female neonates). The mean gestational age was 33.9 weeks, with a median of 34 weeks (standard deviation ± 2.835 weeks), minimum gestational age of 27 weeks and a maximum of 40 weeks. The overall incidence of CP was 15/70 in the whole group, with an incidence of 11/62 in the preterm neonates group and 4/8 in the group of neonates born at term.
The distribution by gestational ages of the subgroup of preterm infants can be seen in Figure 2. They are plotted as separate groups: the infants that were followed until 24 months corrected age and the infants lost to follow-up.
There has been a 100% rate of return for the neonates at term and 89.85% (62/69) in the preterm group. In the case of preterm infants lost to follow-up, the baseline characteristics did not differ significantly from the infants that completed the follow-up until 2 years: a mean gestational age of 33 weeks with standard deviation ± 2.94 weeks for the infants lost to follow-up, 32.82 weeks preterm infants that completed the follow-up, gender ratio: three male/four female in the group of patients lost to follow-up (not significantly different from the preterm infants group (24 male/38 female). During the follow-up process, at 12 weeks CA, 5/6 presented with fidgety movements and optimal results for the AT examination and at 7 months, 4/5 exhibited independent sitting and normal AT exam and 1/5 presented with a popliteal angle < 90°—this was reported for the same patient that did not present with fidgety movements at 12 weeks CA.
The non-preterm group was composed of infants with malformations and genetic conditions; the characteristics of the group could be examined in Supplementary Material Table S1. Due to the small size of the group, a statistical analysis was not performed.
3.2. GMA Examination at TEA and 12 Weeks Corrected Age—Analysis of the Whole Group
At the GMA examination performed at TEA, 16 patients presented with a normal GM pattern, 37 with a poor repertoire (PR) pattern, and 17 with a cramped–synchronized (CS) pattern. At the examination performed at 12 weeks CA, in 55/70 patients, the fidgety movements were present, and in 15/70, fidgety movements were absent.
Even if the analysis of the whole group is presented in the following section, as stated above, it should be acknowledged that the whole group lacks homogeneity, and even if the number of non-preterm infants (patients with congenital and genetic conditions) is small, the heterogeneity of the group should result in caution when analyzing and generalizing the results. The decision to analyze the whole group first has been motivated by the fact that we attempted to investigate the correlation of the examinations and the predictive value in a sample consisting not only of preterm infants but also of infants of different gestational ages and pathologies in order to increase the value of the results. Also, this was our initial study protocol, and not analyzing the whole sample could be considered to cause selection bias. Accordingly, we decided to analyze the whole group and to acknowledge the limitations in this case.
There has been a very good correlation between the GM patterns at TEA and 12 weeks CA (Table 3), with all the patients with normal GM patterns at TEA progressing to a pattern with present fidgety movements and 6/17 of the patients with CS and 9/37 patients with PR showing no fidgety movements at 12 weeks CA.
When separately analyzed, the GM patterns at 40 weeks GA did not have a statistically significant association with the occurrence of CP. The absence of the fidgety movements at 12 weeks CA was a statistically significant predictor for the appearance of CP (Table 4).
Regarding delayed sitting (Table 5) and delayed/absent walking (Table 6), a statistically significant association has also been found in the case of absent fidgety movements.
3.3. Amiel-Tison Neurologic Evaluation at TEA and 12 Weeks CA—Whole Group
The ATNAT performed at TEA was considered optimal in 47/70 cases. The result was considered suboptimal in 23/70 cases. The Amiel-Tison neurologic examination at 12 weeks CA had an optimal result in 42/70 cases and was graded as non-optimal in 28/70 infants.
Both the final result (synthesis) of the ATNAT and the values of the items (scarf sign, popliteal angle, axial tone) had a statistically significant association with the occurrence of cerebral palsy and delayed/absent walking (Table 4 and Table 6). In the case of the delayed sitting, an abnormal popliteal angle at TEA has been associated with delayed sitting (Table 5), while the other items of ATNAT did not show any statistically significant association with this delay.
In the case of the Amiel-Tison evaluation performed at 12 weeks CA, we have also identified statistically significant associations between the final result/synthesis and the selected items and the appearance of CP and delayed walking (Table 4 and Table 6). As in the case of ATNAT, an association with delayed sitting has been observed in the case of the the final result (synthesis) of the exam findings (Table 5), in addition, a statistically significant association has been found in the case of abnormal axial tone.
3.4. Correlation Between GMA and Amiel-Tison Examinations—Whole Group
In the case of the examinations performed at TEA, there is a statistically significant correlation between the GM patterns and an abnormal popliteal angle, abnormal axial tone and the result (synthesis) of the AT examination (synthesis) (Table 7).
In the case of the examinations performed at 12 weeks CA in the whole group of patients, there has been a good correlation between the GM and Amiel-Tison examinations, both in the case of the synthesis of the results and of the selected items (Table 8).
3.5. Logistic Regression Models
3.5.1. Logistic Regression Models—Whole Group
In order to investigate which items from the examinations could be statistically significant predictors for the risk of cerebral palsy (CP), we calculated the odds ratios (Supplementary Material Table S2.1) and the ROC curves (Figure 3) for the different GM patterns in the case of GMA, for the selected items and final results (synthesis) in the case of the Amiel-Tison examinations. In the case of the GM patterns at TEA, we compared the patients with a normal pattern with those with an abnormal one (cramped–synchronized and poor repertoire). In the case of the GM examination at 12 weeks CA, we compared the patients with fidgety movements with those in whom the fidgety movements were not present. In the case of Amiel-Tison’s examination at TEA and 12 weeks CA, we compared the patients with normal/optimal findings with the patients with abnormal/non-optimal results.
The ROC curve for the risk of CP of the whole group showed that all the variables had an AUC > 0.6000, with the strongest individual predictor being the GM pattern at 12 weeks CA (AUC = 0.873; CI 95%: 0.748–0.998; p < 0.001), followed by the synthesis of the Amiel-Tison exam at 12 weeks CA (AUC = 0.755; CI 95%: 0.617–0.892; p < 0.003), Amiel-Tison exam at 12 weeks CA—popliteal angle (AUC = 0.733; CI 95%: 0.580–0.887; p < 0.006), and the synthesis of the Amiel-Tison exam at TEA (AUC = 0.715; CI 95%: 0.561–0.869; p < 0.011) (Figure 3).
In order to check if a combination of items from the two examinations resulted in a better prediction for the risk of CP than each examination alone, we attempted to develop binary logistic regression models for the exams at TEA and 12 weeks CA. The models can be found in Appendix A, Table A1—Exams at TEA; and Table A2—Exams at 12 weeks CA. As previously stated, given the sample size and event counts, these multivariable models are exploratory and prone to overfitting. In the case of risk of CP, the predictive models showed statistical significance only for the independent variables GM pattern at 12 weeks (absent fidgety movements) + AT12 weeks scarf sign (abnormal) and AT12 weeks popliteal angle (abnormal) (p < 0.032) (model 3 Table A2—Appendix A).
Similar models of risk were attempted for the delayed acquisitions of gross motor skills (sitting and walking).
In the case of delayed sitting, significant odds ratios were found for the abnormal popliteal angle at TEA and for absent fidgety movements and abnormal axial tone and non-optimal Amiel-Tison examination (synthesis) at 12 weeks CA (Supplementary Material—Table S2.2).
The ROC curve showed the following good predictors of delayed sitting (AUC > 0.600): GM examinations at 12 weeks CA—absent fidgety movements (AUC = 0.804; CI 95%: 0.666–0.943; p < 0.001), synthesis of AT exam at 12 weeks CA (AUC = 0.754; CI 95%: 0.622–0.886; p < 0.001), AT exam at 12 weeks CA—popliteal angle (AUC = 0.709; CI 95%: 0.561–0.858; p < 0.008), synthesis of the AT exam at TEA (AUC = 0.690; CI 95%: 0.541–0.839; p < 0.017), AT exam at TEA—popliteal angle (AUC = 0.663; CI 95%: 0.508–0.819; p < 0.040) (Figure 4).
The binary logistic regression models built for the risk of delayed sitting, both at TEA (Table A3) and 12 weeks CA (Table A4), did not show any predictive value.
In the case of delayed/absent walking, the odds ratios were significant for GM evaluation at 12 weeks CA, abnormal scarf sign at TEA and 12 weeks CA, abnormal popliteal angle at TEA, and 12 weeks CA and abnormal axial tone and synthesis of the Amiel-Tison exam, both at 12 weeks CA (Supplementary Material S2, Table S2.3).
The ROC curve revealed the following factors as good predictors of delayed/absent walking (AUC > 0.600): GM 12 weeks CA—absent fidgety movements (AUC = 0.804; CI 95%: 0.666–0.943; p < 0.001), synthesis of the AT exam at 12 weeks (AUC = 0.754; CI 95%: 0.622–0.886; p < 0.001) and AT exam at 12 weeks CA—popliteal angle (AUC = 0.709; CI 95%: 0.561–0.858; p < 0.008), synthesis of the AT exam at TEA (AUC = 0.690; CI 95%: 0.541–0.839; p < 0.017), AT exam at TEA—popliteal angle (AUC = 0.663; CI 95%: 0.508–0.819; p < 0.040) (Figure 5).
We did not identify any statistically significant binary logistic regression models in the case of delayed/absent walking for the whole group (Table A5—examinations at TEA and Table A6—examinations at 12 weeks CA).
3.5.2. Logistic Regression Models—Subgroup of Preterm Infants
Subsequently, we attempted to build logistic regression models for the risk of CP, delayed sitting, and delayed/absent walking in the subgroup of the 62 preterm infants.
In the case of risk of CP, the odds ratios can be viewed in Table 9. The following statistically significant predictors were found for the GMA—absent fidgety movements at 12 weeks CA (p < 0.001), for the AT exam at TEA—abnormal scarf sign, abnormal popliteal angle, non-optimal result at the synthesis of the exam (p < 0.002). In the case of the AT exam at 12 weeks CA, an abnormal scarf sign and popliteal angle and the abnormal synthesis were found to be statistically significant predictors (p < 0.001).
In the subgroup of preterm neonates, the ROC curve revealed the following variables to be good predictors for CP (AUC > 0.600): the absence of fidgety movements at the GM exam at 12 weeks CA (AUC = 0.925; CI 95%: 0.819–1.031; p < 0.001), the synthesis of the AT exam at 12 weeks CA (non-optimal/abnormal) (AUC = 0.772; CI 95%: 0.620–0.924; p < 0.005), AT exam at 12 weeks CA—popliteal angle (abnormal) (AUC = 0.766; CI 95%: 0.599–0.932; p < 0.006), AT exam at 12 weeks CA—scarf sign (abnormal) (AUC = 0.763; CI 95%: 0.572–0.954; p < 0.007), the synthesis of the AT exam at TEA(non-optimal/abnormal) (AUC = 0.756; CI 95%: 0.589–0.923; p = 0.008), and AT exam at TEA—popliteal angle (abnormal) (AUC = 0.684; CI 95%: 0.495–0.874; p < 0.05) (Figure 6).
The tables of binary regression models for the risk of CP for the subgroup of preterm infants can be consulted in Appendix A (Table A7—examinations at TEA and Table A8—examinations at 12 weeks CA). No statistically significant model has been identified in the case of the examinations at TEA; two models that reached statistical significance have been identified at 12 weeks CA—one of the models included all the parameters investigated: GM exam at 12 weeks CA (absent fidgety), AT exam at 12 weeks CA—scarf sign (abnormal), AT exam at weeks CA 12—popliteal angle (abnormal), AT exam at 12 weeks CA—axial tone (abnormal) and the synthesis of the AT exam at 12 weeks CA (non-optimal) (p = 0.046; model 5). The odds ratio (Exp(B)) = 16.699, CI 95% = 1.851–327.783.
In the case of delayed sitting, the odds ratios can be observed in Table 10. The only two items associated with a statistically significant risk were the GM pattern at 12 weeks CA (p < 0.007) and AT exam synthesis at 12 weeks CA (p < 0.048).
The ROC curve (Figure 7) confirms the following factors as good predictors for delayed sitting (AUC > 0.600): GM exam at 12 weeks CA—absence of fidgety movements (AUC = 0.704; CI 95%: 0.515–0.893; p < 0.035). It showed no significance for the synthesis of the AT exam at 12 weeks (AUC = 0.661; CI 95%: 0.480–0.843; p < 0.095), the AT exam at 12 weeks CA—popliteal angle (AUC = 0.600; CI 95%: 0.407–0.793; p < 0.302), the synthesis of the AT exam at TEA (AUC = 0.645; CI 95%: 0.457–0.834; p < 0.133), and the AT exam at TEA—popliteal angle (AUC = 0.629; CI 95%: 0.435–0.823; p < 0.182).
The tables of binary logistic regression models for the delayed sitting—subgroup of preterm infants, can be consulted in the Appendix A (Table A9—TEA and Table A10—12 weeks CA). No statistically significant model has been identified for the exam at TEA. At 12 weeks CA, two models reached statistical significance, one of the models included, as in the case of the risk of CP, all the investigated items: GM exam at 12 weeks CA (absent fidgety), AT 12 scarf sign(abnormal), AT 12 popliteal angle (abnormal), AT 12 axial tone (abnormal) and AT 12 synthesis (non-optimal) (p = 0.001; model 5), the odds ratio (Exp(B)) = 3.191, IC95 = 1.907–5.341.
The odds ratios for the predictors of independent walking can be viewed in Table 11.
The ROC curve (Figure 8) revealed the following factors to be good predictors of delayed/absent walking (AUC > 0.600): GM pattern at 12 weeks CA (absent fidgety) (AUC = 0.826; CI 95%: 0.678–0.974; p < 0.001), synthesis of the AT exam at 12 weeks CA (AUC = 0.768; CI 95%: 0.624–0.912; p < 0.002) and AT exam at 12 weeks CA—popliteal angle (AUC = 0.728; CI 95%: 0.566–0.890; p < 0.010), the synthesis of the AT exam at TEA (AUC = 0.717; CI 95%: 0.555–0.880; p = 0.014), and the AT exam at TEA—popliteal angle (AUC = 0.667; CI 95%: 0.493–0.841; p < 0.050).
The tables for the logistic regression models for delayed/absent walking can be consulted in Appendix A (Table A11 and Table A12). No model has reached statistical significance.
As in the case of the analysis of the whole group, due to the small sample and the small number of patients with CP, these models are exploratory and prone to overfitting, and more studies on larger samples of patients are needed before we can attempt to generalize the findings.
3.6. Predictive Values and Regression Models—Synthesis and Comparisons
The following tables are a synthesis of the predictive values of the individual items/examinations (Table 12) and the binary logistic regression models (Table 13).
The absence of fidgety movements at 12 weeks CA was the strongest individual predictor for all the outcomes, followed by the synthesis of the Amiel-Tison examinations at TEA and 12 weeks CA (predictor for all the outcomes except delayed sitting in the group of preterm infants). For the risk of CP, an abnormal popliteal angle at 12 weeks CA was a good predictor in the case of the whole group and subgroup of preterm infants, while an abnormal scarf sign was a good predictor for CP in the preterm infants group. Abnormal popliteal angles both at TEA and 12 weeks CA have also been identified as good predictors for delayed/absent walking.
An atypical GM pattern at TEA (PR or CS) was found to be predictive for all the outcomes except delayed sitting in the whole group (Table 13).
The combination of features of the GMA and AT examinations at 12 weeks CA is a predictor for the diagnosis of CP, both in the whole group and the subgroup of preterm infants, and for delayed sitting in the subgroup of preterm infants. The absence of fidgety movements has also been the strongest individual predictor in the logistic regression models (Table 13).
Table 14, Table 15 and Table 16 present a synthesis of the sensitivity, specificity, positive predictive value, negative predictive value, and AUC for certain predictors for the risk of CP, delayed sitting, and delayed/absent walking in the case of the whole group and the subgroup of preterm infants.
4. Discussion
4.1. Interpretation of Main Results
Our research showed that in a population of preterm infants and infants with different congenital conditions, the neurologic examinations performed at TEA and 12 weeks CA had a good predictive value in the early identification of infants at risk of CP and gross motor delay. Combining the results of GMA and AT examinations at 12 weeks CA could improve the early detection of these pathologies.
Even if a couple of the regression models showed statistical significance, due to many limitations—see the section below for more details on the limitations of the study—the results should be regarded with caution, and the reader is advised not to over-interpret the odds ratios—this is one of the reasons why the tables showing the odds ratios have been moved to the Supplementary Materials Section.
A good individual predictor for all the outcomes followed has been the absence of the fidgety movements at 12 weeks CA. Indeed, the GM evaluation has the greatest predictive value for the risk of CP [4] during the first few months of life and has been a topic of many studies showing its capacity to early identify the infants at risk of CP [17,24], as well as other outcomes [37] and has proven its value both in preterm infants and infants with other neurologic or genetic conditions [38]. We were surprised by the fact that we did not find a predictive value for the GM evaluation at TEA, but when separating the outcomes as normal (writhing pattern) and abnormal (PR and CS), the GM evaluation at term is a predictor for all the outcomes in the logistic regression model. An explanation could be that CS, but especially PR observed at TEA is known to develop into either the absence or presence of fidgety movements, while the normal pattern at term is always followed by the occurrence of fidgety movements [22]. This is probably why the GM exam at 12 weeks CA is a better predictor of outcomes than the GM exam at TEA.
Regarding the predictive values of the synthesis of the AT exam at TEA and 12 weeks CA, these have been good predictors for all the outcomes, except for delayed sitting in subgroup of preterm infants. Items involving the passive tone, especially popliteal angle at 12 weeks CA, were also predictors of the risk of CP, delayed sitting, and delayed/absent walking. The explanation of this finding is that in the category of preterm infants that represented the majority of the study sample, the type of CP observed is usually spastic, particularly involving the legs [4,39,40], and abnormalities of the tone of the legs are the first signs that appear in these situations [41].
In a study investigating the inter-observer reliability of the items of the AT exam, even if the reliability was good for both the synthesis and the individual items, the lowest reliability has been found for the pull-to-sit maneuver [14]. This could be an explanation for the lowest predictive value of this item in our group for almost all the outcomes investigated.
We identified logistic regression models that reached statistical significance for the risk of CP, both in the whole group and in the subgroup of preterm infants and models predictive of delayed sitting in the subgroup of preterm infants. All the models combined the findings of the examinations at 12 weeks CA. The absence of fidgety movements and an abnormal passive tone (abnormal scarf sign and abnormal popliteal angle) are part of all the models. In the case of the preterm infant subgroup, an abnormal synthesis of the AT exam at 12 weeks is also a part of the model. The explanation for these findings resides in the pathophysiologic context of the CP. The lesions that determine CP, especially in former preterm infants, occur at the level of the motor pathways [42]. The ability of the GM patterns, and especially of (absent) fidgety movements, to identify the patients at risk of CP could be explained by the relation between the GM patterns and the integrity of the motor pathways. Thus, even if GMs originate subcortically [17], the variability and modulation of these movements occurs at a cortical level [17,20,21], giving them a normal appearance (pattern). In the case of patients with CP, the interruption of the motor pathways and the destruction of the sub-plate neurons—the lesions that are responsible for the disease—also result in a lack of modulation of GMs and abnormal GM pattern results [20,21]. According to previous studies, fidgety movements could also represent a fine-tuning of the proprioceptive system [17,43]. Lesions within the motor pathways also lead to a release of the gamma moto-neurons from the cortical inhibition, which leads to hypertonicity and exaggerated monosynaptic reflexes [41]. Thus, these signs (increased passive tone of the arms and legs and absence of fidgety movements) are all early markers of the lesions at the levels of the motor pathways and abnormal proprioception. Their identification is usually an early warning sign for the risk of CP [9,12,17]. The absence of the synthesis of the AT examination from the predictive models for all the groups could be explained at least in part by the results of the AT examination at 12 weeks CA in patients with congenital neurologic conditions and syndromes (see Supplementary Material S1). The synthesis of the AT examination is abnormal in half of the patients with CP and half of the patients without CP. Even if the number of patients in this subgroup is small and no reliable statistical analysis can be performed because of this, one should be aware that an abnormal/non-optimal neurologic exam in this group may not solely be due to tone abnormalities, since the involvement of the central nervous system can occur in different ways in these conditions [9,13]. It is again to be mentioned that the models are just exploratory and should not be over-interpreted.
4.2. Comparison with Previous Studies
It is already known from the medical literature [17,18] that there is a good correlation between the GM movement patterns (writhing, poor repertoire and cramped–synchronized) at TEA and the GM movement patterns identified after 9 weeks of CA (based on the presence or absence of fidgety movements). This was also the case in our research (see Table 4), and, as already demonstrated, the correlation is stronger for normal results of the examination [22]—all the patients with normal GM patterns at TEA presented with fidgety movements. We found, though, a statistically significant correlation between the GM patterns at TEA and the ATNAT results (Table 7). The correlation was better for the normal/optimal results. This good correlation between the results of the AT exam and the GM patterns has also been found at 12 weeks of CA. These good correlations have also been found by other groups [25,26] and in our opinion, could be explained at least in part by the fact that both examinations, even if they have different physiological backgrounds, are based on the concept of optimality [11,12,13,18]. The correlation has been observed not only for the synthesis of the AT examinations, but also for the selected items.
Integration of two examination techniques in order to increase the predictive power for the neuro-motor outcomes has been a topic of research before—Romeo and co-workers demonstrated that integrating the findings of GM assessment and HINE offers a better prediction for CP than either of the examinations alone [27]. The novelty of our approach, even if the small number of subjects requires the results to be regarded with caution, is to integrate findings of the two examinations in binary logistic regression models.
4.3. Limitations, Strong Points, and Future Directions
We are aware of the important limitations and weak points of the study. The most important one is the small sample studied. This is why we advise caution in interpreting the results, and we consider that a validation study in a larger and more homogenous group is needed. The small number of cases results in a limitation of the validity and value of the logistic regression models. With only seventy infants overall and fewer than twenty cases of cerebral palsy, the logistic regression models are inevitably underpowered and vulnerable to unstable estimates. This affects the reliability of the wide confidence intervals observed in some models.
Another limitation is represented by the inhomogeneous sample, with preterm infants and infants with congenital neurologic conditions being analyzed together. Although we considered this to be a limitation, we consider that including term neonates and neonates with congenital conditions in the group could increase the value of the findings, showing the predictive value of the tests in children with different pathologies. Of course, the groups needed also to be analyzed separately. The small size of the group of infants with congenital conditions did not allow for a correct statistical analysis, but we analyzed the group of preterm infants separately. Another weak point could be the high incidence of CP both in the whole group (21.42%) and in the subgroup of former preterm infants (17.74%). Even if it is known that the incidence of CP is higher in low and middle-income countries [39] than in high-income ones [40], the incidence in our sample is much higher than that reported—21.42% vs. 3.4‰ [39,40]—which is also true for the preterm infants subgroup. This could represent an involuntary selection bias, because patients with neurological problems have the tendency to attend follow-up visits more frequently than those perceived as having normal development [44]. The high incidence of CP and potential selection bias limit the generalizability of predictive values, and the models should be considered exploratory. Even if the CP has been classified at diagnosis, due to the small number of cases in each GMFCS category, we did not perform an analysis stratified on each subtype of CP and GMFCS grade.
The rate of loss from the follow-up was 9.09% of the whole group, with 0% in the term neonates and 10.14% in the preterm infants group. We decided from the beginning not to include in the study patients who did not complete the full program of visits, and this fact should lead to greater caution when it comes to over-interpreting the results. We believe that the high return rate in the term neonates is due to the severity of their conditions and the ability of our team to counsel them appropriately. In the case of the preterm infants, though, there has been, as expected, a drop-off rate from the program, but this has not been so high, although this fact is necessary to mention and certainly is a factor limiting the value of the findings.
We identified a number of confounding factors that could be considered in the interpretation of the data. Although the AT and GMA examinations were performed by different persons, a risk of interpretative bias could be present, because the persons performing the examinations were aware of the history and the results of the imaging studies of the patients. Also, both the examiners were trained in both methods—AIT had 20 years of experience performing AT examinations and received formal training and certification for GMA in 2023, while ARB had 10 years of experience in AT exam and also received training and certification in GMA in 2023. We did not cross-check for the inter-observer reliability, but GMA examination is known to have a very good inter-observer concordance [17,18,23,24]. This has also been the case for the AT examination (see above for the discussion in the Introduction; also see reference [14]). In our group, the AT examinations were performed by AIT and the GMA exams were scored by ARB. The interpretation bias could, however, result from the fact that both examiners were trained in the two methods, and this could induce an involuntary interpretation of the results based on observing other signs/items from the other technique, more on the side of the examiner that performed the AT examinations.
Our cohort of patients consisted of term and preterm neonates. Even if we analyzed them together, as stated above, the neonates with genetic conditions/CNS malformations are different from the former preterm infants regarding the type of lesions they have and the risk of CP [42,45,46]. This could be a confounding factor and this is why we attempted to separate the group of preterm infants from the whole group—they are more homogenous regarding the pathology and the type of CP encountered [42].
The gestational age distribution of our cohort is also slightly different from the studies that compared AT with GM examinations [25,26]. Our cohort has 4/62 patients with gestational ages of less than 30 weeks, while the cohort of Paro-Panjan and co-workers had 8/45 patients with GA of less than 30 weeks—in both cohorts, about half of the patients had a GA between 33 and 36 weeks. Thus, our cohort has a lower risk of adverse outcomes, but the data about the AT and GA examinations were comparable, because in both cases, the examinations were performed at TEA and 3 months CA/12 weeks CA in our study. Regarding the duration of follow-up, the previously cited cohorts surveyed the patients until 12–15 months corrected age [25,26] while our sample was followed until 24 months corrected age—it is known that a longer surveillance period is better because it allows more infants with anomalies to be identified [29,44]. It is known that the risk of CP and motor deficits is higher at lower gestational ages [39,40]. Accordingly, there could be two kinds of confounders—the higher gestational age in our sample, which would result in a lower number of infants at risk of CP and motor delays, and the longer interval of follow-up in our research, which could help to identify more infants with abnormalities than in the case of a shorter follow-up period up of 12–15 months (as in the previous cohorts).
Regarding the logistic regression models, given the sample size and event counts, these multivariable models are exploratory and prone to overfitting. This statement should be strongly considered when analyzing the predictive value of the models and the general value of the study.
There are, though, in our opinion, a couple of strong points of this research. The first one is represented by the methods used to avoid interpretation bias [32] (see the Methods Section). The persons performing the GM and AT examinations were different and were not aware of the results of each other’s evaluation, and the person responsible for the statistics was form another center, with no medical background, and received only the anonymized database together with the research questions. Also, a strong point of the research is its examination of the same subjects by the two methods (GMA and AT), making the comparison of the results more reliable.
Another strong point of this research is the attempt to use a binary logistic regression model to provide an objective estimate of the predictive value of adding the two examinations together at TEA and 12 weeks CA. The use of binary logistic regression models required a simplification of the coding of the values of the exam in normal and abnormal cases [35,36] in order to obtain binary outcomes and to simplify the analysis.
In future, it is necessary to replicate this research in a multicenter study, with a larger number of patients and a more homogenous and larger gestational age distribution, in order to increase the value of the findings. Also, it would be worth investigating whether the use of the two examinations could predict other outcomes (fine motor, language, cognitive, behavioral).
5. Conclusions
GMA examination, and especially the absence of fidgety movements at 12 weeks CA, represents the strongest individual predictor of the occurrence of CP and of delayed gross motor milestones in our sample of term and preterm infants. The results of Amiel-Tison neurologic assessment at term (ATNAT) and Amiel-Tison neurologic exam of the infant at 12 weeks CA are also good predictors for the occurrence of CP and delayed sitting and delayed/absent walking, both in the whole group and in the subgroup of preterm infants. There was a good correlation between the results of the GMA evaluations and Amiel-Tison examinations, both at TEA and 12 weeks CA. Combining the findings of the GM and AT examinations suggests an improvement in the predictive performance for the detection of CP in the whole group and CP and delayed sitting in the subgroup of former preterm infants. Based on our findings, using the two examination techniques could enhance early detection of the infants at risk of motor abnormalities and refer them to appropriate early intervention programs, although this result should be validated in larger and more homogenous cohorts of infants.
All the findings of our study (especially the logistic models) pertain to this specific heterogeneous sample. Due to homogeneity issues, the findings that could be generalized (although with caution) are those based on the subgroup of preterm infants. Caution should, though, be used when interpreting the results of this research due to sample composition and size limitations, and larger studies are needed before robust conclusions can be drawn.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Morgan C. Fetters L. Adde L. Badawi N. Bancale A. Boyd R.N. Chorna O. Cioni G. Damiano D.L. Darrah J. Early Intervention for Children Aged 0 to 2 Years with or at High Risk of Cerebral Palsy International Clinical Practice Guideline JAMA Pediatr.202117584685810.1001/jamapediatrics.2021.087833999106 PMC 9677545 · doi ↗ · pubmed ↗
- 2Palisano E. Rosenbaum P. Walter S. Russel D. Wood E. Galuppi P. Development and reliability of a system to classify gross motor function in children with cerebral palsy Dev. Med. Child Neurol.19973921422310.1111/j.1469-8749.1997.tb 07414.x 9183258 · doi ↗ · pubmed ↗
- 3Krägeloh-Mann I. Cerebral Palsy and Related Movement Disorders Aicardi’s Diseases of the Nervous System in Childhood 5th ed. Arzimanoglu A. Mac Keith Press London, UK 2018347374
- 4Novak I. Morgan C. Adde L. Blackman J. Boyd R. Brunstrom-Hernandez J. Cioni G. Damiano D. Darrah J. Eliasson A.C. Early, Accurate Diagnosis and Early Intervention in Cerebral Palsy Advances in Diagnosis and Treatment JAMA Pediatr.201717189790710.1001/jamapediatrics.2017.168928715518 PMC 9641643 · doi ↗ · pubmed ↗
- 5El Dib M. Massaro A.N. Glass P. Aly H. Neurodevelopmental assessment of the newborn: An opportunity for prediction of outcome Brain Dev.2011339510510.1016/j.braindev.2010.04.00420494536 · doi ↗ · pubmed ↗
- 6Volpe J.J. El-Dib M. Neurological Examination: Normal and Abnormal Features Volpe’s Neurology of the Newborn 7th ed. Volpe J.J. Elsevier Philadelphia, PA, USA 2025293323
- 7Sherrington C.S. The Integrative Action of the Nervous System Yale University Press New York, NY, USA 1948
- 8Dubowitz L.M.S. Dubowitz V. Mercuri E. The Neurological Assessment of the Pre-Term and Full-Term Newborn Infant 2nd ed.Mac Keith Press London, UK 1999
