Comparing imaging modalities and healing criteria for the estimation of fracture age: a systematic review
Charlotte G. Lawrence, Mark A. Williams, Charlotte Primeau

TL;DR
This systematic review compares imaging techniques used to estimate the age of fractures in clinical and forensic settings.
Contribution
The study identifies the lack of standardized criteria for fracture age estimation and highlights the potential of high-resolution 3D imaging.
Findings
Most studies use 2D radiography, primarily on pediatric populations.
Variation in healing stages and criteria makes comparison between methods difficult.
Histology is the gold standard but is destructive and postmortem.
Abstract
The estimation of fracture age is important for both clinical and forensic purposes. The objective of this systematic review is to identify the imaging modalities currently used in clinical and forensic practice for fracture ageing, with a view to appraise technologies used, patient demographics studied in research, and the healing stages and criteria defined to estimate time since injury. After conducting a bibliographic literature search, we identified 21 suitable publications for inclusion in the review. Comparison of the literature for fracture ageing found that most research has used 2D radiography, predominantly using an antemortem paediatric population. The remaining publications used MRI, CT, histology, and macroscopy. Although the most frequent number of radiographic stages assessed was six, no two methods assessed the same combination of features. Indeed, variation in the…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6- —http://dx.doi.org/10.13039/501100000266Engineering and Physical Sciences Research Council
- —West Midlands Police
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAutopsy Techniques and Outcomes · Trauma and Emergency Care Studies · Injury Epidemiology and Prevention
Introduction
Fractures are a common consequence of traumatic injuries, and their identification and analysis can yield information that is important for both clinical and forensic investigations. In a clinical environment, the diagnosis of fractures is essential to identify appropriate treatment pathways, and later monitor their effectiveness based on the progress of healing to consider potentially adjusting treatment [1]. Alternatively in forensic investigations, the detection of skeletal injuries can offer insights into the circumstances surrounding traumatic events and, when relevant, help establish cause of death. For both living and deceased individuals, this may be a single traumatic incident, or in cases of suspected abuse, can include repeated injuries sustained over an extended period [2]. Indeed, identifying a timeline over which injuries occurred based on extent or lack of healing can help clarify the persons involved; such as when a child or vulnerable adult spent time with different caregivers [3]. Such conclusions also yield important investigative implications for determining subsequent charges against the persons responsible (e.g., homicide, voluntary vs. involuntary manslaughter, infanticide, child cruelty, and neglect [4]).
Fracture ageing in both clinical and forensic contexts is extremely challenging. Although a collection of biologic features are associated with notable events in healing, the rate at which an individual progresses through these is variable, often with considerable overlap between phases [3, 5]. Biological variation such as age and health status are often cited to affect the temporality of healing [5–7], however the small volume of published data on each demographic group renders it difficult to quantify this effect. Severity and context of the fracture origin may also affect the progression of healing and therefore age estimation. This is a particular issue in cases of abuse where individuals may experience a delay in their access to medical treatment, allowing the fracture to remain mobile as the bone continues to be used [3, 5, 8]. This can result in malunion of the fracture fragments or delayed healing [3, 5], and cause a single injury to exhibit several stages of healing simultaneously [9].
The purpose of this systematic review is to compare the imaging modalities currently used to estimate the age of bone fractures, with a view to identify future directions in fracture analysis. It will build upon the previous reviews by Prosser et al. [3] and Messer, et al. [8], which focussed on the radiological dating specifically of paediatric fractures and expand this to consider the imaging modalities themselves and their use on patients of all ages. Thus, this review will clarify which imaging modalities are currently used for estimating fracture age, the patient demographics studied, as well as the healing stages and criteria used by practitioners to define time since injury. The review also aims to capture a holistic overview of the imaging modalities and analytic approaches currently used for fracture ageing and envisage which modalities may form the focus of future research.
Materials and methods
Search strategy
A literature search of six electronic databases was performed in summer 2024, screening the earliest available records up until July 2024. Databases searched were Scopus, ProQuest, Web of Science, PubMed, Ovid (encompassing Embase and Embase Classic), and EBSCOhost (covering EBSCOhost eBook collection, Child Development & Adolescent studies, CINAHL, and MEDLINE). The following search query was used: (bone or skele*) and (ag? ing or dating or time or interval) and (fracture or injury or trauma or callus) and (heal*) and (postmortem or forensic). Asterisk () was used to enable truncated terms to be searched for words with multiple endings, such as heal* used in place of ‘heal’, ‘healed’ and ‘healing’. Question mark (?) was used to ensure words with variable spelling were included, with ag? ing used to retrieve results for both ‘aging’ and ‘ageing’.
Paper selection and data extraction
Papers were selected based on the following inclusion criteria:
- I.Be disseminated in a peer-reviewed English language publication,
- II.Be based on data from human populations,
- III.Have a focus on the methodology for ageing fractures (i.e., not a clinical or forensic case report),
- IV.Offer clearly defined stages or criteria for ageing fractures,
- V.Include a timeframe associated with fracture healing stages/criteria.
The workflow, following the Preferred Reporting Items for Systematic Reviews and Meta-analyses guidelines (PRISMA) [10], can be seen in Fig. 1. A total of 1,328 results were returned after input of the search terms across all six databases. The title and abstract of each publication were then screened to assess eligibility according to inclusion criteria, yielding 259 potentially relevant papers, from which 173 duplicates were manually removed. The 86 remaining texts were assessed for eligibility, and if their inclusion was uncertain, they were reviewed by a second author to reach a final decision. 10 of the 86 papers satisfied the inclusion criteria. The references within these were then screened, yielding a further 11 publications, totalling a final sample of 21 publications. The papers were then assessed to compare the year of publication, demographic information about the study sample and method of fracture ageing; including imaging modalities and the criteria defined to assess healing.
Fig. 1PRISMA flowchart of the publication selection method for this systematic review
Results
Context and demographics of the literature
The 21 peer-reviewed publications covered 37 years of research from 1987 until 2024 (Fig. 2). For the first 23 years, between 1987 and 2010, only four works were published, starting with O’Connor & Cohen [11] and their book chapter ‘Dating fractures’ within Kleinman’s 1987 book ‘Diagnostic Imaging of Child Abuse’. Between 2011 and 2023, however, a steady increase is evident, with a further 16 papers published, averaging 1.23 papers per year.
Fig. 2. Bar chart illustrating the number of papers published each year (in five-year intervals) included in this systematic review
Comparing the context and aims of the 21 publications, 19 papers [6, 12–29] studied a defined sample of fractures, while the remaining two book chapters [11, 30] drew on the existing literature and clinical experience to propose timetables for ageing. Of the 19 papers with a defined fracture population, 14 papers (73.7%) studied fractures in living patients [6, 13, 15, 17–21, 24–29], while five papers (26.3%) assessed fractures in postmortem cases [12, 14, 16, 22, 23]. Of the five postmortem samples, two were from archaeological human remains dating from the 17th to 20th centuries [12, 16], while three publications documented trauma from modern medicolegal samples [14, 22, 23].
Considering the age of the patients studied in each publication, 15 papers (71.43%) focused exclusively on a paediatric population, with ages ranging from birth until 18 years [6, 11, 15, 17–21, 23–25, 27–30]. A further 5 (23.81%) studied only adults, with ages ranging from 19 to 97 years [12–14, 22, 26]. One remaining publication did not specify patient age, although they can be presumed to be adult based on contextual information about the sample [16].
19 of the 21 publications specified the number of patients from which the ageing methods were developed [6, 12–29] (Fig. 3). The remaining two [11, 30] were book chapters from two different editions of Kleinman’s ‘Diagnostic Imaging of Child Abuse’ where the timetables of healing were developed from the author’s clinical experience, meaning no fracture sample was described. For the 19 publications in which patient number was specified, this ranged from 8 to 162, with a mean of 63.21 patients per study. Similarly, 13 papers specified the number of fractures assessed [6, 14, 16, 18, 20, 22–29], which ranged from 9 to 212 (mean 87.54). Lastly, 11 publications detailed the number of images reviewed in each study, such as the number of individual radiographs or histology slides [6, 13, 15, 17, 18, 20, 21, 24, 26–28]. This ranged from 31 to 707 (mean 241.91).
Fig. 3. Bar chart showing the number of patients, fractures, and images (e.g., radiographs, MRI scans, and histology slides) assessed in each publication
Finally, reviewing the region of the body studied (Fig. 4), most samples (n = 851/1289, 66.02%) were from long bones, especially the radius (n = 250, 19.39%). The frequency and percentage data reported here reflects 20 of the 21 papers in full. For the remaining paper [22], it was stated that 30 samples were combined from the clavicle, humerus, ulna, ribs, pelvis, patella, and tibia, but the number for each bone was not given (these form the ‘Unspecified’ column in Fig. 4). For the skull, 158 (12.26%) samples were assessed, meanwhile 32 (2.48%) samples were of the ribs.
Fig. 4. Bar chart showing the number of samples per anatomical region studied across all publications
Of the 19 papers specifying the bone fracture sample they studied, the exact age of the fracture was known in 14 of these, for example being a birth related injury, or through detailed clinical notes [6, 12–15, 17–19, 21, 23, 24, 26–28]. For a further three papers [20, 22, 29], it can be assumed that fracture ages were known based on descriptions of the study sample, but this is not definitively stated. De Boer and colleagues’ [16] study using archaeological samples did not specify whether the timing of fractures was known based on historical records. Lastly, Sanchez, et al. [25] did not know the original timing of injuries due to their focus on non-accidental rib fractures, for which clinical histories may have been incomplete or deliberately concealed.
Imaging modalities
Of the modalities used to assess fracture healing for ageing, traditional radiographic technologies were by far the most common, with 15 (71.43%) of the 21 published methods using only conventional 2D X-rays [6, 11, 15, 17–21, 24–30] (Fig. 5). One paper [16] compared 2D X-rays with histology, and a further paper [14] combined 2D X-rays with both histology and CT. Two papers used just histology [22, 23], and one paper applied only macroscopic methods to dry archaeological bone samples, with no imaging technology implemented beyond the sporadic use of a light microscope to verify macroscopic observations [12]. Lastly, one paper focused only on MRI [13].
Fig. 5. Chart illustrating the distribution of the 21 papers across imaging modality
To compare the single most used modality, a summary of all 17 publications using 2D projection radiography, and their associated fracture ageing stages, can be seen in Table 1. The most frequently reported features of healing have been identified and grouped into 7 overlapping and continuous stages, some containing further substages. For practicality, the remaining four modalities using macroscopy, MRI, and histology alone could not be included in the table due to their use of a different suite of healing features than the radiographic methods. In Table 1, to denote time, duration has been specified with d = days, w = weeks, m = months, and y = years. Names are marked with asterisks in the author column when the stages of healing defined in an earlier paper in this systematic review have been incorporated verbatim into the method of a later publication which is also included in this review (e.g., Cappella and colleagues’ [14] (2019) use of the healing stages previously defined by De Boer, et al. [16] in 2012). Under Timing, R is used to denote when timeframes were given as a range from the earliest to latest reported manifestation of the feature. P is the reported peak timing when a feature is most often observed, and M is used where the mean timing of a feature’s appearance was given. Where ‘All long bones’ is given under Fracture location, this means that the sample consisted of at least one of each long bone: humerus, radius, ulna, femur, tibia, and fibula. SPNBF refers to subperiosteal new bone formation. Due to the breadth of variation in the number and nature of healing features assessed across the 17 papers, the data presented in Table 1 represents a summary of the published timetables. This means that in some cases, the terminology titling each stage varies from the wording used in the original method. Additionally, to prevent Table 1 from becoming too large, features that were only studied in a single publication are omitted. Lastly, Hosokawa, et al. [19] studied a sample of 12 infants, seven of whom had no underling disease, while three had osteogenesis imperfecta, one had myotonic dystrophy, and one had arthrogryposis. The time until healing for each group was reported independently, so for consistency with all other publications (of which none studied patients with a known underlying condition), only the times reported for healthy children are included in Table 1.
Table 1. Summary of the literature on fracture ageing as visualised with 2D projection radiographyAuthorYearN patientsN fractureFracture locationPatient ageN stages/features assessedTiming (Range, Peak, Mean)Radiographically visible stages of fracture healingStage 1Stage 2Stage 3Stage 4Stage 5Stage 6Stage 7No skeletal healingFracture gap widening/granulationSPNBF/ periosteal reactionCallus formationReduction of fracture lineBridgingRemodelling1a. No healing1b. Soft tissue swelling1c. Resolution of soft tissues4a. Early/soft callus4b. Late/hard callus5a. Loss of fracture line definition5b. Sclerosis O’Connor & Cohen 1987N/AN/AN/A (Clinical experience)Paediatric6R2 – 21 d4 – 21 d10 – 21 d14 – 90 d10 – 21 d3 m – 2 yP4 – 10 d10 – 14 d14 – 21 d21 – 42 d14 – 21 d1 y Yeo & Reed 19942525Femur0 – 14 y3R9 – 14 d45 – 66 d15 – 23 dM11.7 d55.6 d17.2 d Islam, et al. 2000141205Radius,Ulna1 – 17 y8R3 – 7 w2 – 14 w2 – 14 w5 – 14 w3 – 11 w5 – 14 w4 – 14 wP4 – 6 w4 – 7 w4 – 7 w13 w4 – 6 w13 w9 w Halliday, et al. 20113137All long bones14 d – 44 m6R1 – 10 d4 d8+ d20 – 106 d12 – 77 d Malone, et al. 2011107107Radius,Tibia0 – 5 y6R0 – 14 d4 – 50 d4 – 50 d15 – 75 d24 – 156 d24 – 93 d42 – 750 dM3.3 d21 d21 d38.4 d65.2 d43.9 d313.3 d Prosser, et al.* 20126382All long bones,Clavicle0 – 5 y6R1 – 31 d5 – 96 d12 – 66 d19 – 96 d19 – 300 d45 – 421 dP1 – 2 d15 – 35 d22 – 35 d≥ 22 d≥ 36 d≥ 36 d De Boer, et al.** 20122122All long bones, Sternum,Ribs,MetacarpalsNot reported6R10 – 12+ d15 – 17+ d21 – 28+ d2 – 3+ m Sanchez, et al. 20131623Ribs1 – 11 m6R0 – 1 w1 – 3 w3 – 5 w5 – 7 w9 – 11 w7 – 9 w Walters, et al.*** 2014131131Clavicle0 – 3 m2R0 – 7 d8 – 10+ d9 – 15+ d9 – 15+ d Kleinman & Walters 2015N/AN/AN/A (Clinical experience)Paediatric4R6 – 21 d9 – 21 d14 – 90 d10 – 21 dP10 – 14 d15 – 21 d21 – 42 d14 – 21 d *Fadell, et al. (using ) 201661Not specifiedClavicle0 – 2 m4R7 – 49 d11 – 61 d11 – 61 d20 – 63 d35 – 151 d1^st^ P11 d12 d12 d22 d49 d2^nd^ P42 d61 d61 d63 d59 d *Warner, et al. (using ) 20174059Humerus,Femur9 d – 12 m4R7 – 130 d9 – 130 d9 – 130 d15 – 130 d51 – 247 dP9 – 49 d9 – 26 d9 – 26 d15 – 67 d51 – 247 d Hosokawa et al. 201712Not specifiedFemurNewborn (healthy)3R9 – 23 d10 – 23 d16 – 32 d **Cappella, et al. (using ) 201989Skull,Ribs34 – 84 y6R10+ d15 – 17+ d21+ d2+ m *Mahmoud et al. (using ) 2020100Not specifiedLong bones (unspecified)1 – 18 y6R< 7 d8 – 35 d8 – 35 d≥ 36 d≥ 36 d≥ 36 d **Crompton, et al. (using *) 202130Not specifiedFemur1 – 33 m3R7 – 59 d15 – 198 d15 – 198 d26 – 198 d Viero, et al. 2021162212All long bones,Metacarpals,Hand phalanges,Metatarsals21 – 97 y7R1 – 82 d9 – 80 d15 – 80 d22 – 179 d22 – 179 d23 – 142 d30 – 160 d29 – 385 dM11 d23.5 d40.9 d68.6 d68.6 d51.4 d76.3 d147.7 d
Review of healing stages and assessment criteria
For the 17 papers using 2D X-ray methods (Table 1), the number of stages used to assess healing to estimate fracture age ranged from 2 to 8, with a mean of 5 and a mode of 6. The use of 6 or fewer stages to assess age was more common (Fig. 6), with only 2 timetables [20, 26] detailing more than 6 stages. Whether the timing of each stage was presented as a timetable to guide practitioners, or as more continuous data on when the bony features were observed by a majority of patients, varied by publication. Additionally, some studies focused only on a subprocess of healing, such as Yeo & Reed’s [29] provision of three stages only for the healing callus. Furthermore, just two papers [6, 26] defined a ‘no healing’ stage at the start of the timeline in their method, but a further two [25, 27] reported the period of time before healing was first detectable in the results (Table 1).
Fig. 6. Pie chart showing the number of healing stages assessed to estimate fracture age across the 17 papers using 2D x-rays. N numbers represent the number of papers using the specified number of stages
Comparing the data in Table 1, it can be seen across all 17 papers, that the only stage consistently reported in all methods was callus formation, with soft and hard callus distinguished as separate stages in 12 papers, and a single callus timeframe given in the remaining 5 papers. Periosteal reaction was then the second most frequently documented stage, described in 13 of the 17 papers. This was followed by remodelling, featured in 12 papers, and bridging which was noted in 10 papers. The remaining stages were infrequently reported, with loss of fracture line definition featured in 4 papers, no healing, fracture gap widening, and sclerosis each in 3 papers, and lastly soft tissue swelling and resolution of the soft tissues, which were both only documented in two papers.
However, the terminology used to name each healing feature (or stage) varied across the papers. To tabulate the data into Table 1, the most frequently used terms for each biological process were used. The ages given in Table 1 are therefore reported in the column for the stages that matches or best fits the process definition given in the original publications.
Considering the histological literature, the number of stages assessed was notably greater. Although Moirangthem, et al. [22] only described 4 stages, these were all situated within the first 5 days after injury. In comparison, De Boer, et al. [16] identified 13 histologically visible features associated with fracture age extending up to 1–2 + years after trauma, while Navqi and colleagues’ [23] give 14 features for infants aged ≤ 1 year.
For the one paper in which CT was used to age fractures, Cappella and colleagues’ [14] implemented the previously defined 13 stages of De Boer, et al. [16], and used CT in combination with 2D X-rays and histology to score a series of histomorphological healing features in a modern adult postmortem sample.
Lastly, one paper used MRI. Baron, et al. [13] identified 9 quantifiable features of healing, spanning 6 stages (or ‘phases’) and 3 substages for ageing.
Discussion
Limitations of the fracture demographics studied
Comparison of the literature immediately identifies some limitations in the generalisability of the ageing methods applied due to the bone samples from which the age stages were developed. Specifically, although all 21 papers, acknowledged the importance of accurately ageing injuries for forensic applications, the published methods of fracture ageing are predominantly of a clinical origin, which may not be fully representative of the challenges experienced in a forensic context. For example, many of the clinical radiographs are acquired to assess alignment of the fracture during treatment [31, 32], which may be assisted through use of immobilisation aids such as plaster casts [15]. These negatively affect image quality [15, 18], thereby potentially reducing the level of detail detectable on the radiographs from which to assess healing. Furthermore, victims of non-accidental injuries may have delayed or prevented access to medical treatment, and hence with no immobilisation, secondary microtraumas may occur at the mobile fracture site [24], and timelines of healing developed from immobilised fractures may therefore not be applicable [15, 27, 30].
Additionally, bone samples assessed in the ageing literature do not always reflect the areas of greatest forensic need. Walters, et al. [27] and Fadell, et al. [17], for example, studied birth-related clavicular fractures. This is because clavicular fractures are not immobilised with plaster casts, therefore mitigating issues of reduced image quality due to casting. Additionally, the fractures are of known age due to their occurrence at birth. However, these injuries are rarely ascribed to abuse and are therefore not necessarily representative of the injuries more frequently assessed for forensic purposes [25]. Furthermore, the clavicle undergoes a unique growth process, wherein it is formed through both endochondral and intramembranous ossification [33]. Since long bones develop through endochondral ossification only, this may potentially render healing studies on the clavicle as less generalisable to other bones [17].
The most frequently studied bone was the radius, which made up 19.39% of all samples when pooling the 20 papers. This is likely a reflection of the radius being the most frequently accidentally fractured bone during childhood [34–36], thereby offering a high number of clinical radiographs for research. The fracture locations most relevant for forensic purposes, such as to the rib cage and long bone metaphyses are often underrepresented, despite being among the most suspicious injuries for non-accidental abuse, especially in infants [2, 37, 38]. Indeed, ribs, as evidenced here, are one of the least studied bone locations, being assessed in only 4 publications [14, 16, 22, 25].
Limitations of the imaging modalities
Beyond the types of bone samples assessed, limitations are further seen throughout the literature due to the often-incomplete data on fracture ageing in the later stages of healing. Antemortem radiographic studies benefit from their ability to provide a more longitudinal insight into healing over time. The use of imaging is generally determined by clinical need with follow-up imaging in the later stages of healing often being minimal [25], as the clinical value does not justify the ionising radiation dose to the patient (e.g. [15, 17]). Healing progression is therefore well documented during the immediate post-injury period, but without regular imaging, it is difficult to ascertain precisely when each stage of healing becomes radiographically visible. This issue becomes increasingly pronounced through the later stages of healing [24].
This issue in data completeness is also observed in histological methods. Bone histology is widely accepted to be the gold standard of fracture ageing, with its superior resolution enabling features of bone healing to be visualised at a cellular level [23, 39, 40]. This is especially beneficial during the first weeks after injury, when fractures progress through a predictable cascade of cellular events that cannot be identified radiographically [14, 16, 23]. In the forensic context, this is especially beneficial as it allows greater precision in estimating the time elapsed between a trauma and death. However, like radiographic methods, the ageing of fractures using histology becomes increasingly variable, imprecise, and broad as time since injury increases (e.g. [16, 23]).
Being destructive, histology is also an exclusively postmortem analysis and can therefore only offer a snapshot into the status of healing at the time of death. This means that for antemortem purposes, a precise and reliable method of fracture ageing is still lacking. This is needed not only for investigations of suspected abuse [24, 27, 28], but also for clinical purposes to monitor the progress of fracture healing with a view to adjust treatment plans if the fracture has not achieved the expected level of healing within a given timeframe [1, 32, 41].
Accordingly, a primary objective of Baron and colleague’s [13] research using MRI was to develop a method of fracture ageing without the limitation of ionising radiation for living individuals. Promisingly, the results found that the early healing phases were characterised by greater definition between T1 and T2 sequences, likely reflecting the greater activity of soft tissues processes like inflammatory response in the initial weeks after injury. These values then decreased overtime in a pattern broadly reflecting the wider radiographic literature where estimates of fracture age become broader with increasing post-injury interval (e.g. [6, 24, 26]). However, in Baron and colleagues’ study [13], MRI was not used to visualise the more subtle features of healing such as loss of fracture line and bridging, which occur between the stages of callus formation and remodelling (Table 1). This may be because MRI is less sensitive than radiography for diagnosing subtle skeletal injuries like rib fractures and metaphyseal lesions [5, 42], although potential soft tissue indicators of fractures such as oedema can be seen [5, 42]. Furthermore, bone features identifiable on MRI may persist for weeks or months after initial appearance, reducing their ability to be precisely aged. As such, MRI may have limited value for ageing fractures beyond the acute timeframe [5].
Limitations of the ageing methods
Beyond the types of samples and imaging modalities assessed, the timetables developed from them for fracture ageing also face further limitations. Foremost is the lack of terminological standardisation which hinders comparison between methods due to uncertainty that the bony features used to assess healing are being described and diagnosed consistently across studies. This is compounded by differences in the combination of healing features described in each timetable. Although the most common number of stages assessed was six, no two six-point systems assessed the same combination of features. SPNBF, callus formation, and remodelling were most consistently reported, but even this was not unanimous. This continual introduction of new healing stages and definitions has often been justified under the provision that the research is preliminary and requires validation using a larger number of fractures [15, 29, 43]. However, to date just one paper was published with the explicit intent to validate a pre-published method; Cappella and colleague’s [14] test of De Boer, et al. [16]. As such, existing methods of fracture ageing are primarily based on a low volume of data and lack robust scientific validation from secondary research.
Further demonstrating this, Cappella, et al. [14] was the only study in this systematic review to use cross-sectional CT imaging for fracture ageing, which they used in combination with histology. Although they found that CT could offer a more accurate and precise estimate of fracture ages than 2D radiography, histology nonetheless remains the superior modality. They suggest therefore that CT may serve as a complement to histology, but that the precision of CT will remain limited as long as it must use stages of ageing developed from 2D radiography, which are often subjective and vague. Indeed, a fracture in their study was inaccurately aged possibly due to unclear definition of the term ‘union’ in De Boer and colleagues’ [16] original method. They also highlight further subjectivity and inconsistency more generally throughout the stages, which is a concern echoed broadly throughout the literature [5, 8, 44].
The subjective basis to many ageing methods may then account for the broad variability in observer agreement often noted within studies. The most critical conclusions were reached by Halliday, et al. [18], who found SPNBF to be the only one of six healing features reliably identified between observers, despite their high level of professional experience. Describing their results as “disappointing and surprising”, the authors conclude that the radiographic analysis of healing features is unreproducible, and advocate caution in their use for ageing fractures. Throughout the remaining publications reported levels of interobserver agreement more often ranged from moderate to excellent. This reflects variation between studies in the definitions of ageing stages, the combination of features assessed, level of observer experience, or byproducts of the imaging technology such as image quality. For the above reasons, ageing fractures using radiographic methods is therefore considered inexact, and are cautioned when used especially for forensic purposes [5, 7, 18, 45].
Outside of the systematic review
It is important to acknowledge that this is a systematic review, as opposed to a literature review. As such, while it did not purposefully exclude any imaging modalities or research publications, some were not captured due to the inclusion/exclusion criteria. For example, previous systematic reviews by Prosser, et al. [3] and later updated by Messer, et al. [8] focussed on the radiological dating of paediatric fractures. The former identified just three suitable publications to be included in their review [20, 29, 46], all of which acknowledged the continuous and overlapping nature of fracture healing. Prosser et al. [3] concluded fracture ageing to be an “inexact science”, reporting little consistent data between the three papers, and advocated the adoption of larger-scale studies with the aim to achieve a more consistent and standardised understanding of fracture healing. Later in 2020, Messer and colleagues [8] then reiterated this. Their updated review identified 10 publications [6, 17–20, 24, 25, 27–29], all of which are included in this current systematic review. Their conclusions similarly found variation both between the timelines of fracture healing, as well as research methodologies themselves. They also noted variation in the terminology and definitions used to assess healing radiographically and found that between the different studies there were differences in the ranges, mean and peak times, and first and last appearance dates of different healing features, all of which complicated comparison.
Perhaps part of the challenge in standardisation is that within the limited body of research, many methods draw upon and synthesise the stages of previously published methods. For example, the radiographic ageing stages defined by Hufnagl ([47] not included in this review due to being an unpublished thesis), provided the basis for Malone and colleagues’ [6] timetable, and in turn drew upon the stages outlined by Toal & Mitchell [48] and Hendrix [49] (the former excluded for using an animal sample, and the latter for not providing defined stages of healing). Tritella, et al. [50] similarly proposed the Adapted Fracture Healing Scale (AFHS) based on the earlier work of Malone, et al. [6], Prosser, et al. [24], and de Boer, et al. [16]. This AFHS was then later used by Viero, et al. [26], and is summarised in Table 1 accordingly. When trialled by five observers, the scale was found to perform well, with the highest level of agreement seen between radiographically trained practitioners, i.e., two radiologists and a forensic anthropologist, as compared to the pathologist and orthopaedist. Additionally, it was seen that agreement between observers improved when broader features of healing were assessed. This is demonstrated when categorising callus as simply present or absent, rather than attempting to differentiate between early ‘fluffy’ callus and well-demarcated mature callus [50]. In other words, consistency increased as specificity decreased.
Drury & Cunningham [43], however, provide one of the only direct comparisons of ageing estimates between three different methods [6, 20, 24]. Rather than attempt to derive a new method by synthesising pre-published works, the authors assessed the agreement of each method by using each to estimate the age of 112 long bone fractures across 96 children, where actual fracture age was unknown. They found that while none of the methods were in perfect agreement, the highest level of similarity was seen between Islam, et al. [20] and Malone, et al. [6]. Nonetheless, differences in results still included variation in the duration of the time range estimated, as well as the start and end dates of the range, with some estimates differing by almost 2 months. Although this highlights concern over variation in the time estimates achieved by each method, it is an important result that encourages similar research in the future to test the methods on fractures of known age, to better understand the accuracy and consistency between each method.
Outlook and directions for future research
It is important that future research remains aware of the challenges highlighted within this paper to help manage and potentially address them. Of high priority should be testing the methods on fractures of known age. This should be conducted similarly to Drury and Cunningham’s [43] comparison of agreement between three methods, but with a view to test their accuracy and precision, as well as consistency in the results they generate.
It would be beneficial to test the methods on a larger sample size of fractures- a need acknowledged in several of the papers in this review (e.g. [6, 13, 29]). Ideally this should include collecting data on different patient and fracture demographics to better understand the role of inter and intra-individual variation, such as the effect of fracture location and patient age on healing rate [5, 14, 26],. For example, infants are often reported to have a faster healing rate than older children and adults [3, 6, 7, 51, 52]. As early as 1988, Skak and Jensen [53] investigated the total time taken to achieve healing union in 275 diaphyseal femoral fractures from 265 children. While they reported a log-normal correlation between age of the patient and time required to achieve union, they acknowledge that this fit was only achieved through the exclusion of patients with delayed union and pseudoarthrosis from the teenage population. The authors suggested that this reflected the increasing variation in healing time with advancing age, and indicates a less predictable model of trauma ageing for older individuals [53]. Despite this, many fracture timelines, namely Yeo & Reed [29], Islam, et al. [20], and Mahmoud, et al. [21], have nonetheless been developed using a combined paediatric demographic, ranging from neonates, infants, and children to older teenagers. The combined timelines may therefore be skewed and not accurately representative of the healing rates of each age group [25]. As such, where sample size permits, it is advisable to study younger children separately from older children and adolescents, specifically isolating those aged ≤ 2 years, whom in the context of abuse commonly have fractures [7, 54–56], and therefore form the demographic on whom fracture ageing is most often performed for forensic purposes [28].
Additionally, immobilisation may affect healing rate as the fracture ends are already in contact, meaning that the extent of SPNBF and callus formation is reduced, and mature lamellar bone is deposited earlier [15, 57]. This has not been tested so any effect on ageing is not understood [15]. and it is not clear whether ageing methods developed from immobilised fractures are applicable to non-immobilised injuries. While the role of immobilisation cannot be experimentally tested using a clinical human sample due to the ethical implications, it is nonetheless a factor that researchers should remain mindful of when comparing results across different methods in which different immobilisation techniques may have been used.
While acknowledging that the breadth of biological variation means that estimates of fracture age will never be absolute, the use of a method less dependent on practitioner experience may also help to support its role in medicolegal investigations [7]. In 2021, Hržić, et al. [58] were the first to use artificial intelligence (AI) as a tool to support the process of ageing paediatric fractures. The preliminary study analysed 7140 wrist radiographs spanning 3570 fractures, with the age of each categorised into one of three groups (fractures aged 0 weeks old, 1–3 weeks old, or older than 3 weeks). They concluded that although the model requires further refinement, it showed promisingly accurate results by assigning fractures to the group of known correct age when information regarding the patient’s sex and age were incorporated. One year later, Kyllonen, et al. [59] then tested a similar machine learning approach. A total of 1,813 radiographs spanning 942 paediatric fractures were aged using the Malone, et al. [6] method, as well as a new four-feature scoring criteria describing callus appearance, fracture discontinuity, bridging, and sclerosis. They found that although differences between the two methods were statistically insignificant, manual assessment using Malone, et al. [6] still offered greater precision. Tsai, et al. [60] similarly used a deep learning algorithm to date clavicular fractures which, like Walters, et al. [27] and Fadell, et al. [17], focused on birth-related injuries to ensure fractures were of known age. Although their model accurately estimated fracture age to within 1 week in 83.7% of cases, the sample was limited to infants aged 3 months or less. Given that they still found accuracy to decrease as time since injury increased and radiographic healing features became less distinct, this suggests that the model may not be applicable to non-birth related fractures in older patients and may not succeed the accuracy of age estimations already possible using manual methods.
AI represents a rapidly developing field and has demonstrated some early potential for fracture ageing. However, current constraints in its accuracy are primarily a byproduct of the resolution of the imaging modality, among other variables such as bone sample and patient demographics. With histology remaining the gold-standard for ageing, this suggests that until radiographic image quality is improved to visualise finer features of healing, ageing will remain broad and nonspecific regardless of its manual or automatic assessment.
To help address this, several publications [61–63] are calling for research to investigate the potential of other imaging modalities for fracture ageing. Accordingly, micro-CT is now among the most frequently cited modalities presenting a promising direction for future research [31, 62–65]. While medical CT scanners can generally offer a resolution down to 600 μm (or 0.6 mm) [66, 67], micro-CT has shown the capability to scan forensic samples for fracture detection in child abuse cases with a resolution as low as 60 μm (or 0.06 mm) [68], where higher resolution can be achieved with smaller samples [68, 69]. Depending on sample size, this gives micro-CT the unique ability to visualise and measure 3D structures with resolution potentially comparable to histopathology [70], and has seen an increasing uptake in forensic practice over recent years [39, 62, 64, 68, 71]. It is acknowledged that micro-CT is an exclusively postmortem analysis, thereby limiting its clinical applications. This is mainly due to the high radiation doses that may cause adverse biological effects, as well as limitations in sample sizes and accessibility to the equipment [72, 73]. Additionally, there are challenges associated with slow data acquisition and processing times, and storage of the large volumes of data generated [73]. Nonetheless, recognising its potential as a postmortem analysis, micro-CT has become the focus of extensive experimental research on fracture ageing. Morgan and colleagues [74], for example, used of a murine model to assess changes in internal callus microstructure overtime, as correlated with the regain of mechanical function. Although their research was clinically aligned, the results demonstrated the potential of micro-CT for fracture ageing, with changes in tissue mineral density, as measured though grayvalue, exhibiting a strong temporal correlation.
More recently, Steyn and colleagues’ [75] then assessed posttraumatic survival time in cranial trauma. Like Cappella, et al. [14], Steyn, et al. [75] used the histomorphological stages of De Boer, et al. [16] to assess healing in dry bone samples, however instead supplemented the histological and radiographic methods with micro-CT [75]. The results demonstrated that although histopathology still offered the highest resolution, micro-CT could valuably visualise some healing features such as early callus formation that were otherwise unable to be detected radiographically. Tsai, et al. [76] and later Baier, et al. [39, 62] then further demonstrate the utility of micro-CT as a 3D complement to histology for fracture detection, finding equally promising results. Micro-CT is therefore increasingly emerging as a technology used in forensic casework to assist in the detection and ageing of fractures, and improve the lucidity of evidentiary presentation in court [39, 62, 68].
Considering micro-CT’s potential as a standalone method for fracture age estimation, initial work by Viero and colleagues [63] has additionally sought to compare micro-CT with 2D projection radiography for the assessment of fracture healing. Using six defined healing stages, they qualitatively assessed the morphology of nine rib fractures of known age, comparing the extent of healing information attainable with both micro-CT and 2D projection radiography. Unsurprisingly, the former offered significantly higher detail for visualising internal callus microstructure and was concluded to be a promising modality for fracture ageing. Given that the study was still reliant on the practitioner’s subjective interpretation of morphological features, Viero, et al. [31] then went on to explore the potential for a more quantitative measure of fracture age. Using the same fracture sample as their previous publication [63], the authors identified five histomorphometric parameters measurable in the healing callus; namely anisotropy, connectivity density, mineralisation volume, and trabecular separation and thickness. It was found that these consistently demonstrated measurable differences associated with age and maturity of the callus, and after further validation may offer a more quantitative and objective means to estimate fracture age. This would then help to further build upon the earlier findings of Sanchez, et al. [25], who found that even when using 2D projection radiography, measurement of callus thickness was a more reliable estimate of fracture age than the qualitative criteria traditionally used. Further research is now needed to expand upon the work of Viero, et al. [31] using a larger sample of fractures with more regular post-traumatic intervals to gain a more precise understanding of the quantifiable evolution of fracture healing and associate this with a usable timetable. The high resolution, non-destructive, and 3D nature of micro-CT therefore situates it as an ideal modality through which to complement plain 2D bone histology and advance novel, more robust methods for objective fracture age estimation.
Conclusion
It is clear that the estimation of fracture age is a multidisciplinary endeavour, with both clinical and forensic applications. Comparison of the numerous modalities and healing criteria used to age fractures has revealed little methodological consistency between or within modalities, and few attempts to validate approaches for forensic application. While histopathology offers the greatest sensitivity for fracture analysis, it is only applicable to postmortem samples, meaning that radiographic approaches represent most of the published methods. Despite this, low resolution, small sample sizes, inconsistent bone type and patient samples, and the subjective and variable definition of age stages are still among the factors that limit current radiographic applications. Accordingly, the more recent utilisation of industrial imaging modalities for fracture ageing such as micro-CT has seen promising results, however this is yet to be widely researched. Further work is therefore needed to synthesise the strengths of the existing approaches with modern advancements in imaging technology, to advance more holistic and sensitive approaches for fracture ageing.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Crown Prosecution Service (2024) Homicide: murder, manslaughter, infanticide and causing or allowing the death or serious injury of a child or vulnerable adult. Crown Prosecution Service. https://www.cps.gov.uk/legal-guidance/homicide-murder-manslaughter-infanticide-and-causing-or-allowing-death-or-serious. Accessed 31 March 2025
- 2Royal College of Paediatrics and Child Health (2020) Child Protection Evidence – Systematic review on Fractures. Royal College of Paediatrics and Child Health. https://childprotection.rcpch.ac.uk/child-protection-evidence/fractures-systematic-review/ Accessed 3 March 2025
- 3The Society and College of Radiographers and The Royal College of Radiologists (2018) The Radiological Investigation of Suspected Physical Abuse in Children. The Society of Radiographers. https://www.sor.org/learning-advice/professional-body-guidance-and-publications/documents-and-publications/archive-documents/the-radiological-investigation-of-physical-abuse-i#:~:text=It%20revamps%20initial%20guidance%20from%202008%20and%20incorporates,This%20document%20is%20available%20as%20a%20PDF%20only. Acc
