Comparative analysis of bone density measurement techniques: a systematic review of quantitative ultrasound and dual-energy X-ray absorptiometry
Ibrahim Hadadi

TL;DR
This systematic review compares two bone density measurement techniques, QUS and DXA, to assess their effectiveness in detecting osteoporosis.
Contribution
The study provides a comprehensive evaluation of the correlation and diagnostic performance of QUS and DXA across diverse populations.
Findings
DXA and QUS showed variable correlation (r = 0.17–0.86) in measuring bone parameters.
QUS is suitable for preliminary screening in resource-limited settings but lacks the precision of DXA for diagnosis.
Diagnostic performance varied across populations, with inconsistent results in pediatric and disease-specific groups.
Abstract
Bone mineral density (BMD) and bone-related parameters are essential for osteoporosis detection. Different screening modalities are used, including quantitative ultrasound (QUS) and dual-energy X-ray absorptiometry (DXA). This systematic review aimed to evaluate the correlation and clinical utility of DXA and QUS measurements. A literature search (2005–2025) was conducted in PubMed, Scopus, Web of Science, EMBASE, Google Scholar, and Cochrane Library for English-language studies. A narrative synthesis was performed to summarize the study characteristics and outcomes. Of the 1,247 identified records, 24 studies met the inclusion criteria. DXA and QUS were used to assess bone parameters, such as BMD, bone mineral content (BMC), speed of sound (SOS), broadband ultrasound attenuation (BAU), and stiffness index (SI). The correlation between DXA and QUS varied widely (r = 0.17–0.86), with…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1| Category | Search terms |
|---|---|
| Primary measurement techniques | “quantitative ultrasound,” OR “QUS,” OR “dual-energy X-ray absorptiometry,” OR “DXA,” OR “DEXA” |
| Clinical parameters | “bone density,” OR “bone mineral density,” OR “BMD,” OR “osteoporosis diagnosis” |
| Study types | “comparative studies,” OR “validation studies,” OR “clinical trials” |
| Population terms | “postmenopausal,” OR “elderly,” OR “adult,” “children,” OR “diabetic” |
| Outcome | “correlation” OR “accuracy,” OR “precision,” OR “sensitivity,” “specificity” |
| Final Search String | (1 AND 2 AND 3 AND 4 AND 5) |
| Study characteristics | Participant characteristics | |||||||
|---|---|---|---|---|---|---|---|---|
| Study ID | Country | Study design | Sample size | Study settings | Gender (M:F) | Age (Years) | BMI (Kg/m2) | Comorbidities |
| ( | Saudi Arabia | Cross-sectional | 437 (Osteoporosis, osteopenia, normal) | Primary care clinics, employee health clinic | 0:437 | 47 | NA | NA |
| ( | Germany |
| 26 (Cadavers) | Hospital-based | 12:14 | 56–96 | NA | NA |
| ( | China | Cross-sectional | Postmenopausal with T2DM:76, | Hospital-based | Postmenopausal with T2DM: 0:76, | 46–83 | 24.2 | Diabetes |
| ( | Greece | Cross-sectional | Severe hemophilia A: 17, moderate hemophilia: 10 | Hospital-based | NA | 11.75–13.19 | NA | Hemophilia and one patient with hepatitis C |
| ( | Sweden | Cross-sectional | 80 postmenopausal women | Hospital-based | 0:80 | 63 | 24.6 | NA |
| ( | India | Retrospective | 101 (Normal, premenopausal, postmenopausal) | University campus | 0:101 | 20–65 | Normal = 23.47, premenopausal = 26.02, postmenopausal = 26.99 | NA |
| ( | Malaysia | Validation study | 134 children | Part of SEANUTS | 69:65 | 9.3 | Male = 17.7, Female = 17.6 | NA |
| ( | Germany | Case-control | Postmenopausal women: 91, Health control: 91 | Clinical | Postmenopausal women: 0:91, Health control: 0:91 | Postmenopausal women: 78.2, Health control: 75.7 | Postmenopausal women: 25.1, Health control: 26.6 | NA |
| ( | Thailand | Retrospective | 181 children | School-based and part of SEANUTS | 90:91 | 9.1 | Male = 17.9, Female = 17 | NA |
| ( | Spain | Cross-sectional | 107 adolescents | The PRO-BONE Study | 107:0 | 13.2 | 18.7 | NA |
| ( | Nepal | Retrospective | 115 (females with menopause) | Hospital-based | 50:65 | 60.17 | NA | Diabetes, COPD, hyperthyroidism, rheumatic arthritis, chronic renal failure |
| ( | Ireland | Cross-sectional | 56 | Hospital-based | 43: 13 postmenopausal | 58 | 28.4 | NA |
| ( | New Zealand | Cross-sectional | 124 children | Human nutrition research unit | 58:66 | 10 | 18.7 | NA |
| ( | Vietnam | Longitudinal | 2043 | Part of the Vietnam Osteoporosis Study | 773:1270 | 44–45.9 | Male = 23, Female = 23.4 | NA |
| ( | Sweden | Prospective | 62 diabetic patients | Foot clinics | 34:28 | 50-65 | T1DM = 24, | Diabetes |
| ( | Taiwan | Retrospective | 772 osteoporotic and non-osteoporotic patients | Hospital-based | 352:420 | 72.9 | NA | NA |
| ( | South Africa | Longitudinal | Child with HIV: 80, Child without HIV: 90 | Hospital-based | Child with HIV: 40:40, Child without HIV: 51:39 | 7.14–7.29 | 16.1–16.6 | HIV |
| ( | China | Prospective | 274 (Healthy, osteopenia, Osteoporosis) | Hospital-based | NA | 53.78–67.11 | Health = 24.40, Osteopenia = 23.87, Osteoporosis = 22.75 | NA |
| ( | UK | Cohort | 216,753 | UK-biobank | 100,065:116,688 | 57.8–58.7 | NA | NA |
| ( | Thailand | Cross-sectional | 67 | Hospital-based | 67:0 | >50 | NA | COPD |
| ( | India | Cross-sectional | 90 (postmenopausal women) | Hospital-based | 0:90 | 55.82 | 25.40 | Back pain |
| ( | Switzerland | Prospective cohort | 1345 (postmenopausal women) | Community-based (OsteoLaus cohort) | 0:1345 | 65 | NA | NA |
| ( | Italy | Cross-sectional | 201 (females were menopausal) | Two Italian centers | 11:190 | 62.1 | NA | Rheumatic musculoskeletal disease, CKD, and Diabetes |
| ( | Uganda | Cross-sectional | 167 | Hospital-based and subtype of the CHAPAS-4 trial | 85:82 | 9.4 | NA | HIV |
| Study ID | Technical characteristics | |||||
|---|---|---|---|---|---|---|
| Modalities compared | Protocol measurements | Equipment specification | Calibration | Quality control procedures | Measurement sites | |
| ( | DXA vs QUS | DXA: BMD | DXA: PIXI (Lunar GE, Radiation Corporation, Madison, WI USA) | References values were provided by manufacturer for comparison | Procedural manuals were used | DXA: Lumbar spine and femoral neck |
| ( | QUS vs DXA | DXA and QUS: Structural and mechanical | QUS: DBM Sonic 1200 (IGEA, Carpi, Italy) | NA | NA | Epiphyseal condyle site and meta-diaphyseal site |
| ( | QUS vs DXA | QUS: SOS | QUS: Sunlight Omnisense (7000P device) Petach Tikva, Israel | NA | QUS: SOS verification phantom | QUS: Radius, phalanx, tibia |
| ( | QUS vs DXA | QUS: SOS | QUS: Sunlight Omnisense (7000P device) Petach Tikva, Israel | QUS: Manufacturers’ verification phantom | NA | QUS: Peripheral bones |
| ( | QUS vs DXA | QUS: SOS, BAU | QUS: LUNAR (Achilles, Madison, WI, USA) | NA | Using a phantom every day | QUS: Calcaneal |
| ( | QUS vs DXA | QUS: SOS | QUS: Omnisense cbone densitometer | NA | NA | QUS: Radius and tibia |
| ( | QUS vs DXA | QUS: SOS | QUS: Commercial device (Omnisense 8000P, Sunlight, Petah Tikva, Israel) | QUS: Verification phantom was used for calibration provided by the manufacturer | NA | QUS: SOS |
| ( | DXA vs QUS | DXA: BMD | DXA: Prodigy® bone densitometer (GE/Lunar Corporation, Madison, WI, USA | NA | DXA: Quality assurance scans were performed on a daily basis | DXA: lumbar spine (L1-L4) and the hip (femoral neck and total hip) |
| ( | QUS vs DXA | QUS: SOS | QUS: Sunlight Omnisense, Petah Tikva, Israel (Model 8000P) | NA | NA | QUS: Non-dominant arm |
| ( | DXA vs QUS | DXA: BMC | DXA: GE Lunar Healthcare Corp, Madison, WI | Lumber spine phantom as recommended by | NA | DXA: Bilateral |
| ( | QUS vs DXA | QUS: SOS | QUS: Sunlight MiniOmni bone sonometer (BeamMed Ltd., Tel Aviv, Israel) | DXA: Using spine phantom before the measurements | NA | Radius, left and right femur, spine |
| ( | QUS vs DXA | QUS: SOS, BUA, SI | QUS: General healthcare Lunar (Achilles InSight Densitometer | QUS: Daily calibration | DXA: International recommendations were used | QUS: Calcaneus |
| ( | QUS vs DXA | QUS: calcaneal BMD, SOS, BUA and SI | QUS: Sahara Clinical Bone Sonometre (Hologic Inc, USA) | DXA: Calibration materials were used | NA | QUS: Calcaneus |
| ( | QUS vs DXA | QUS: BUA | QUS: Portable ultrasound (Sahara, Hologic Corp., Bedford, MA, USA) | DXA: Phantom was used before measurement | NA | QUS: Calcaneus |
| ( | QUS vs DXA | Bone density and fractures | QUS: Lunar achilles bone densitometer | NA | NA | QUS: Calcaneus in both feet |
| ( | QUS vs DXA | DXA: BDM | QUS: a Pegasus device (BeamMed Ltd., Tel Aviv, Israel) | DXA: A spine phantom was used prior to measurements | NA | Hip and spine |
| ( | QUS vs DXA | QUS: SOS, BUA | QUS: Lunar Achilles Insights device (GE Healthcare, Madison, WI, USA) | NA | QUS: Built-in quality assurance test before each use | QUS: Calcaneal |
| ( | QUS vs DXA | DXA: BDM | QUS: OSTEOKJ7000+ (Kejin, Nanjing, China) with a multichannel convolutional neural | NA | DXA: Daily standard procedures were followed | QUS: 1/3 distal radius of non-dominant hand |
| ( | QUS vs DXA | QUS: SOS, BUA | QUS: Sahara Clinical | DXA: Manufacturer’s phantom was used | QUS: Using a phantom, as per | QUS: Heel |
| ( | QUS vs DXA | QUS: SOS, BUA | QUS: Acoustic Osteo-Screener ultrasound device (AOS-100, Aloka Co., Ltd., Japan) | NA | NA | QUS: Calcaneus |
| ( | QUS vs DXA | BMD | NA | NA | NA | Neck, lumber, wrist |
| ( | QUS vs DXA | QUS: SI, SOS, BUA | QUS: Achilles Express (GE-Lunar, USA) | Daily calibration of QUS per manufacturer instructions | QUS and DXA performed by same operator; daily phantom calibration | QUS: Heel (right, or left if right fractured) |
| ( | QUS vs DXA | QUS: SOS, BUA | QUS: OsteoSys BeeTLe | NA | All DXA scanners underwent daily quality control | Femoral, total hip and lumbar spine levels |
| ( | DXA vs QUS | DXA: BMD | DXA: DXA Hologic Discovery Wi DXA scanner Hologic Bedford | DXA: The DXA scanner was calibrated daily using a spine phantom and auto air calibration | NA | DXA: Head, lumbar spine; QUS: non-dominant foot |
| Study ID | Outcomes | ||||
|---|---|---|---|---|---|
| Statistical analysis performed | Correlation coefficient | Diagnostic accuracy | Follow-up | Conclusion | |
| ( | Correlation analysis | Low to moderate (r = 0.43–0.64, p = 0.000) | NA | NA | QUS may not be used as a screening tool |
| ( | Correlation analysis | QUS significantly correlated with DXA (r = 0.69–0.79, p <0.05) | NA | NA | QUS is superior for structures |
| ( | Correlation analysis | r = 0.26–0.75, p <0.05 | NA | NA | Measurement did not change in parallel |
| ( | Correlation analysis | No correlation | NA | NA | No correlation was observed |
| ( | Correlation analysis and sensitivity | Significant correlation | Mean Sensitivity QUS: 79% | 7 years | QUS is highly correlated with DXA |
| ( | Correlation coefficient | Significant correlation (SOS radius and tibia (r = 0.858 and 0.860) | NA | NA | QUS is a sensitive screening tool |
| ( | Correlation analysis, ROC | The mean difference between the two techniques was relatively large | AUC: 0.94 | NA | Radial QUS and DXA are not comparable |
| ( | AUC | NA | DXA: AUC: Femoral neck = 0.69, Total hip = 0.71, Lumber spine = 0.59; | NA | The Sahara, Achilles, and InSight QUS devices showed similar hip fracture discrimination when compared to DXA |
| ( | Correlation | No correlation was observed | NA | NA | At radius, the SOS measurements were not appropriate for the assessment of bone quality status |
| ( | Correlation | Fair to good intra-class correlation coefficients of agreement (r = 0.60–0.68) | NA | NA | QUS and DXA had comparable outcomes |
| ( | Correlation, ROC analysis | Significant correlation | AUC = 0.69 | NA | QUS is a sensitive screening tool |
| ( | Correlation analysis, AUC | QUS significantly correlate | AUC = 0.77 | NA | QUS identify patients with osteoporosis |
| ( | Pearson correlation coefficients, linear regression | Positive correlations between QUS and DXA (r = 0.30–0.45, P <0.01) | NA | NA | Calcaneal QUS and DXA are not interchangeable methods for measuring bone density in children |
| ( | Linear regression model | BUA modestly correlated with lumbar spine BMD (r = 0.34; P <0.0001) and femoral neck BMD (r = 0.35; P <0.0001) | NA | NA | QUS BUA is not a reliable method for screening osteoporosis |
| ( | Correlation and logistic regression | Positive correlation | NA | 10–11 years | QUS is an appropriate modality and is used for calcaneus as a fracture risk predictor |
| ( | Correlation analysis, ROC analysis | Low correlation (r = 0.17) | AUC: 0.731 | NA | A meaningful but low correlation between QUS and DXA |
| ( | Correlation analysis | strong correlations at the calcaneus | NA | 12 months | The two methods did not correlate well longitudinally |
| ( | AUC | NA | QUS: Sensitivity = 80.86%, Specificity = 84.23%, Accuracy = 83.05% | NA | QUS tools are promising future developmental directions |
| ( | Reliability, correlation and sensitivity | Low to modest correlations | Sensitivity: Very poor (0.05–0.23) for osteoporosis, and poor | NA | QUS has the potential to produce |
| ( | Correlation, sensitivity, specificity | Significantly moderate correlation | Sensitivity = 10.4% | NA | QUS cannot replace DXA as an alternative |
| ( | Correlation analysis, AUC | Significant correlation | Sensitivity = 86.36% | NA | QUS can be used as an alternative screening tool |
| ( | Correlation and logistic regression analysis | Significant correlation | NA | 6.7 years | QUS predicts fractures independently of FRAX, BMD, and TBS; suitable as pre-screening tool, not for monitoring |
| ( | Repeatability, ROC analysis | NA | QUS: AUC: Femoral neck = 0.81, Total hip = 0.72, Lumber spine = 0.78 | NA | QUS demonstrated good repeatability and performance similar to DXA |
| ( | Correlation, AUC | Moderate to weak correlation | QUS was a weak predictor of DXA Z-score equal to or less than −2 (area under the ROC curve = 0.59) | NA | QUS may not be an appropriate substitute for DXA scan |
| Study ID | Study design | Selection | Comparability | Outcomes | |||||
|---|---|---|---|---|---|---|---|---|---|
| Item 1 | Item 2 | Item 3 | Item 4 | Item 5 | Item 6 | Item 7 | Item 8 | ||
| ( | Cross-sectional | No description | * | * | * | * | * | No follow-up | No statement |
| ( | Cross-sectional | * | * | * | * | * | * | No follow-up | No statement |
| ( | Cross-sectional | No description | * | * | * | * | * | No follow-up | No statement |
| ( | Cross-sectional | * | No stated | * | * | * | * | * | * |
| ( | Retrospective | * | * | * | * | * | * | No follow-up | No statement |
| ( | Validation study | * | * | * | * | * | * | No follow-up | No statement |
| ( | Case-control | No description | Not stated | * | * | * | * | * | * |
| ( | Retrospective | No description | * | * | * | * | * | No follow-up | No statement |
| ( | Cross-sectional | * | * | * | * | * | * | No follow-up | No statement |
| ( | Retrospective | * | * | * | * | * | * | No follow-up | No statement |
| ( | Cross-sectional | * | * | * | * | * | * | No follow-up | No statement |
| ( | Cross-sectional | * | * | * | * | * | * | No follow-up | No statement |
| ( | Longitudinal | * | * | * | * | * | * | No follow-up | No statement |
| ( | Prospective | * | * | * | * | * | * | * | * |
| ( | Retrospective | * | * | * | * | * | * | No follow-up | No statement |
| ( | Longitudinal | * | * | * | * | * | * | * | * |
| ( | Prospective | * | * | * | * | * | * | No follow-up | No statement |
| ( | Cohort | No description | No follow-up | No statement | |||||
| ( | Cross-sectional | * | * | * | * | * | * | No follow-up | No statement |
| ( | Cross-sectional | * | * | * | * | * | * | No follow-up | No statement |
| ( | Prospective | * | * | * | * | * | * | * | No statement |
| ( | Cross-sectional | * | * | * | * | * | * | No follow-up | No statement |
| ( | Cross-sectional | * | * | * | * | * | * | No follow-up | No statement |
| GRADE domain | Judgement description | Concerns |
|---|---|---|
| Methodological limitations | Overall, studies were found with low risk of bias, however, concerns in the selection of patients and follow-up exist ( | Not serious |
| Indirectness | Studies provided details of the characteristics of patients, screening modalities, and clinical outcomes (correlation coefficient and ROC). | Not serious |
| Imprecision | Overall, acceptable number of patients were included in the selected studies. Most studies clearly define the selection criteria for patients. | Not serious |
| Inconsistency | Low to fair correlation was reported. | Not serious |
| Publication bias | Even the funnel plot was not constructed for publication bias, however, we did not find any publication bias, as both negative and positive outcomes were reported. | Not suspected |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBone health and osteoporosis research · Nutrition and Health in Aging · Parathyroid Disorders and Treatments
Introduction
1
Osteoporosis is characterized by a progressive decline in bone mineral density (BMD) and deterioration of the bone microarchitecture, which substantially increases skeletal fragility (1, 2). Often remaining asymptomatic until the occurrence of a fragility fracture, the condition is frequently underdiagnosed, despite its significant global health impact. Such fractures account for millions of new clinical cases annually and are a primary driver of increased morbidity and mortality in aging populations (3–5).
Although the World Health Organization (WHO) diagnostic criteria for osteoporosis remain centered on BMD T-scores measured at clinically relevant skeletal sites, the clinical application of these standards relies heavily on Dual-energy X-ray absorptiometry (DXA) (6). Currently, the reference standard, DXA, is favored for its reproducibility and low radiation exposure (7, 8). However, DXA measurements may be affected by technical and procedural factors, including patient positioning and analysis protocols, and its availability is largely restricted to hospital-based settings (8).
A significant limitation of DXA is its reliance on two-dimensional areal density, which does not account for bone volume. Consequently, larger bones may yield artificially inflated BMD values compared to smaller bones with identical volumetric densities, potentially leading to diagnostic inaccuracies based solely on bone size (9). Furthermore, it is largely confined to hospitals and remains poorly suited for bedside screening or use in rural and community settings. Quantitative ultrasound (QUS) has emerged as a viable non-invasive, portable, and radiation-free alternative, to DXA offering the portability required for broader clinical and community-based applications. Its parameters include broadband ultrasound attenuation (BUA), speed of sound (SOS), and stiffness index (SI) (10). QUS is usually used for easily accessible bones, such as the calcaneus, tibia, patella, metatarsal bones, phalanges, and radius (11). Large population-based datasets, including the UK Biobank, have further supported the use of QUS-derived estimates of bone status in epidemiological and genetic studies (12). Both DXA and QUS are associated with fracture risk prediction, particularly for hip fractures in older populations. However, established clinical guidelines emphasize that QUS should serve as a pre-screening tool to identify individuals who may benefit from confirmatory DXA assessment, rather than as a replacement for the reference standard (13, 14).
Although QUS has been expanded for clinical adoption, its performance relative to DXA remains inconsistent across different populations and devices. These discrepancies in the literature are often rooted in the heterogeneity of QUS technologies, differing skeletal measurement sites, and varying definitions of outcomes. Beyond fracture risk prediction, the degree of clinical interchangeability between these modalities remains contentious. This systematic review investigates these correlations to clarify the clinical utility of QUS and to define its specific role within established osteoporosis screening frameworks.
Methodology
2
This systematic review was conducted in accordance with the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines, with a completed PRISMA checklist provided as a Supplementary File (15). The study protocol was registered in the International Prospective Register of Systematic Reviews (PROSPERO) under the registration number CRD420251146250 on 12 September 2025. The registration process was retrospective. No amendments were made to the protocol after registration.
Search strategy
2.1
For the selection of studies, the PICO framework was used: P (Population): Patents scanned for bone density measurement, I (Intervention): QUS used for scanning, C (Comparator): DXA used for comparison, O (Outcomes): Bone density measurements, including BMD, which reflects the mineral concentration/unit area, BMC quantifies the total mineral amount in a bone region, SOS reflects bone density and elasticity, BUA measures the reduction in the ultrasound signal when it passes through bone, SI is the combination of SOS and BUA, diagnostic accuracy and precision, correlation, and clinical outcomes. These parameters were selected because they represent both the structural and functional aspects of bone health. The literature search was conducted in PubMed, Scopus, Web of Science, EMBASE, Google Scholar, and Cochrane Library. The search covered publications from January 2005 to July 2025. The search terms were strategically combined using Boolean operators and included variations of the key concepts (Table 1).
Inclusion and exclusion criteria
2.2
Studies were eligible if they were comparative and analyzed the two modalities of bone density assessment. In terms of outcomes, the primary outcome was direct comparison of bone density measurements, with secondary outcomes including diagnostic accuracy and precision, correlation between techniques, and clinical outcomes, where possible with clear reporting around measuring protocols. Studies published between January 2005 and July 2025 in English language and peer-reviewed journals.
Studies that did not compare the two modalities with accuracy and precision outcomes were excluded. In addition, case reports, reviews, commentaries, animal studies, non-English publications, conference abstracts without full texts, letters to the editor, and opinion pieces were excluded. Exclusion criteria were also applied on technical grounds, including obsolete technology, non-standard protocols, lack of description of methodology, absence of statistical analysis, and insufficient quality control, among others.
Studies selection process
2.3
Studies were initially identified through searches of multiple databases, and duplicate records were excluded. The titles and abstracts of the retrieved records were screened by the author according to predefined inclusion and exclusion criteria. Full-text articles of potentially relevant studies were subsequently retrieved and assessed for eligibility by the author. Eligibility criteria were uniformly applied across all screening stages to ensure consistency in study selection. Studies excluded at the full-text stage were documented, along with the reasons for exclusion. All studies meeting the eligibility criteria were included in this systematic review for qualitative synthesis.
Data extraction process
2.4
Four areas were covered in the extraction process. Study characteristics were extracted first, including the author, year of publication, study design, methodology, sample size, population characteristics, study geographical location, and setting. Second, technical parameters were recorded, including the measurement protocols for each modality, equipment specifications and calibration methodologies, quality control procedures, and information on the measurement sites and parameters. Third, outcome data obtained (i.e., the number of participants meeting defined primary and secondary outcomes, estimates of statistical analyses performed, correlation coefficients, and diagnostic accuracy measures). Finally, quality indicators were evaluated to assess the strength of the study design, statistical handling, control of confounders, and completeness of follow-up.
Definition of study variables
2.5
The primary variables of interest included BMD measurements obtained using DXA at clinically relevant skeletal sites and QUS-derived parameters, including BUS, SOS, and SI, as defined in the original studies. The outcome variables included reported measures of association between QUS and DXA, diagnostic performance metrics where available, and fracture-related outcomes. The secondary variables comprised skeletal measurement sites, participant demographic characteristics, and technical characteristics of the measurement devices, as reported in the included studies.
Quality assessment
2.6
Standardized evaluation tools relevant to each study design were employed to appraise the methodological quality of the included studies. The quality of observational studies was evaluated using the Newcastle–Ottawa Scale according to selection, comparability, and outcome assessment. The methodological quality of the in vitro studies was assessed using the QUIN assessment tool. This assessment tool comprises items such as aim/objectives, sample size calculation, comparison group, methodology explanation, operator details, randomization, method of measurement of outcomes, outcome assessment or details, blinding, statistical analysis, and presentation of results. Single in vitro study was evaluated according to these items and rated as yes (allocating 1–2 points), no with 0 points or not applicable, high risk of bias (RoB) scores <50%, 50%–70% were the medium RoB, and >70% were the low RoB (16). These established tools provide a rigorous and objective assessment of the methodological quality of all eligible studies.
Certainty of evidence
2.7
The Grading, Reporting, Assessment, Development, and Evaluation (GRADE) framework was used to assess the certainty of the evidence. Outcomes were rated in the domain of methodological limitations, indirectness, imprecision, inconsistency, and publication bias as low, high, or not serious.
Statistical analysis
2.8
Descriptive statistics were used to summarize the study characteristics, population demographics, measurement parameters, and reported outcomes. Given the substantial clinical and methodological heterogeneity across the included studies, no quantitative pooling or meta-analysis was performed. Instead, the findings were synthesized descriptively, with reported associations between QUS and DXA measurements, diagnostic performance metrics, and fracture-related outcomes summarized narratively, as presented in the original studies. Where applicable, results were described according to relevant study characteristics to facilitate a qualitative comparison.
Results
3
Study selection and characteristics
3.1
Identification of the included studies
3.1.1
A total of 1,247 records were identified through database searches. Prior to screening, 255 records were removed, including 71 duplicate records, 94 records marked as ineligible by automation tools, and 90 records removed for other reasons. The remaining 992 records were screened based on their titles and abstracts, of which 748 were excluded. Full-text reports were sought for 244 records, of which 53 were not retrieved. The remaining 191 full-text articles were assessed for eligibility, and 167 were excluded for predefined reasons (PICO not followed, n = 85; required outcomes not reported, n = 39; no comparison with DXA, n = 43). Ultimately, 24 studies were included in the qualitative synthesis (Figure 1).
PRISMA flow chart showing study selection process.
Demographic and study population characteristics
3.1.2
A total of 24 studies conducted across multiple countries were included, encompassing populations from Europe, Asia, Africa, and Oceania (Table 2). All studies followed non-randomized designs, with the majority employing cross-sectional methodologies (17–27), followed by retrospective (28–31), prospective (32, 33, 40), and longitudinal designs (34, 35). A single study followed the cohort, validation, in vitro, and case-control study designs (Table 2).
The cumulative sample size across all 24 studies (223,676 participants), with individual study populations ranging from 27 to 216,753 participants (22, 36). Furthermore, 26 cadavers were used to compare the screening techniques (37). Most studies were hospital-based and conducted as part of larger research cohorts. Variation was also observed in the sex distribution of the included studies. Most studies (14, 58.3%) included mixed populations, while others specifically focused on menopause/postmenopausal females or males or targeted distinct groups such as diabetic patients or elderly individuals (19–21, 24, 25, 27, 29, 38). The mean age across studies ranged from 7.14 to 96 years (35, 37), and reported body mass index values ranged from 17 Kg/m^2^ to 28.44 Kg/m^2^ (25, 29). The comorbidities are listed in Table 2.
Technical characteristics of screening modalities
3.2
Overall, studies have compared DXA and QUS screening modalities and evaluated various bone parameters, including BMD, BMC, SOS, BAU, SI, and fracture rates. Various models of both screening modalities were used. The calibration methods also differed with some studies using manufacturer-supplied phantoms. Quality control procedures, including routine instrument checks, have been reported in some studies. A significant variation was observed in the measurement sites: QUS primarily targeted the calcaneus (heel), tibia, forearm, and radius, whereas DXA focused more on the lumbar spine, femoral neck, total hip, and sometimes the total body (Table 3).
Summary of key findings by modality comparison
3.3
Correlation analyses were the most frequently reported statistical approach used to evaluate the associations between DXA and QUS measurements. Reported correlation coefficients ranged from low (r = 0.17) to high (r = 0.86) (29, 31), with some studies reporting no statistically significant associations (22, 30). The diagnostic performance was evaluated in a subset of studies using receiver operating characteristic analysis. The area under the curve (AUC) values varied across studies, devices, and populations, as summarized in Table 4.
Clinical applications and patient-specific outcomes
3.4
Postmenopausal women
3.4.1
Eight studies included postmenopausal women (17, 24, 25, 27–29, 38, 40). These studies reported correlations between QUS parameters and DXA measurements, as well as sensitivity and repeatability metrics (Table 4).
Elderly population
3.4.2
Studies conducted in elderly populations (50–96 years) have reported associations between QUS parameters and fracture-related outcomes, including hip fractures, using calcaneal measurements across multiple QUS devices (25–28, 33, 36, 38). The reported correlation coefficients varied across studies, with some demonstrating low correlations between QUS and DXA (r = 0.17) (31) (Table 4).
Special populations
3.4.3
Studies involving children aged 7.1–13.2 years have reported correlations between QUS and DXA measurements ranging from no association to fair correlation (18, 22, 23, 30, 35, 39). However, one study reported a fair to good correlation (r = 0.60–0.80) between QUS and DXA measurements (20) (Table 4).
Disease-specific analysis
3.4.4
Several studies have evaluated disease-specific populations, including patients with diabetes, COPD, rheumatic arthritis, and chronic renal failure. Reported AUC values ranged from 0.69 to 0.81 across different skeletal sites and outcomes (17, 28). However, other studies have found no comparable outcomes between QUS and DXA in these populations (21–24, 35) (Table 4).
Quality assessment outcomes
3.5
The methodological quality varied across the studies. Most studies demonstrated adequate reporting of the selection and comparability domains, although five studies lacked sufficient descriptions of participant selection (19, 22, 30, 36, 38). Adequate follow-up was reported in four studies (27, 32, 35, 38) (Table 5). The in vitro study demonstrated a low risk of bias across most assessed domains, although sample size justification, randomization, and blinding procedures were not reported (37).
Certainty of evidence
3.6
Overall, most studies achieved moderate methodological quality based on the Newcastle-Ottawa Scale, with several cross-sectional and retrospective designs scoring lower in the selection and follow-up domains. Common limitations included incomplete description of participant selection, lack of follow-up, and insufficient reporting of confounder control. Despite these issues, the outcome assessments were generally well reported.
In the methodological limitation domain, no serious concerns were identified, as most studies demonstrated a low risk of bias. Similarly, indirectness was not considered serious because the studies provided sufficient details of patient characteristics, screening modalities, and clinical outcomes. Imprecision was also judged as not serious, as the majority of studies adequately explained patient selection, although five studies did not provide complete details of patient selection. Both positive and negative results were reported across studies; thus, publication bias was not suspected. According to GRADE (Table 6), the overall certainty of evidence was rated as low to moderate, with inconsistency in correlation values remaining the main factor limiting confidence in the findings.
Discussion
4
In this systematic review, QUS demonstrated a low-to-fair correlation with DXA across diverse populations, accompanied by variability in diagnostic accuracy and methodological quality among studies. These findings highlight both the potential and limitations of QUS when considered alongside DXA, particularly in the context of screening rather than diagnosis. The observed heterogeneity across studies underscores the importance of population characteristics, measurement sites, and device-specific factors when interpreting QUS performance.
Our findings are partially consistent with those of Flöter et al. (41), who reported the variable diagnostic performance and moderate sensitivity of calcaneal QUS compared with DXA. However, their review was limited to the calcaneus, whereas our analysis included multiple skeletal sites, offering a broader and more comprehensive evaluation of the skeletal sites. Similarly, a meta-analysis by Moayyeri et al. (42), demonstrated a significant association between heel QUS parameters and fracture risk, although the predictive accuracy was moderate and varied across studies. Importantly, their work focused mainly on fracture prediction rather than diagnostic agreement with DXA, highlighting the complementary role of QUS in relation to DXA.
Evidence from the included studies involving postmenopausal women and older adults suggests that QUS is associated with moderately correlated with DXA and has reported sensitivity values for the detection of low bone density. These findings support the potential role of QUS as a preliminary screening tool in high-risk populations, particularly in settings where access to DXA is limited. Early identification of bone loss in postmenopausal women is clinically relevant because therapeutic interventions become less effective with advancing age (43). In elderly populations, QUS might help triage individuals who require further DXA evaluation, especially in primary care and low-resource environments (44, 45).
In pediatric populations, the available evidence indicates a poor agreement between QUS and DXA measurements. Studies investigating children and adolescents have consistently shown weak or inconsistent correlations, suggesting that QUS cannot reliably substitute DXA in younger age groups (46, 47). Factors such as rapid skeletal growth, anatomical variability, and challenges in standardized measurement sites likely contribute to these findings, thereby limiting the clinical applicability of QUS in pediatric settings.
The findings were particularly inconsistent in patients with chronic conditions, such as diabetes, COPD, and chronic kidney disease. These populations exhibited variable correlations and diagnostic agreement between QUS and DXA, influenced by disease-related alterations in bone metabolism, measurement site selection, and T-score thresholds (14, 48, 49). These findings indicate that QUS should be used cautiously in clinically complex populations and highlight the need for population-specific calibrations and validations.
Substantial variability in the reported correlations across studies may be attributed to differences in QUS device models, manufacturers, calibration methods, skeletal measurement sites, and operator-dependent factors. These methodological differences limit the comparability across studies and complicate the interpretation of pooled findings.
From a clinical and public health perspective, the findings support the use of QUS as a screening or triage tool rather than a diagnostic replacement for DXA. Its portability, affordability, and lack of ionizing radiation make it particularly attractive for large-scale screening in elderly and postmenopausal populations, especially in low-resource settings. However, confirmatory DXA assessment remains essential for diagnosis and clinical decision-making.
This review provides a comprehensive synthesis of the evidence comparing QUS and DXA across multiple populations and skeletal sites. The strengths of this review include the broad inclusion of study designs and populations and the application of standardized quality and certainty assessment tools. Limitations include the inability to perform a meta-analysis due to substantial heterogeneity, lack of standardized QUS protocols, and variability in outcome definitions and thresholds. Additionally, the overall certainty of the evidence was rated as low to moderate, primarily due to inconsistency and indirectness.
Future studies should focus on standardizing the QUS measurement protocols, device calibration, and diagnostic thresholds. Prospective studies with consistent outcome reporting and longer follow-up are required to better define the role of QUS in fracture risk assessment and screening. Population-specific validation studies, particularly in pediatric and disease-specific cohorts, are essential for improving clinical confidence and applicability.
Conclusion
5
This systematic review summarizes the available evidence comparing QUS and DXA for bone assessment in various populations. The findings indicate that QUS may serve as a preliminary screening tool, particularly in older adults and selected clinical populations, although considerable variability exists across studies. Importantly, the current findings do not support the use of QUS as a substitute for DXA, which remains the diagnostic reference standard for osteoporosis. Further well-designed studies with longer follow-ups and standardized methodologies are required to better define the role of QUS in osteoporosis screening pathways.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Aibar-Almazán A Voltes-Martínez A Castellote-Caballero Y Afanador-Restrepo DF Carcelén-Fraile Md C López-Ruiz E . Current status of the diagnosis and management of osteoporosis. Int J Mol Sci. (2022) 23:9465. doi: 10.3390/ijms 23169465, PMID: 36012730 PMC 9408932 · doi ↗ · pubmed ↗
- 2Zhang L Luo X Liu H Zhu W Zhang X Zhu S . Prevalence and risk factors of osteoporosis and osteopenia among residents in Hubei province, China. Arch Osteoporos. (2023) 18:49. doi: 10.1007/s 11657-023-01245-7, PMID: 37060512 · doi ↗ · pubmed ↗
- 3Salari N Ghasemi H Mohammadi L Behzadi MH Rabieenia E Shohaimi S . Global prevalence of osteoporosis among the world older adults: a comprehensive systematic review and meta-analysis. J Orthopaedic Surg Res. (2021) 16:669. doi: 10.1186/s 13018-021-02821-8, PMID: 34657598 PMC 8522202 · doi ↗ · pubmed ↗
- 4Le Boff MS Greenspan SL Insogna KL Lewiecki EM Saag KG Singer AJ . The clinician’s guide to prevention and treatment of osteoporosis. Osteoporos Int. (2022) 33:2049–102. doi: 10.1007/s 00198-021-05900-y, PMID: 35478046 PMC 9546973 · doi ↗ · pubmed ↗
- 5Yu JS Krishna NG Fox MG Blankenbaker DG Frick MA Jawetz ST . ACR appropriateness criteria® Osteoporosis and bone mineral density: 2022 update. J Am Coll Radiol. (2022) 19:S 417–s 432. doi: 10.1016/j.jacr.2022.09.007, PMID: 36436967 · doi ↗ · pubmed ↗
- 6WHO . Assessment of fracture risk and its application to screening for postmenopausal osteoporosis. Report of a WHO Study Group. World Health Organ Tech Rep Ser. (1994) 843:1–129. 7941614 · pubmed ↗
- 7Golding PH . Dual-energy X-ray absorptiometry (DXA) to measure bone mineral density (BMD) for diagnosis of osteoporosis - experimental data from artificial vertebrae confirms significant dependence on bone size. Bone Rep. (2022) 17:101607. doi: 10.1016/j.bonr.2022.101607, PMID: 35937936 PMC 9352459 · doi ↗ · pubmed ↗
- 8Slart RHJA Punda M Ali DS Bazzocchi A Bock O Camacho P . Updated practice guideline for dual-energy X-ray absorptiometry (DXA). Eur J Nucl Med Mol Imaging. (2025) 52:539–63. doi: 10.1007/s 00259-024-06912-6, PMID: 39316095 PMC 11732917 · doi ↗ · pubmed ↗
