The Validity of Bioelectrical Impedance Analysis Compared to a Four-Compartment Model in Healthy Adults: A Systematic Review

Christopher J. Oliver; Luke Del Vecchio; Michelle Minehan; Mike Climstein; Nedeljka Rosic; Stephen Myers; Grant Tinsley

PMC · DOI:10.3390/jfmk11010065·January 31, 2026

The Validity of Bioelectrical Impedance Analysis Compared to a Four-Compartment Model in Healthy Adults: A Systematic Review

Christopher J. Oliver, Luke Del Vecchio, Michelle Minehan, Mike Climstein, Nedeljka Rosic, Stephen Myers, Grant Tinsley

PDF

Open Access

TL;DR

This review compares bioelectrical impedance analysis to a four-compartment model for body composition and finds BIA devices are not clinically equivalent.

Contribution

The study systematically evaluates BIA's validity against the 4C model and highlights the need for standardization and alternative methods.

Findings

01

BIA devices showed wide limits of agreement with the 4C model for body fat and fat-free mass.

02

Mean bias for body fat ranged from -3.5% to +4.4%, and for fat-free mass from -3.9 kg to +1.8 kg.

03

Variations in BIA device design and 4C methodology contributed to discrepancies in results.

Abstract

Background: The four-compartment (4C) model is a criterion method for evaluating body composition tools like bioelectrical impedance analysis (BIA). This systematic review assessed the clinical equivalence of BIA devices compared to the 4C model and explored limitations in using the 4C model as a criterion method. Methods: Twelve cross-sectional and baseline longitudinal studies involving healthy, weight-stable, non-athlete, non-pregnant adults were included. The primary outcome was a Bland–Altman analysis, with bias, limits of agreement, and proportional bias extracted from each paper. The study quality was evaluated using the AXIS tool. Due to the high variability across studies, a meta-analysis was not performed. Results: BIA devices generally performed poorly against the 4C model estimates of percentage body fat and fat-free mass. Across the 12 studies, mean bias for percentage body…

Linked entities

Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.

Species1

Homo sapiens(human · species)

Chemicals5

Mo alcohol water deuterium SFB7

Diseases1

injury to

Figures4

Click any figure to enlarge with its caption.

Keywords

body compositionequivalenceBland–Altman

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBody Composition Measurement Techniques · Nutrition and Health in Aging · Electrical and Bioimpedance Tomography

Full text

1. Introduction

Body Composition Models

Body composition assessment is preferable to body mass index (BMI) in providing an estimation of body composition for more robust health and disease risk estimates. There is a range of body composition assessment techniques, including anthropometry, chemical analysis of tissue by chemical or in vivo neutron activation analysis, electrical impedance such as bioelectrical impedance analysis (BIA) or bioimpedance spectroscopy (BIS), ultrasound, and radiation imaging, computed tomography (CT), dual x-ray absorptiometry (DXA), and non-radiation imaging, i.e., magnetic resonance imaging (MRI) [1]. Many body composition techniques considered to be of high precision, e.g., MRI or CT, currently have limited clinical applicability, owing to their availability, cost, complexity in use, or radiation exposure [1].

Bioelectrical impedance analysis (BIA) is widely used to estimate body composition in commercial, clinical, and research settings, as it is non-invasive, cost-effective, safe, and does not require specialized training. There are many iterations of BIA machines, for example, single/multifrequency, supine/vertical, hand-to-hand, leg-to-leg, arm-to-leg, and whole-body/segmental configurations [2]. Commonly used body composition outputs from BIA include percentage body fat (%BF), fat mass (FM), and fat-free mass (FFM). These outputs are indirect estimates validated against other reference techniques, the most frequent being DXA, and the four-compartment (4C) model. However, while DXA is often used as a reference method to assess BIA devices, the 4C model is regarded as a criterion method [3].

The 4C model is a level 2 molecular model based on four distinct chemical components within the body, i.e., fat, water, minerals, and protein (Figure 1). Multiple 4C model equations have been derived by researchers using first principles, incorporating various assumptions, for example, the densities of tissues and the ratio of bone to non-bone masses [4]. The 4C model base equation expands on the 2C model of body mass (BM) = fat mass (FM) + fat-free mass (FFM) by providing better estimates of the FFM component, i.e., BM = FM + total body water (TBW) + bone mineral content (BMC) + residual. The base equation is rearranged so that the equations solve for FM and %BF. The popular 4C model equation by Wang et al. [5] is given as an example:

[eqn]

[eqn]

[eqn]

[eqn]

Each of the components in the 4C model—total body water, body volume, and bone mineral content—requires measurement by criterion methods. However, the methods used to measure these components can vary between studies. For example, body volume could be estimated by either hydrodensitometry (underwater weighing), plethysmography (i.e., Bod Pod), or, more recently, in ‘rapid’ 4C models, by the use of DXA-derived volume estimates [7,8]. Bone mineral density is estimated by DXA, for which there are different makes, models, and software versions. Total body water can be measured by isotope dilution or bioimpedance spectroscopy (BIS). This means there are at least twelve possible permutations (3 body volume × 2 total body water × 1/2 bone mineral content) of the 4C model methods per equation. Yet, there is a paucity of research comparing estimates of body composition from BIA machines to 4C criterion models, particularly in examining the effects of different 4C model equations and their component methodology.

This review investigated studies in which percentage fat mass and fat-free mass estimates from bioelectrical impedance analysis (BIA) machines were compared to those from a criterion four-compartment (4C) model. The primary aim was to evaluate the level of agreement observed between BIA devices and the 4C model using the Bland–Altman limits of agreement analysis for percentage fat mass and fat-free mass. This review addresses the gap in the literature regarding the variability in 4C model equations and techniques, as well as the different configurations of BIA machines.

2. Methods

The study was registered in PROSPERO (CRD42023266802).

One of the authors searched PubMed, Medline (via EBSCO), SCOPUS, and CINAHL databases from inception until 31 May 2025, limiting the search criteria to English language, healthy adults, and papers that included cross-sectional data. The search string was designed for PubMed and translated for use in other databases using the Polyglot Search Translator [9]. The complete search strings for all databases are in Supplementary Table S1.

2.1. Study Selection and Screening

Two review authors independently screened the titles and abstracts for inclusion against the inclusion criteria; one author retrieved full texts, and the two authors screened the full texts for inclusion. Any disagreements were resolved by discussion or referred to a third author. Data extraction was conducted using standardized forms, and statistical summaries and calculations (including Bland–Altman metrics) were cross-checked for accuracy.

2.2. Inclusion Criteria

Design: Data was included from cross-sectional studies and baseline data from longitudinal studies.

Participants: Adults ≥ 18 years of age, healthy, non-pregnant women, non-athletes, and mass stable.

Outcome: The primary outcome was the percentage of body fat or fat-free mass from a 4C model compared to the same measure from bioelectrical impedance analysis. Studies must have used BIA machines as the comparator, and the 4-compartment models must have used DXA to estimate bone mineral content but not used DXA to obtain body volume. Studies must have used Bland–Altman limits of agreement (LOAs) in their accuracy analysis. When studies did not report LOAs, LOAs were calculated from the reported mean difference and standard deviation or estimated, if possible, from published Bland–Altman plots. Some studies had insufficient data to compute the bias and standard deviation; attempts were made to obtain this data if the papers were published within a reasonable timeframe.

Date: No limit publication date.

Language: English only.

2.3. Exclusion Criteria

Studies on children, pregnant women, athletes, or studies in which total body water in the 4C model was measured by a BIA machine were excluded. Studies or data from studies that compared changes in 4C model outcomes to BIA after intervention were excluded unless they had a baseline comparison. Papers that used Bioelectrical Impedance Spectroscopy (BIS) machines as a comparator to the 4C model were excluded, given the differences between BIA and BIS devices. However, papers that used BIS devices for the estimation of total body water as part of the 4C model were included.

2.4. Data Extraction

Data extraction included participant demographics (age, sex, race, and health status), data fields relating to the BIA machine (make, model, frequency type, and configuration, information relating to the validation of equations), 4C criterion method (4C model used, techniques used for body volume, total body water, and bone mineral content) and statistical methods Bland–Altman Limits of Agreement (LOA). Data on proportional bias were extracted from LOA plots where these were available or from text.

2.5. Quality and Risk of Bias Assessment

Study quality was assessed independently by two authors using the Appraisal tool for Cross-Sectional Studies (AXIS), which is a series of twenty questions with a three-option response choice of ‘yes’, ‘no’, or ‘don’t know/comment’ [10]. Seven (Q1, 4, 10, 11, 12, 16, and 18) of the questions related to reporting quality; seven (Q 2, 3, 5, 8, 17, 19, and 20) of the questions related to study design quality; and six related to the possible introduction of biases in the study (6, 7, 9, 13, 14, and 15). Several questions relating to response rate (Q7, 13, 14) were excluded, as they were not relevant. The AXIS questionnaire does not provide an overall aggregate score. Given the importance of fluid and alcohol intake and strenuous exercise on hydration status, premeasurement restrictions on these factors were extracted from each study.

2.6. Primary Outcome Analysis

The primary endpoint was the individual comparability between the 4C model and the BIA machine for %BF or FFM (kg). The Bland–Altman approach, which plots the difference between methods against the mean of the two methods, was chosen, as it was the most frequently used analytical method. While other statistical methods were found within the included papers, the use of the Bland–Altman method as the primary outcome metric allowed comparability across the greatest number of studies. This was assessed using the estimates of bias (mean difference between BIA and 4C method) and the limits of agreement (LOAs) as set out by Bland and Altman [11,12]. The LOA provided an estimate of the upper and lower boundaries of the bias between the two methods. The smaller the LOA, the more accurate the estimate of the bias. The upper and lower LOAs are calculated on the mean difference between the 4C model and the BIA comparator (often referred to as bias); in some cases, the LOAs were estimated from the provided plots. Proportional bias, the slope of the regression line taken on the Bland–Altman plot was also considered; ideally, there should be no clinically relevant proportional bias.

A meta-analysis of the Bland–Altman data was deemed inappropriate given the significant heterogeneity of the studies, both with respect to the 4C model methodology and among the BIA devices with respect to machine manufacturer and type. Several authors have suggested guidelines for the use of the Bland–Altman test; these have been evaluated by Gerke et al. [13], with the recommendation to use the checklist of Abu-Arafeh et al. [14]. A modified checklist of Abu-Arafeh et al. [14] was used on the included trials in this review (see Supplementary Table S4).

3. Results

The electronic database search retrieved 491 records, with 262 duplicates subsequently removed, and a further 178 were excluded for not being relevant after reviewing the title and abstract. A total of 51 full-text articles were reviewed for eligibility (Figure 2). In two of these studies [15,16], the X-axis of the Bland–Altman plot contained the 4C model estimate and not the average value of the 4C model and the comparator BIA as specified by Bland–Altman. In one instance, this was a deliberate decision of the authors [16] based on the paper of Krouwer (2008) [17]; however, the argument for not using the average of the two comparator machines on the X-axis value has been disputed [18]. Consequently, a decision was made to exclude these two papers from the analysis to preserve homogeneity with the remaining papers included in the analysis. Therefore, a total of twelve studies that met the eligibility criteria were included in this review (Figure 1). A list of eligible articles and reasons for exclusion of papers is available in Supplementary Table S2.

For this analysis, data for %BF and FFM are reported where possible. While twelve studies were included in this review, some studies included more than one device or provided additional data based on gender or ethnicity; these permutations were treated as separate entries for the purposes of evaluation, giving thirty-four evaluations for percentage body fat from eleven studies, and twenty-nine evaluations for fat-free mass from seven studies.

Jebb et al. [19] investigated two different BIA machines, while Gibson et al. [20] investigated two different BIA models from the same manufacturer (Table 1). Bosy-Westphal et al. [21] tested a BIA machine to develop a prediction equation in a sample of German adults and then applied the equation to a multi-cultural validation sample in the USA (Table 1). Nickerson et al. [22] used one BIA machine while testing four separate body fat equations, providing data on the total sample as well as by sex (Table 1). In a separate study, Nickerson et al. [23] tested one BIA machine and provided an analysis on the total cohort, as well as men and women separately. Blue et al. [24] tested one BIA machine and provided data on different ethnic groups, but the Bland–Altman analysis was conducted only on the total group. Brandner et al. [25] investigated a BIA smartwatch and a multifrequency bioelectrical impedance analysis (MFBIA) machine; only data from the MFBIA machine was included. The Siedler et al. study [26] investigated 15 different BIA machines, most of which were for domestic use; we chose to include just the medical-grade BIA machine (Seca 515/514) that would be used in clinical or research studies. The papers by Gibson et al. [20] and Brewer et al. [27] were included after it was confirmed with the manufacturer that the InBody devices were BIA devices and not BIS devices, as the title of the Gibson paper suggests (email correspondence, 2nd August 2023).

3.1. Study Characteristics

Details of the included studies, information on the study populations, the 4C equation used, and the component methodology are provided in Table 1, including the types of BIA machines and their manufacturers.

3.2. Study Quality

The quality of the studies, as assessed by the AXIS questionnaire, is presented in Supplementary Table S3a,b. Overall, the quality of the studies was judged as adequate except for sample size justification and questions relating to statistical reporting, as described below. Only two of the included studies, Brandner et al. [25] and Blue et al. [24], provided statistically based sample size calculations, though some justification for the sample size was provided in several cases. With regard to premeasurement restrictions of fluid, alcohol, and strenuous exercise, minimal information was supplied by studies prior to 2013 (Supplementary Table S4).

3.3. Statistical Methods

Only studies that used the Bland–Altman agreement analysis for the assessment at the individual level were included in this review. Whilst the Bland–Altman analysis method is straightforward, there is a preferred methodology for using this assessment method. Several guidelines for the use of the Bland–Altman test have been evaluated by Gerke (2020) [13], who recommended the checklist of Abu-Arafeh et al. [14]. A modified checklist of Abu-Arafeh et al. [14] was used on the included trials in this review (see Supplementary Table S5). Assessment of proportional bias through a regression analysis on the plot data points provides additional information as to whether high or low estimates can skew agreement. Some studies reporting using the Bland–Altman method did not supply LOA plots. The Fuller et al. study [28] provided %BF for the 4C model but not the BIA machine, only providing the bias and LOA; some, e.g., Jebb et al. [19], provided only the bias data.

A variety of different statistical methods were reported for random error (precision/reliability) and systematic error or bias reflecting accuracy or validity. Test–retest reliability tests included calculating precision error, using intraclass correlation coefficients (ICC), least-significant change, and the root mean square coefficient of variation. Agreement between devices other than Bland–Altman analysis was assessed by multiple methods, including paired t-tests, Pearson correlation coefficient, Deming regression, root mean square error, concordance correlation coefficients (CCC), and equivalence testing using two one-sided t-tests (TOST).

3.4. 4C Model vs. BIA Results

3.4.1. Bland–Altman—Percentage Body Fat

The summary for the Bland–Altman tests for %BF comparison between BIA and the 4C model is in Table 2. The overall bias between the percentage of body fat recorded by the 4C model and the BIA machine could be considered low, ranging from −2.9% to 4.4% per cent. However, there are wide limits of agreement around the bias estimates in nearly all cases.

3.4.2. Bland–Altman—Fat-Free Mass

The summary for the Bland–Altman tests for FFM comparison between BIA and the 4C model is in Table 3. The bias between FFM recorded by the 4C model and the BIA machine could be considered low, ranging from −3.9 to 1.8 kg; however, there are wide limits of agreement around the bias estimates in all studies.

4. Discussion

While BIA machines have been tested against the 4C model numerous times, different criteria have been used to assess the validity of estimates of %BF and FFM from BIA to the 4C model. For our analysis, the use of Bland–Altman limits of agreement was specified as the primary criterion for assessing agreement between a BIA machine and a 4C model, as this was the most used statistical method across studies and allowed for assessment at the individual clinical level. The results of the Bland–Altman analysis comparison in this paper of cross-sectional studies show that at the individual level, estimates of %BF or FFM from the included BIA machines were not adequate estimators of these components compared to the criterion 4C model used (Table 2 and Table 3). The BIA machines tested in this review showed bias estimates ranging from relatively low to high, both in absolute and relative terms (Table 2 and Table 3). On average, the bias for %BF was greater than that for fat-free mass, both in absolute and relative terms. However, these biases were frequently accompanied by wide 95% limits of agreement, reducing their suitability for individual-level assessment of either measure. For clinical interchangeability, even a larger bias can be acceptable if the limits of agreement are narrow and consistently on one side of zero, as this allows for a straightforward correction. In contrast, a small bias with wide limits that cross zero provides little practical value. In those studies, with proportional bias estimates, proportional bias was observed only very occasionally.

Several critical issues need consideration when evaluating the significance of these findings. These are the complexities of the measurements made by the BIA machines and those made by the 4C model, and the method for assessing equivalence between the two methods.

4.1. BIA—Validation of Percentage Body Fat and Fat-Free Mass

A diverse array of BIA machines was used by the studies included in this review, differing in several important ways. Firstly, machines could differ in their physical design: from single-frequency, supine, whole-body estimations, using electrodes, to vertical, multiple-frequency, segmental estimations, using foot plates, handles, or handrails; or combinations of these factors (Table 1). Secondly, BIA machines do not directly measure %BF or FFM but rather provide estimates of these components based on validation studies against other body composition techniques used as reference standards. BIA devices may have used different validation methods, which themselves may have needed external validation. For example, if DXA was used as the reference standard, the BIA estimates of %BF or FFM would be limited to the generalization restrictions based on the DXA-specific validation sample characteristics, for example, age, sex, race, body composition, and health status. BIA validity against a 4C model could be confounded by the inherent limitations of the DXA reference model and validation sample. In this case, comparing BIA estimates of %BF and FFM to 4C model estimates may, in fact, be a de facto 4C model assessment of the DXA validation method. BIA machines can use unpublished proprietary regression-based equations in their estimates of TBW, body fat, FFM, or muscle [38]. Information on how these equations are adjusted for race, gender, and age may not be available. Issues can also arise with published equations to generate estimates of %BF or FFM from resistance or reactance data. For example, in the Nickerson et al. paper [22], four different BIA equations were tested in a sample of young men and women; three studies used an RJL device (Deurenberg [35], Chumlea [34], and Sun [37]); the other study used a Xitron 4000B, which is actually a BIS machine (Kyle [36]). The problem with published equations is that different BIA machines do not necessarily give the same raw reactance and resistance measurements [39,40,41]. When using published body composition equations, if the BIA machine being utilized is not using the same BIA machine as, or has not been validated against, the specific machine used in making these equations, then issues of accuracy will occur [42].

While the limitations concerning the accuracy and precision of BIA machines have been discussed [43,44], an often-unexplored issue is the accuracy and precision of the criterion 4C method used itself. Both sides, the comparator and the reference standard, need validation, as they both contribute to the total error observed.

4.2. 4C Model—More than One Model

The critical outcome of validation studies is whether an estimate of body composition from one method can be substituted for the same body composition estimate by another method. There are two key factors with respect to measurement instruments: the level of repeatability and accuracy. For a device to have good accuracy, good reliability is very important. Bland and Altman noted that if the criterion method has poor repeatability, then even if the comparator is perfect, the methods will not agree; if both methods have poor repeatability, any comparison will be very problematic [12,45].

If the precision of the 4C model used is unknown, there can be uncertainty as to what degree and on which side of the equation the error lies. There are multiple BIA machines and potential BIA-based equations that could be used to derive estimates of %BF and FFM. There are also multiple 4C model equations available for use, as well as multiple ways to measure them. All 4C equations are derived from first principles; an obvious first consideration is whether all 4C model equations are equivalent or whether there is a preferred 4C model and preferred component methodology.

Heymsfield et al. [4] examined the variability in %BF estimates from eleven different 4C model equations using the same raw data obtained from body volume measured by Bod Pod, bone mineral density by iDXA (GE Lunar), and total body water by deuterium dilution. The results showed minimal variation in percentage body fat between the models, ranging from 31.0 ± 1.0 to 32.7 ± 1.0%BF (Figure 3). Although the paper revealed instances of statistically significant proportional bias between models using Bland–Altman analysis, no limits of agreement for the eleven equations were provided. However, without verifying the limits of agreement, even small differences (bias) between methods do not guarantee that one equation can be substituted for another at the individual level. Additionally, there was no comparison of FFM across the eleven equations.

4.3. 4C Model—Methods Substitution

A related second important consideration is the effect of the methodological substitution of a component in a 4C model on estimates of %BF and FFM. While in the study of Heymsfield 2015 [4], all the equations utilized the same raw data obtained using identical component methodologies, this does not necessarily happen between different studies, as shown in Table 1. While the 4C model requires using criterion-based estimation of body volume, bone mineral density, and total body water, the actual methods used for each of these components can vary between studies, as discussed previously. Overall, we can assume there are at least six to twelve possible 4C model iterations, i.e., three body volume * two total body water * one to two bone mineral content for any 4C model equation used (Figure 4). There is also a push for rapid 4C model methodology, using DXA to estimate both body volume and bone mineral content, and BIS to estimate total body water instead of isotope dilution [4,56]. While rapid 4C model methodology may reduce the possible iterations to four, there is a question as to whether some validation against a ‘reference standard’ 4C model is required.

An example of method substitution on the outcomes using the 4C model is with total body water. Total body water can be estimated by isotope dilution or, more recently, for logistical reasons, using the simpler BIS method, which in recent publications has often been obtained using the SFB7 (ImpediMed) device. What would be the effect on the 4C model body composition estimates when SFB7 is used to calculate TBW data instead of isotope dilution?

Blue et al. [24] reported on %BF (and FFM kg) in several ethnic groups using the criterion 4C model of Wang (2002), in which TBW was measured both by deuterium dilution and an SFB7 device. The BIS 4C model estimate of BF% in the various ethnic groups was assigned a Heyward and Wagner rating [57] ranging from very good to excellent-ideal, based on the assessment of the standard error of the estimate (SEE). The Bland–Altman analysis of the same total sample comparing the BIS 4C model estimate of %BF to the criterion 4C model using deuterium dilution showed a bias of −1.5% with LOAs of −5.1% to 2.4%, with a statistically significant proportional bias, R^2^ = 0.1492, p < 0.001.

In a study of collegiate athletes, the 5C model of Wang et al. [5] using deuterium dilution to measure total body water (TBW) was compared to two BIS machines, one of which was an SFB7 machine, as well as an MFBIA machine [58]. Firstly, when comparing estimates of TBW per se according to Bland–Altman analysis, the MFBIA machine was the better performer, with a bias of −1.14 L and LOAs of −11.55 to 9.27 L, compared to the SFB7 BIS machine, which had a bias of −1.78 L and LOAs of −16.21 to 12.65 L. Although both devices had very wide LOAs, TOST analysis with 5% equivalence of the TBW estimates found the MFBIA machine was equivalent to the criterion deuterium dilution method, but not the SFB7. Estimates of the 5C model fat mass, using either BIA or SFB7 to estimate TBW in the equation, saw the MFBIA machine having a bias of 0.42 kg and LOAs of −6.85 to 7.69 kg, and the SFB7 BIS machine having a bias of 2.43 kg and LOAs of −18.15 to 23.01 kg. An analysis of fat mass estimates by TOST with 5% equivalence found that none of the machines were equivalent to the criterion method. The authors concluded that the substitution of methods in criterion models could be problematic [58].

In a 4C study in a diverse group of Australian adults, where TBW was once again estimated by deuterium dilution and SFB7, the Bland–Altman analysis between the two techniques gave a bias of 2.53 L and LOAs between −5.92 to 7.07 L, with no proportional bias [59]. When the SFB7 data was substituted into a 4C model (Withers 1998 equation [52]), the bias for fat mass was 1.87 kg with LOAs of −5.16 kg to 4.38 kg, and for fat-free mass, the bias was 1.87 kg, and LOAs were −4.38 kg to 5.16 kg [59].

The most commonly cited paper for SFB7 validation of TBW estimation is that of Moon et al. [60], where the SFB7 was compared to deuterium dioxide in twenty-eight (presumably healthy) young Caucasian men and women aged between 19 and 35 years (mean 24 ± 4 years). For the men, the mean BMI was about 26.0, while for the women, the BMI was low at 20.7. While the bias between SFB7 and deuterium dioxide was small (all subjects −0.09 L, males −0.8 L, females 0.62 L), Bland–Altman analysis of the LOAs was relatively large (all subjects −4.5 to 4.31 L, males −6.11 to 4.49 L, and females −2.20 to 3.45 L). Similar results were seen in another study comparing SFB7 to deuterium dioxide isotope dilution in resistance-trained males, with a bias of only −0.48 L observed, but with LOAs of −5.57 to 5.09 L that were not insignificant [61]. These findings suggest that SFB7 cannot be used in place of isotope dilution for the estimation of TBW at the individual level, at least in ‘healthy’ individuals, in validation studies.

Five of the eleven papers included in this review used BIS to measure total body water; four utilized the SFB7 device, and these four papers were four of the five most recent papers included in this review (Table 4). These results imply that using the SFB7 device in 4C models used in validation studies may introduce a potential source of error, meaning any lack of agreement between the 4C model and the BIA machine used could be in part owing to the 4C model methodology used itself, rather than inherent issues with the BIA machine. The use of the SFB7 device to calculate TBW makes comparison to studies using isotope dilution difficult.

4.4. Beyond the 4C Model

Despite the overall poor concordance between the studied BIA machines and the 4C model for estimating %BF and FFM in this review, it is difficult to assess the clinical worth of a BIA machine without evaluating all its critical outputs as discussed.

Owing to these issues, researchers and clinicians have sought to explore the use of the primary outputs of BIA machines, i.e., reactance and resistance, to assess either aspects of body composition or physiological function, such as grip strength or mortality risk [62]. BIA estimates of total body water (TBW), extracellular (ECW), and intracellular water (ICW) can be derived against isotope criterion methods, such as deuterium dilution [63]. The ratio of ECW/TBW and ECW/ICW can also provide useful information on the health status of individuals, with the latter providing particular insight into the health of the muscle component of the body [64].

There is also increasing interest in using another BIA data output, i.e., phase angle. A phase angle provides information on the health of cellular membranes and has been studied with respect to several health outcomes across a range of populations, including muscular health [65,66,67,68,69]. Even here, greater granularity may be needed, as individuals with similar PhA values can have markedly different fluid volumes or %BF [70]. Phase angles can be incorporated into a more in-depth analysis using bioelectrical impedance vector analysis (BIVA) or specific BIVA, though the latter requires the measurement of several body circumferences. Phase angle estimates are again machine-dependent, and the use of BIVA requires population-specific normative data, which is currently limited. However, the BIA International Database project aims to build a multi-ethnic dataset of BIA raw measures using data from multiple countries [71]. This initiative will hopefully provide a substantial normative database for researchers and clinicians.

An additional advantage of many BIA devices is their ability to provide regional body composition estimates, something not quantified by a 4C model. Knowing the size, location, and quality of skeletal muscle, as well as adipose tissue, should be more informative at the individual patient level than whole-body estimates of fat mass and fat-free mass [72]. BIA manufacturers should be encouraged in the evolution of their machines to include additional criterion methods, such as MRI or CT, for total and regional estimates of adipose tissue, skeletal muscle, and muscle quality. There have been large-scale body composition data acquisition projects using MRI, e.g., UK Biobank. MRI can be used to assess skeletal muscle and adipose tissue, both whole body and regionally, and to evaluate muscle ectopic fat deposition.

5. Limitations

This systematic review has only dealt with cross-sectional studies in adults designated as healthy, though information on health status was often lacking. This review excluded important population groups, such as athletes or those with chronic disease, given the aim to establish the equivalence between BIA devices and the 4C model in a population with minimal confounding. This review also did not include longitudinal studies. The utility of a body composition device to accurately detect meaningful change either to a treatment intervention or just to monitor disease or health status longitudinally is clinically highly relevant. Most BIA device manufacturers have proprietary equations for their body composition estimates. For many of the included devices in this study, the knowledge regarding their validation methods and cohort demographics is very limited. Premeasurement protocols, which help control for the effects of heavy exercise and water and alcohol intake on hydration status, were inconsistently reported between studies (see Supplementary Table S4). It is difficult to quantify the potential influence of different measurement protocols on the performance of either the BIA device or the 4C model.

6. Conclusions

This review compared a combination of BIA machine types and several permutations of the 4C model, varying by equation and methods. The Bland–Altman analysis, with few exceptions, saw low bias coupled with high limits of agreement, indicating acceptable use of BIA at the population level but not at an individual level when using 4C criterion estimates of %BF and FFM. Over the period of the papers selected for this review, there have been significant changes not only in BIA technology, but more recent validation studies have far more precision and validity statistics than earlier papers.

There are several issues concerning both the methodology in BIA machines and that used in 4C model studies, the doubly indirect estimates of %BF and FFM in BIA, and the choice of equation and impact of component substitution in the 4C model. For validation and research applications, isotope-based measures of total body water remain preferable to BIS-derived substitutions. Conceptual issues arise from the use of a more accurate estimation of a 2C model when using the 4C model, while modern body composition assessment requires regional and total estimates of muscle and adipose tissue, as well as an estimate of muscle quality.

Greater emphasis may be placed on raw impedance measures, such as resistance, reactance, and phase angle, which may offer clinically meaningful information independent of proprietary prediction equations. The use of BIA raw data to assess health holds promise. However, the lack of generalizable raw data across BIA machines is a significant shortcoming of this technology. Regardless, the clinical usefulness of BIA should not be defined solely on estimates of fat mass or fat-free mass and needs further improvement to include more comprehensive clinical evidence from future studies.

Bibliography72

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1Thomas D.M. Crofford I. Scudder J. Oletti B. Deb A. Heymsfield S.B. Updates on methods for body composition analysis: Implications for clinical practice Curr. Obes. Rep.202514810.1007/s 13679-024-00593-w 39798028 · doi ↗ · pubmed ↗
2Dupertuis Y.M. Jimaja W. Beardsley Levoy C. Genton L. Bioelectrical impedance analysis instruments: How do they differ, what do we need for clinical assessment?Curr. Opin. Clin. Nutr. Metab. Care 20252837938710.1097/MCO.000000000000114240667712 PMC 12337901 · doi ↗ · pubmed ↗
3Laforgia J. Dollman J. Dale M.J. Withers R.T. Hill A.M. Validation of DXA body composition estimates in obese men and women Obesity 20091782182610.1038/oby.2008.59519131939 · doi ↗ · pubmed ↗
4Heymsfield S.B. Ebbeling C.B. Zheng J. Pietrobelli A. Strauss B.J. Silva A.M. Ludwig D.S. Multi-component molecular-level body composition reference methods: Evolving concepts and future directions Obes. Rev.20151628229410.1111/obr.1226125645009 PMC 4464774 · doi ↗ · pubmed ↗
5Wang Z. Pi-Sunyer F.X. Kotler D.P. Wielopolski L. Withers R.T. Pierson R.N.Jr. Heymsfield S.B. Multicomponent methods: Evaluation of new and traditional soft tissue mineral models by in vivo neutron activation analysis Am. J. Clin. Nutr.20027696897410.1093/ajcn/76.5.96812399267 · doi ↗ · pubmed ↗
6Wang Z.M. Pierson R.N.Jr. Heymsfield S.B. The five-level model: A new approach to organizing body-composition research Am. J. Clin. Nutr.199256192810.1093/ajcn/56.1.191609756 · doi ↗ · pubmed ↗
7Wilson J.P. Mulligan K. Fan B. Sherman J.L. Murphy E.J. Tai V.W. Powers C.L. Marquez L. Ruiz-Barros V. Shepherd 47J.A. Dual-energy X-ray absorptiometry-based body volume measurement for 4-compartment body composition Am. J. Clin. Nutr.201295253110.3945/ajcn.111.01927322134952 PMC 3238462 · doi ↗ · pubmed ↗
8Nickerson B.S. Esco M.R. Bishop P.A. Kliszczewicz B.M. Park K.S. Williford H.N. Validity of four-compartment model body fat in physically active men and women when using DXA for body volume Int. J. Sport. Nutr. Exerc. Metab.20172752052710.1123/ijsnem.2017-007628787184 · doi ↗ · pubmed ↗