Investigation of the inter-rater reliability of three different plaque indices used in patients with fixed orthodontic appliances
Christina Erbe, Teresa Temming, Daniela Ohlendorf, Irene Schmidtmann, Priscila Ferrari-Peron, Ambili Mundethu, Heinrich Wehrbein, Sameh Attia, Sameh Attia, Sameh Attia

TL;DR
This study compares how consistently different plaque indices can be used by orthodontic evaluators with varying experience levels.
Contribution
The study evaluates inter-rater reliability of three plaque indices in orthodontic patients, revealing the impact of evaluator experience and index choice.
Findings
The Attin and mBB indices showed higher inter-rater reliability than the TQH index.
Orthodontic experience did not significantly affect reliability of plaque index assessments.
Calibration of raters is recommended to improve consistency in oral hygiene classification.
Abstract
To analyze the inter-rater reliability of three different plaque indices with regard to raters’ orthodontic experience. The study analyzed 50 photographs of patients with maxillary and mandibular multibracket appliances (MB), captured via Digital Plaque Imaging Analysis (DPIA) for plaque assessment. Three indices - the modified Turesky index (TQH index), Attin index, and modified bonded bracket index (mBB index) were used. Fourteen evaluators with varying orthodontic experience levels (four with limited, five with moderate, and five with extensive experience) assessed the images. The highest agreement among the evaluators in terms of ICC was obtained using the Attin index and the mBB index. The TQH index yielded the poorest agreement among evaluators. Orthodontic experience had no significant effect on inter-rater reliability. The evaluators with little orthodontic experience scored…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDental Radiography and Imaging · Orthodontics and Dentofacial Orthopedics · Oral microbiology and periodontitis research
Introduction
In Germany, the Fifth Oral Health Study (Deutsche Mundgesundheitsstudie DMS V) published in 2014 found a decrease in caries in the population. The prevalence of tooth loss is expected to decrease by 72% from 1997 to 2030 [1]. However, in patients treated with multibracket appliances (MB), an increase in plaque accumulation and gingivitis was observed [2,3]. The irregular surface of the MB appliance restricts the physiological cleaning function of oral muscles and saliva [4]. Furthermore, studies have shown that patients with MB appliances exhibit altered microbial flora [5–7]. Particular predilection sites for plaque accumulation include the areas behind arch wires [8], areas under bands of washed-out cementum [9], composite surfaces adjacent to the bracket base, the areas under the wings and in the slots of brackets, and composite-enamel interfaces [10]. Persistent plaque accumulation may result in the formation of demineralization, so-called white spot lesions [11], and the development of gingivitis [12].
Plaque indices serve as crucial tools for quantifying plaque accumulation and are widely employed in research to evaluate the effectiveness of oral hygiene products in various scientific studies [13–16]. In clinical practice, they also play a role in motivating patients to enhance their oral hygiene routines [17]. In patients with MB appliances, plaque tends to accumulate in different predilection sites compares to patients without MB appliances. Current studies focus on computer-assisted plaque assessment using planimetric quantification by plaque image analysis as an accurate, truly quantitative method [18,19]. However, these techniques are costly and necessitate complex equipment. In addition, a recent study showed conflicting results in accurately quantifying plaque in patients with MB [20]. In contrast, conventional plaque indices are quick and easy to use. However, few studies have specifically investigated the application of dental plaque indices in this population [21]. Furthermore, there is a scarcity of data on the reliability and reproducibility of orthodontic plaque indices in clinical research settings [21], and direct comparisons between standard dental plaque indices and those tailored for orthodontic patients remain infrequent [21].
The aim of this study was to investigate the inter-rater reliability of three different plaque indices used in orthodontics in patients with MB. In addition, the effect of the orthodontic experience of the raters on the results was analyzed. Each plaque index represented a plaque index system.
Materials and methods
Study design and subject matter
This study utilized a set of n = 50 photographs (accessed in June 2020 for research purposes, with no identifying patient information) captured using Digital Plaque Imaging Analysis (DPIA), a computer-assisted photographic analysis method which provides swift data collection and excellent reproducibility, as described by Klukowska et al [3]. The frontal and lateral intraoral photographs of the vestibular and lingual teeth surfaces were taken with a Nikon D80 reflex camera (Nikon Corporation, Tokyo, Japan), using cheek retractors and corresponding mirrors. Each photograph represented a distinct subject, resulting in a total of 50 subjects with a mean age of 13.9 ± 1.98 years. All subjects had previously been examined at the Department of Dentofacial Orthopedics and Orthodontics, University Medical Center of the Johannes Gutenberg-University, Mainz (Rhineland-Palatinate, Germany), and were fitted with a MB appliance in the both maxilla and mandible at the time of the study. Ethical approval was granted by the Freiburg Ethics Committee International (Baden-Württemberg, Germany) under feki code 07/2113. Detailed inclusion and exclusion criteria are available upon request from the Freiburg Ethics Committee. All subjects and their legal guardians provided verbal and written informed consent.
Survey of plaque indices
In this study, the following plaque indices were examined as a representative of a plaque index system:
- The TQH index is recognized as the international standard plaque index and is one of the most commonly used plaque indices in dental studies [22–25]. The Turesky index, along with the Quigley-Hein index, has also been frequently used in orthodontic studies [26–29]. The TQH index represents a modified dental plaque index. The classification of vestibular tooth surfaces of the TQH index were modified according to Cugini et al. [30] and divided into three regions (mesial, central, distal). Plaque accumulation was classified into the following assessment grades [24]:
grade 0 - no plaque,grade 1 - single plaque islands along the gingival margin,grade 2 - thin plaque line (≤ 1 mm) along the gingival margin,grade 3 - plaque line > 1 mm, plaque covers ≤ 1/3 of the tooth surface,grade 4 - plaque covers ≤ 2/3 of the tooth surface,grade 5 - plaque covers > 2/3 of the tooth surface.
- The Attin index was developed specifically for orthodontic patients with MB [31]. The development of the Attin index focused on the application of the plaque index in everyday clinical practice. This plaque index considers the predilection sites for plaque accumulation in patients with MB - the mesial, distal, and cervical areas of the bracket. Plaque accumulation was classified into the following assessment grades:
grade 0 - no visible plaque,grade 1 - plaque islands on the proximal surfaces,grade 2 - in addition to the proximal surfaces, plaque islands cervical of the bracket,grade 3 - plaque covered > 1/3 of the surface cervical of the bracket.
- The mBB index according to Delaurenti et al. [32] combines a dental and an orthodontic plaque index. The evaluation grades, adapted from the OHI according to Greene and Vermillion [33], represent the dental component and the four-part classification of the tooth surfaces (incisal, distal, mesial, gingival) according to Williams et al. [34] corresponds to the orthodontic component. Plaque accumulation was classified into the following assessment grades:
grade 0 - no plaque,grade 1 - plaque covers ≤ 1/3 of the tooth surface,grade 2 - plaque covers ≤ 2/3 of the tooth surface,grade 3 - plaque covers > 2/3 of the tooth surface.
Plaque indices were collected by n = 14 evaluators using the DPIA photographs. The vestibular tooth surfaces of all twelve anterior teeth (six maxillary, six mandibular) were examined. The evaluators had varying levels of orthodontic experience. At the time of the study, n = 10 assistant dentists were undergoing further orthodontic training and n = 4 evaluators were orthodontic specialists at the Department of Orthodontics of the Johannes Gutenberg University Mainz.
The evaluators were divided into three groups according to
little (≤ 1 year),moderate (≥ 2 years to ≤ 5 years), andextensive (> 5 years) orthodontic experience.
Accordingly,
n = 4 raters had little orthodontic experience,n = 5 raters had moderate experience, andn = 5 raters had extensive orthodontic experience.
Each rater assessed the plaque accumulation of all n = 50 subjects and collected each of the three plaque indices once. All plaque indices were collected within a timeframe of two weeks, and the examinations were carried out at intervals of at least two days. All raters received a written description of the plaque indices in addition to a pictorial explanation for the evaluation.
Statistical analysis
In this study, the results of plaque indices were summarized in Excel 2013 (Microsoft Office 2013, Microsoft Corporation, Redmond, WA, USA). R 4.4.1 (The R Foundation for Statistical Computing, 2016, download: https://cloud.r-project.org) was used in performing the statistical analysis. Conversion of all plaque values to percentage plaque values was achieved as follows:
The inter-rater reliability related to the classification of the subjects into oral hygiene categories was determined with Fleiss’ Kappa for multiple raters [35] and interpreted according to Landis and Koch [36]. For this analysis, the percentage plaque values of the four plaque indices were classified into the oral hygiene categories according to Lange [37]. Inter-rater reliability using the percentage plaque values was analyzed using the intra-class correlation coefficient (ICC) [38]. In this study, the ICC (2.1) was applied and interpreted according to Cicchetti [39]. The ICC was determined with and without consideration of the orthodontic experience of the raters.
Results
Fleiss’ Kappa showed moderate agreement among the raters for all three plaque indices according to Landis and Koch with regard to the classification of the subjects in the same oral hygiene category (Table 1). The evaluators rated the oral hygiene of the subjects with the TQH and Attin indexes somewhat more consistently than with the mBB index.
Table 1: Evaluation of the consistency of the oral hygiene categories among raters using Fleiss’ Kappa with 95% confidence intervals.
The raters achieved excellent agreement with the Attin index in relation to the congruence of the plaque values according to the Cicchetti interpretation. With the TQH and mBB index, there was good agreement among the evaluators (Table 2).
Table 2: Evaluation of the consistency of plaque values among raters without considerations of their orthodontic experience with the ICC.
In this study, the Attin index was found to be most appropriate for the raters with little orthodontic experience. For the raters with moderate orthodontic experience, there was no significant difference between the plaque indices. For the raters with a lot of orthodontic experience, the Attin index and the mBB index were found to be the most appropriate (Table 3).
Table 3.: Evaluation of the consistency of plaque values among raters with consideration of their orthodontic experience with the ICC.
Discussion
The use of plaque indices has been investigated less frequently in orthodontic studies than in dental studies. In 2010, Raggio et al. [40] analyzed the Silness and Löe index and the Turesky index concerning reliability and discriminatory ability in a dental study. Furthermore, Eaton et al. [41] and Kingman et al. [42] investigated the reliability of the Silness and Löe index. Quirynen et al. [43] tested the discriminatory ability of the Quigley-Hein index and other plaque indices in 1991. Matthijs et al. [44] specified the intra-rater reproducibility of the Turesky index and the Navy plaque index according to Elliott et al. [45] in 2001. Marks et al. [46] also analyzed the Turesky index for reliability and reproducibility in 1993. Few recent studies have examined the reliability of conventional plaque indices [47–50]. Paschos et al. [21] are among the few authors who investigated the reliability of plaque indices in subjects with MB.
The results of our study show that the consistent classification of subjects into the same oral hygiene category by multiple raters using a plaque index proved difficult. The classification of plaque into subdivided oral hygiene categories is of particular clinical importance. Oral hygiene categories provide an indication of a patient’s oral hygiene status and help classify patients into an appropriate prophylaxis program. In daily clinical practice, the classification of patients into a prophylaxis program to improve or maintain oral hygiene is more important than a precisely determined plaque score. The calibration of multiple raters plays a minor role in the collection of plaque indices in practice as opposed to studies [51]. However, the present study showed that the subjects were classified into the same oral hygiene category by the raters with insufficient agreement. Consequently, the calibration of raters in practice may lead to a more consistent classification of patients into the same oral hygiene category. The correct classification of patients into the appropriate oral hygiene category is relevant in clinical practice during orthodontic treatment with MB, as it helps to maintain or improve oral hygiene in the best possible way. A disadvantage of the classification of plaque values into oral hygiene categories was posed by the category boundaries. These resulted in two plaque indices being less consistent than was the case in reality.
The inter-rater reliability of the TQH index compared to the mBB index was better than the subsequent analysis with the ICC. Due to the higher number of scoring levels of the TQH index, the probability of agreement between the subjects’ oral hygiene categories was higher with the TQH index than with the mBB index. This may be the reason why, compared to the mBB index, the TQH index obtained a better result in the analysis of inter-rater reliability with Fleiss’ Kappa than in the analysis with the ICC.
The results of the ICC showed that an orthodontic plaque index (Attin index) and a combined dental and orthodontic plaque index (mBB index) were more appropriate for the raters than a modified TQH index. Compared with the three-part vertical division of tooth surfaces of the TQH index, the four-part division of the mBB index possibly allowed a better assessment of plaque accumulation. In addition, the results indicated that the mBB index was easier to collect with a lower number of assessment grades than the TQH index. However, this meant that the mBB index lost precision. It should be noted that in this study the ICC was diminished by low variability in the subject population due to the inclusion criteria [52]. Orthodontic experience showed no significant effect on inter-rater reliability. The results of our study suggest that the number of dental surfaces to be evaluated may have played a role. The Attin index, which assessed the entire tooth surface, was apparently easier to collect for raters with little orthodontic experience than the mBB and TQH indexes, which assessed multiple tooth surfaces.
In 2012, Hefti and Preshaw [53] pointed out the importance of training and calibration of raters when collecting a plaque index in clinical trials. The authors also mentioned appropriate procedures in this regard [53]. In this study, the inter-rater reliability of plaque indices was evaluated without training and/or calibrating the raters. It can be assumed that training and calibrating of the raters would lead to better inter-rater reliability. Consequently, training and calibration are of great importance when collecting conventional plaque indices in studies. Before the start of a planned study, it must always be ensured that the raters are trained and calibrated.
Numerous studies on the reliability of plaque indices in dentistry were found in the literature [49,50,54]. However, there are only a few studies on the reliability of plaque indices in orthodontics. In 2014, Paschos et al. [21] analyzed the use of four plaque indices in subjects with MB. The study examined the orthodontic Attin index, the Modified Orthodontic Plaque Index according to Paschos et al., the Quigley-Hein index, and the Navy Plaque Index according to Clemmer and Barbano [55]. The Quigley-Hein index represented a dental plaque index, as did the TQH index in this study. The Modified Orthodontic Plaque Index, similar to the mBB index in this study, consisted of a combination of a dental and an orthodontic plaque index. The results of Paschos et al. were comparable to the results of our study. The study by Paschos et al. also showed better reliability for the Attin index and the Modified Orthodontic Plaque Index than for the Quigley-Hein index. The study by Paschos et al. further examined the plaque indices in terms of orthodontic experience. Their study found that the Quigley-Hein index as a dental plaque index was more sensitively related to educational level than an orthodontic or combined dental and orthodontic plaque index. The results of our study and the study by Paschos et al. were consistent with the literature in that dental plaque indices are inappropriate for patients with MB. In 2001, Matthijs et al. [44] also pointed out difficulties in collecting the Turesky index. The aforementioned authors investigated the intra-rater reliability of the Turesky index in dental subjects. The study discussed the assumption that the raters changed the criteria for collecting the plaque index between two measurements. Matthijs et al. [44] mentioned the scoring levels of the Turesky index as a reason. The inter-rater reliability reported here is consistent with previous paediatric research, with an ICC of 0.68–0.88 and 0.48–0.77 for n = 2 raters [56]. In a study by Marks et al. [46], the Turesky index collected from n = 11 raters yielded an ICC of 0.70 [46]. The reason for the slightly better correlation compared with our study may have been the extensive training and calibration procedures of the raters. In addition, both studies were conducted on dental subjects.
In orthodontic studies, the Silness and Löe index is the most commonly used plaque index [57]. Like other plaque indices for orthodontic patients, this index was modified by Williams et al. [34]. Unlike all other conventional plaque indices, the Silness and Löe index assesses the thickness of the plaque. Silness and Löe [58] stated that the method of choice for distinguishing grade 1 and 2 of the Silness and Löe index is the use of a probe. Consequently, the collection of the Silness and Löe index with photographs is limited. In addition, the collection of this plaque index by multiple raters is complicated because the use of a probe results in damage to the plaque film [41]. For these reasons, the mBB index is a good alternative to the Silness and Löe index for scientific studies. In daily clinical practice, the focus is on the quick, easy application of plaque indices. Especially in patients with MB, the evaluation of oral hygiene and the motivation of the patient are of outstanding importance. Based on the results of this study, we recommend the use of the Attin Index, especially for everyday practice. The Attin index was developed specifically for daily use in the clinic and not for epidemiological and experimental studies [31]. In our study, the Attin index achieved comparable results to the mBB index in the evaluation of inter-rater reliability. In contrast to the mBB index, a plaque value per tooth is collected. This simplifies the practical application. The Attin index is thus well suited for orthodontic practice in patients with MB.
Conclusion
The mBB index used as a combined dental and orthodontic plaque index, and the Attin index used as an orthodontic plaque index, achieved good agreement among the raters. Consequently, we recommend these two plaque indices for determining plaque accumulation in patients with MB. The TQH index, the international standard plaque index, achieved the poorest agreement among raters and is therefore not suitable for use in patients with MB. In principle, when using conventional plaque indices, special attention should be paid to the training and calibration of the evaluators in advance.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Jordan AR, Stark H, Nitschke I, et al. Epidemiological trends, predictive factors, and projection of tooth loss in Germany 1997-2030: part I. missing teeth in adults and seniors. Clin Oral Investig. 2021;25:67–76.10.1007/s 00784-020-03266-9PMC 778554033219875 · doi ↗ · pubmed ↗
- 2Chen I, Chung J, Vella R, Weinstock GM, Zhou Y, Jheon AH. Alterations in subgingival microbiota during full-fixed appliance orthodontic treatment—A prospective study. Orthod Craniofac Res. 2022;25:260–8. doi: 10.1111/ocr.12534 34538018 · doi ↗ · pubmed ↗
- 3Klukowska M, Bader A, Erbe C, Bellamy P, White DJ, Anastasia MK, et al. Plaque levels of patients with fixed orthodontic appliances measured by digital plaque image analysis. Am J Orthod Dentofacial Orthop. 2011;139(5):463–70. doi: 10.1016/j.ajodo.2010.05.019 21536188 · doi ↗ · pubmed ↗
- 4Karabekiroğlu S, ÜnlüN, Küçükyilmaz E, Şener S, Botsali MS, MalkoçS. Treatment of post-orthodontic white spot lesions with CPP-ACP paste: A three year follow up study. Dent Mater J. 2017;36:791–7. doi: 10.4012/dmj.2016-228 28835597 · doi ↗ · pubmed ↗
- 5Reichardt E, Geraci J, Sachse S, et al. Qualitative and quantitative changes in the oral bacterial flora occur shortly after implementation of fixed orthodontic appliances. Am J Orthod Dentofac Orthop. 2019;156:735–44.10.1016/j.ajodo.2018.12.01831784007 · doi ↗ · pubmed ↗
- 6Hodges K, Famuliner P, Kingsley K, et al. Oral Prevalence of Selenomonas noxia Differs among Orthodontic Patients Compared to Non-Orthodontic Controls: A Retrospective Biorepository Analysis. Pathogens; 13. Epub ahead of print 8 August 2024. doi: 10.3390/pathogens 13080670 PMC 1135760339204270 · doi ↗ · pubmed ↗
- 7Yañez-Vico R, Iglesias-Linares A, Ballesta-Mudarra S, Ortiz-Ariza E, Solano-Reina E, Perea E-J. Short-term effect of removal of fixed orthodontic appliances on gingival health and subgingival microbiota: a prospective cohort study. Acta Odontol Scand. 2015;73:496–502. 25631494 10.3109/00016357.2014.993701 · doi ↗ · pubmed ↗
- 8Mei L, Chieng J, Wong C, Benic G, Farella M. Factors affecting dental biofilm in patients wearing fixed orthodontic appliances. Prog Orthod. 2017;18:4. doi: 10.1186/s 40510-016-0158-5 28133715 PMC 5276803 · doi ↗ · pubmed ↗
