The Reliability of the Tonnis Grading System in Patients Undergoing Hip Preservation. The American Journal of Sports Medicine. Pullen, W. M., Carreira, D. S., Wong, I., Aoki, S. K., Lynch, T. S., Mather, R. C., Ayeni, O. R., Byrd, J. W., Safran, M. R. 2023: 3635465221147055

Abstract

BACKGROUND: The presence of pre-existing osteoarthritis (OA) has been associated with poor results after hip arthroscopic surgery. There is limited evidence validating the currently available grading systems of hip OA in patients undergoing hip preservation.

PURPOSE/HYPOTHESIS: Our purpose was to evaluate the interobserver and intraobserver reliabilities of 2 grading systems in a group of patients undergoing hip preservation: the Tonnis grading system and a simple 4-choice Likert scale. The hypothesis was that interobserver and intraobserver reliabilities using the Tonnis grading system would be poor among surgeons experienced in hip preservation and that a 4-choice Likert scale would be more reliable.

STUDY DESIGN: Cohort study (diagnosis); Level of evidence, 3.

METHODS: A total of 100 hip radiographs were reviewed by 8 experienced hip preservation surgeons. Overall, 2 rounds of reviews were performed, at least 3 weeks apart, assessing for the presence, degree, and/or location of joint space narrowing, joint space asymmetry, subchondral cysts, osteophytes, and sclerosis. The radiographs were assigned a Tonnis grade as well as a Likert grade of OA, reported as none, mild, moderate, or severe. Statistical analysis was conducted to provide Fleiss kappa values with 95% CIs. Agreement was classified as poor for <0.00, slight for 0.00-0.20, fair for 0.21-0.40, moderate for 0.41-0.60, substantial for 0.61-0.80, and almost perfect for >0.80.

RESULTS: A total of 50 patients (28 female and 22 male) with a mean age of 42.8 ± 14.2 years (range, 19-70 years) were reviewed. The Tonnis grade demonstrated an interobserver kappa value of 0.30 (95% CI, 0.26-0.34). The Likert grade demonstrated an interobserver kappa value of 0.33 (95% CI, 0.28-0.37). All other measures demonstrated interobserver kappa values classified as slight or fair, except for subchondral cysts, for which agreement was moderate. Intraobserver reliabilities were statistically significantly higher than interobserver reliabilities. Intraobserver reliabilities for the Tonnis grade (kappa = 0.55 [95% CI, 0.51-0.60]) and the Likert grade (kappa = 0.59 [95% CI, 0.55-0.63]) were similar, consistent with moderate agreement. Subchondral cysts demonstrated the strongest interobserver (kappa = 0.53) and intraobserver (kappa = 0.85) reliabilities.

CONCLUSION: Interobserver and intraobserver reliabilities were fair and moderate, respectively, for grading OA. Given the limited interobserver reliability, caution should be used when interpreting and translating studies that utilize the Tonnis grade or other rating to dictate treatment algorithms.
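As an illustration of the statistic named in the methods, the following is a minimal Python sketch of the standard Fleiss kappa formula together with the agreement bands quoted above. It is not the authors' analysis code, and it does not compute confidence intervals. The function names `fleiss_kappa` and `classify_agreement` and the small ratings table are hypothetical and are not data from the study.

```python
from typing import List


def fleiss_kappa(counts: List[List[int]]) -> float:
    """Fleiss kappa for a subjects-by-categories table of rating counts.

    counts[i][j] is the number of raters who placed subject i in category j.
    Every subject must be rated by the same number of raters (at least 2).
    """
    n_subjects = len(counts)
    n_raters = sum(counts[0])
    if n_raters < 2 or any(sum(row) != n_raters for row in counts):
        raise ValueError("each subject needs the same number (>= 2) of ratings")

    # Observed agreement: mean proportion of agreeing rater pairs per subject.
    p_bar = sum(
        (sum(c * c for c in row) - n_raters) / (n_raters * (n_raters - 1))
        for row in counts
    ) / n_subjects

    # Chance agreement from the overall share of ratings in each category.
    total = n_subjects * n_raters
    p_e = sum((sum(col) / total) ** 2 for col in zip(*counts))

    if p_e == 1.0:
        raise ValueError("kappa is undefined when all ratings share one category")
    return (p_bar - p_e) / (1.0 - p_e)


def classify_agreement(kappa: float) -> str:
    """Map a kappa value to the agreement bands quoted in the abstract."""
    if kappa < 0.0:
        return "poor"
    if kappa <= 0.20:
        return "slight"
    if kappa <= 0.40:
        return "fair"
    if kappa <= 0.60:
        return "moderate"
    if kappa <= 0.80:
        return "substantial"
    return "almost perfect"


if __name__ == "__main__":
    # Hypothetical table: 4 radiographs, 8 raters, 4 grades (none..severe).
    example = [
        [6, 2, 0, 0],
        [1, 5, 2, 0],
        [0, 3, 4, 1],
        [0, 0, 2, 6],
    ]
    k = fleiss_kappa(example)
    print(f"kappa = {k:.2f} ({classify_agreement(k)})")
```

Applied to the values reported in the abstract, `classify_agreement(0.30)` returns "fair" for the interobserver Tonnis grade and `classify_agreement(0.55)` returns "moderate" for the intraobserver Tonnis grade, matching the conclusion.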

DOI: 10.1177/03635465221147055

PubMed ID: 36645041