This study demonstrated that the Physics Toolbox Suite mobile application is a reliable measurement tool for assessing the dynamic balance of healthy adults. The application showed good to high intra- and inter-rater reliability for APSI, MLSI, and OSI scores. Assessment of balance can be an important part of the rehabilitation protocol for an athletic injury, such as knee or ankle (Arnold and Schmitz, 1998). Measurement of dynamic balance can be used to assess the risk of injury, determine the severity of injury and monitor progress in rehabilitation protocols (Mattacola et al., 1995). In assessing dynamic balance, measurement tools that are reliable, cost effective, and do not suffer the limitations of existing mobile applications are required. The reliability of the Physics Toolbox Suite mobile application, which is similar to the working mechanism and scoring system of the BBS system and does not include the shortcomings of existing mobile applications, was evaluated in this study. Reliability indicates the degree of consistency between repetitions of a measurement or test (Weir, 2005; Ulupınar, 2022). The ICC is the most frequently preferred reliability indicator in research because it provides comprehensive information about the correlation of measurements (Hopkins, 2000; Ulupınar, 2022). For the first time in the literature, Pincivero (1995) conducted a dynamic balance reliability study with 20 university students on BBS. When OSI measurements were evaluated for the dominant and non-dominant legs at level 2 resistance (resistance levels 1: least stable, 8: most stable), both ICC scores were 0.60 (Pincivero, 1995). Following Pincivero (1995), Schmitz and Arnold (1998) evaluated the reliability of dynamic balance measurement with BBS using 30 seconds measurements with 19 people on dominant leg. They reported an inter-rater ICC value for OSI of 0.43 and 0.80 for intra-rater. In the study conducted with the Physics Toolbox, the OSI score ICC values were 0.73 for inter-rater and 0.87 for intra-rater. Although the platform used in this study moved in all directions without resistance restrictions, the ICC values were higher than those in the other two studies. The reason for this difference may be that the measurements made with the Physics Toolbox were made on two legs instead of a single leg due to the platform’s very mobile nature. The OSI score values show high intra-rater and good inter-rater reliability, indicating that the Physics Toolbox application is reliable in OSI evaluation. The APSI score provides detailed information for evaluating anterior and posterior changes in dynamic balance. Reliability measurements for APSI scores performed using the Physics Toolbox application showed good intra-rater (ICC(3,1) = 0.67) and inter-rater (ICC(3,1) = 0.73) reliability. In their evaluation with BBS, Schmitz and Arnold reported an APSI score of 0.68 for inter-rater and 0.80 for intra-rater (Schmitz and Arnold, 1998). The different testing protocols may explain the partially different reliability values. According to these results, it seems that the Physics Toolbox application is a reliable method for evaluating anterior-posterior dynamic balance. Another component of dynamic balance evaluation is medial-lateral assessment. Schmitz and Arnold, in their reliability evaluation of MLSI scores on the dominant leg, reported an inter-rater ICC of 0.42 and an intra-rater ICC of 0.40 (Schmitz and Arnold, 1998). Even if the measurement was made on the dominant leg, measurement on the other leg may have caused the ICC to be reported as low. In the evaluation conducted using the Physics Toolbox, high intra-rater reliability (ICC(3,1) = 0.90) and good inter-measurer reliability (ICC(3,1) = 0.71) were found in MLSI scores. The possible reason why MLSI scores show higher reliability than APSI scores is that the ankle is more anatomically stable in the medial and lateral aspects and has a lower joint range of motion. Although many studies have evaluated static balance with mobile applications, dynamic balance evaluation has been examined in a limited number of studies. In one important application study evaluating static balance, high reliability was reported for the Gait & Balance application for postural stability in the anterior-posterior axis (ICC(0.78)) and for the medial-lateral axis (ICC(0.84)) (Rashid et al., 2021). In the static balance assessment performed using the FDA approved Sway application, good reliability was reported in intersession (0.6-0.76) and intrasession (0.47-0.78) measurements (Amick et al., 2015). When considering the few studies that evaluated dynamic balance, Kuznetsov et al. evaluated the dynamic balance of participants on one leg and reported the ICC reliability value of the AccWalker smartphone application as 0.59 (Kuznetsov et al., 2018). In the study using the AccWalker application, the phone was fixed to the thigh during the measurement. In measurements performed with Sway, the phone was fixed on the body, similar to the AccWalker study (Amick et al., 2015; Eldesoky et al., 2017). High reliability values cannot be achieved in reliability studies on smartphone applications because of the negative impact of placing the phones on the body. However, it has been shown that there is no significant difference between smartphone applications and classical and reliable systems such as the BSS in dynamic balance measurement (Eldesoky et al., 2017). In their study, Eldesoky et al. (2017) evaluated the dynamic balance (at level 8) using the Sway application and the Biodex balance system simultaneously. They reported that there was no significant difference between Sway and BBS OSI scores in the measurement of dynamic balance (Eldesoky et al., 2017). Unlike previous mobile applications, in this study, which was carried out with the Physics Toolbox application, the phone was not placed on the body, and measurements were carried out by fixing it on a platform similar to the BBS platform. Fixing the phone on the platform may reduce measurement errors caused by the patient and may contribute to the higher reliability of this study compared with previous studies. Although ICC provides detailed information about the correlation of measurements, it is recommended to support it with calculations such as SEM, MDC and CVSEM, because of ICC results are affected by the number and homogeneity of subjects. The SEM value provides important information about the number of possible measurement errors, and it is stated that evaluations with lower SEM values are more reliable (Chiu et al., 2016; Ulupınar, 2022). Schmitz and Arnold reported interrater SEM values between 0.65 and 0.90 and interrater values of 0.61 and 0.71 in their dynamic balance measurements on one leg using BBS (Schmitz and Arnold, 1998). In a similar study conducted with BBS, Akhbari et al. (2015) found inter-session SEM values in the range of 3-5. Parraca et al. (2011), who evaluated the dynamic balance of physically active individuals, calculated the SEM value for the OSI score as 0.19. The difference between these results in the literature may be explained by using different difficulty levels in dynamic balance measurements with BBS. In their study using the Sway application, which is considered reliable, Amick et al. (2015) reported SEM values between 0.47 and 0.76 and CV% values as 5.95% - 10.47%. In the dynamic balance evaluation performed using the Physics Toolbox application, intra-rater SEM values were found to be in the range of 0.20-0.38 and inter-rater values were found to be 0.36-0.50. CVSEM, which shows the ratio of the possible error amount to the average was evaluated and was found to be between 8.79% and 16.67%. According to these SEM results, the sensitivity of measurements obtained using the Physics Toolbox application was considered to be good. The smallest difference indicating the change between measurements that was not caused by measurement errors was calculated using MDC (Wilkerson and Nitz, 1994; Chiu et al., 2016). In the dynamic balance measurement by Akhbari et al. (2015) using BBS, MDC values were determined between 0.1 and 0.8. When the dynamic balance measurement was performed using the Sway application, which uses a scoring system different from the BBS system, MDC values were reported to be between 14.95 and 20.96 (Amick et al., 2015). In the evaluation performed with the Physics Toolbox application using the BBS scoring system, MDC values were found to be in the range of 0.58-1.40. As expected in the evaluations conducted using the Physics Toolbox, the intra-rater MDC values were measured at a better level than the inter-rater values. The reason that the evaluation made with the Physics Toolbox application found results that were partially higher than those made with BBS may be due to Akhbari et al. (2015) using a medium level of stability difficulty level in their evaluation. As a limitation of this study, measurements could only be made on two legs instead of one leg due to the placement of the mobile phone on a very mobile platform. Another limitation of the study is that only healthy young adult participants were included, and no reliability evaluation was performed for individuals with balance impairment. Future development and balance testing is needed to optimize the design of the app for use in patients with balance disorders and other comorbidities (eg, diabetes mellitus, neurologic disease). The findings of this study indicate that the Physics Toolbox Suite mobile application is a reliable and accessible tool for assessing dynamic balance in healthy adults, exhibiting high intra- and inter-rater reliability. This application offers a practical solution by addressing the limitations of existing mobile applications and providing an affordable, accessible alternative to costly and complex systems such as BBS. |