Research article - (2022)21, 356 - 375
DOI:
https://doi.org/10.52082/jssm.2022.356
Are Wrist-Worn Activity Trackers and Mobile Applications Valid for Assessing Physical Activity in High School Students? Wearfit Study
Jesús Viciana1, Carolina Casado-Robles1, Santiago Guijarro-Romero2,, Daniel Mayorga-Vega3
1Department of Physical Education and Sport, University of Granada, Granada, Spain
2Department of Didactic of Musical, Plastic and Corporal Expression, University of Valladolid, Valladolid, Spain
3Departamento de Didáctica de las Lenguas, las Artes y el Deporte, Facultad de Ciencias de la Educación, Universidad de Málaga, Málaga, España

Santiago Guijarro-Romero
✉ Department of Didactic of Musical, Plastic and Corporal Expression, Faculty of Education and Social Work, University of Valladolid, Paseo de Belén, 1, 47011, Valladolid, Spain
Email: santigr93@gmail.com
Received: 29-03-2022 -- Accepted: 26-05-2022
Published (online): 01-09-2022

ABSTRACT

The purpose was to examine the validity of three wrist-worn commercial activity trackers (Samsung Galaxy Watch Active 2, Apple Watch Series 5, and Xiaomi Mi Band 5) and six mobile apps (Pedometer and Pacer for android and iPhone mobiles, Google Fit for android, and Apple Health for iPhone mobiles) for estimating high school students’ steps and physical activity (PA) under free-living conditions. A sample of 56 (27 females; mean age = 14.7 years) and 51 (25 females; mean age = 14.0 years) high school students participated in Study 1 and 2, respectively. Study 1: Students performed a 200-meter course in four different conditions while wearing the wearables. Step counting through a video record was used as the golden standard. Study 2: Students wore the three wrist-worn commercial activity trackers during the waking time of one day, considering ActiGraph model wGT3X-BT accelerometers as a standard of reference. Afterward, the agreement between the PA scores measured by the commercial activity trackers and the video (study 1) or accelerometers (study 2) were calculated as follows: Equivalence test, Limits of Agreement (LOA); Mean Absolute Error (MAE); Mean Absolute Percentage Error (MAPE); and Intraclass Correlation Coefficient (ICC). Results showed that all the wearables presented excellent validity for assessing steps in structured free-living conditions (study 1; MAPE < 5%), although their validity was between poor-excellent based on ICC (95% confidence interval) values (ICC = 0.56-1.00). Regarding Study 2, the Xiaomi wristband and the Samsung Watch presented acceptable-excellent (MAPE = 9.4-11.4%; ICC = 0.91-0.97) validity for assessing steps under unstructured free-living conditions (study 2). However, the Apple Watch presented questionable-excellent validity (MAPE = 18.0%; ICC = 0.69-0.95). Regarding moderate-to-vigorous PA (MVPA) and total PA, only the Apple Watch showed low-acceptable validity for MAPE value and questionable-excellent validity for the ICC values for MVPA assessment (MAPE = 22.6; ICC = 0.67-0.93). All wearables checked in this study have shown adequate validity results in order to assess steps in both structured and unstructured free-living conditions for both continuous and dichotomous variables. Moreover, for assessing MVPA, only the Apple Watch reported valid results for compliance or non-compliance with the daily PA recommendations. However, the results showed low validity for total PA and MVPA as continuous variables. In conclusion, depending on the user’s/researcher’s aim and context, one or another wearable activity tracker could be more adequate, mainly because of its valid measurements and its costs.

Key words: Consumer-wearables, daily steps, physical activity recommendations, accuracy, validity, schoolchildren

Key Points
  • The Samsung Galaxy Watch Active 2, the Apple Watch Series 5, the Xiaomi Mi Band 5, and the mobile apps (i.e., Pedometer and Pacer for android and iPhone mobiles, Google Fit for android, and Apple Health for iPhone mobiles) were valid to measure high school students’ steps under structured and unstructured free-living conditions as a continuous variable.
  • The Samsung Galaxy Watch Active 2, the Apple Watch Series 5, and the Xiaomi Mi Band 5 wrist-worn activity trackers were valid to measure the compliance or non-compliance of high school students’ daily step-based physical activity recommendation (i.e., 10,000 steps/day) in unstructured free-living conditions.
  • The Apple Watch Series 5 wrist-worn activity tracker was valid to measure the compliance or non-compliance of high school students’ moderate-to-vigorous physical activity-based recommendation (i.e., 60 minutes of moderate-to-vigorous physical activity/day) in unstructured free-living conditions.
  • Considering the user’s/researcher’s objective and context, one or another wearable activity tracker could be more adequate, mainly because of its valid measurements and its costs.
INTRODUCTION

The high prevalence of physical inactivity in all ages (i.e., in adolescence, 77.5% of males and 84.7% of females; Word Health Organization, WHO, 2020; Guthold et al., 2020) and its associated consequences for health is well known (Poitras et al., 2016; WHO, 2020). For instance, physically inactive high school students are more likely to become physically inactive adults who are more susceptible to all-cause mortality (Paluch et al., 2022). Consequently, a current public health priority is to improve the ratio of high school students that meet the daily physical activity (PA) recommendations of 60 minutes of moderate-to-vigorous PA (MVPA) in aerobic activities (WHO, 2020) or its equivalent in steps, that is to say 10,000 per day (Mayorga-Vega et al., 2021).

Additionally, it is not an easy ability for high school students to interpret the intensity level of daily activities and their relationship with MVPA (Crossley et al., 2019). In order to help the population in this task of regulating daily PA and its quantification, the consumer-wearable activity trackers have become a valuable asset as they provide a display for self-monitoring of the users’ PA (Strath and Rowley, 2018). Particularly, due to several advantages, two types of wearable activity trackers stand out as being the most popular: (a) wrist-worn activity trackers (including smartwatches and activity wristbands) have shown to be one of the most valued and used type of wearable because of their characteristics, including an attractive display, real-time feedback, low weight and price, audible sedentary break warnings, and goal alerts (Maher et al., 2017); and (b) mobile apps, which are widely used among high school students, are freely available, and let them register many PA parameters. Therefore, wrist-worn activity trackers and mobile apps could be useful, accessible, and feasible devices to objectively assess and promote high school students’ daily PA (Casado-Robles et al., 2022; Da Silva et al., 2015). Moreover, these two wearable activity trackers are widely extended all over the world, and the number of users is continuous and exponentially increasing every year (Vailshery, 2021).

Concerning the measurement of these wearables, it is necessary to know to what extent the information that consumers receive from their device regarding PA is accurate and valid. For instance, it is also important for educational researchers and Physical Education teachers in the school setting who are interested in applying wearables in Physical Education-based physical activity promotion programs, as it affects students’ daily physical activity, which is a priority in the educational system of most countries around the world (OECD, 2019). Consequently, valid wearable activity trackers could help researchers and Physical Education teachers to achieve this educational standard with reduced costs. That is, they could better control the PA carried out by high school students, even differentiating between different parts of the day. Additionally, students could have immediate feedback about the PA that they are performing during the day, which may facilitate them to achieve the daily PA recommendations. However, researchers need to know if they can use wearable activity trackers only as a strategy to increase habitual PA or also as a dependent variable.

In regards to PA intensity, the doubly labeled water assessment of energy expenditure is the gold standard (Westerterp, 2017), but the oxygen uptake measurement substitutes this expensive method, and is considered as the “reference standard” for assessing MVPA. Nevertheless, because of viability motives (e.g., small, light, comfortable, and reliable and valid; Migueles et al., 2017), accelerometers have been used as the most appropriate alternative in free-living conditions (Cosoli et al., 2020; Mayorga-Vega et al., 2019; Westerterp, 2013). In fact, ActiGraph accelerometers have been the most widely used method in population and calibration studies with high school students (Romanzini et al., 2014). Regarding the number of steps per day in free-living conditions, the accelerometer is also considered as the reference standard in most previous studies (Westerterp, 2013; 2017). However, it should be considered that accelerometer data for validation of other devices has traditionally been based on proprietary algorithms and therefore still subject to various forms of measurement error, particularly when assessing an outcome like MVPA (Migueles et al., 2017). The gold standard in structured or controlled settings is the video-taped and manual counter of the number of steps in a particular circuit or activity (El-Amrawy and Nounou, 2015; Stamm and Hartanto, 2018). In this sense, the convergence of measurements provided by other wrist-worn activity trackers, mobile apps, and the accelerometer will provide new possibilities and devices to use in those contexts beyond the laboratory, for instance, in Physical Education settings where validity and feasibility must coexist with each other (O’Neill et al., 2017).

Different cross-sectional studies with adults (e.g., Redenius et al., 2019; Toth et al., 2018) and synthesis studies (Brickwood et al., 2019; Gal et al., 2018) have shown the validity of wrist-worn activity trackers and mobile apps. Concerning high school students, the number of publications is scarce, although there are still some publications regarding their validity (Cosoli et al., 2020; Gorzelitz, et al., 2020). Overall results of these above-mentioned publications have shown a great variety of protocols for validation, devices, and methodological characteristics, which provide irregular results barely comparable to each other. Nevertheless, depending on the context to be applied and the required accuracy of the PA parameters, different devices could be recommended. Moreover, particular measurements, such as energy expenditure or PA levels, are more complex constructs, and their measurements present significant errors when assessed by wrist-worn activity trackers and mobile apps. For instance, Adamakis (2020) obtained large errors in all monitors and apps used underestimating the PA energy expenditure from 13.16% to 37.46%. On the contrary, the number of steps is a simpler measurement that can be assessed in a valid manner by mobile apps and wrist-worn activity trackers with a moderate correlation to the reference standard. Although these outputs measured by wearable activity trackers tend to have a lower validity for assessing intensity-related PA than steps among adults because of their high absolute percentage error with respect to reference instrumentation (Cosoli et al., 2020), among high school students, the evidence is currently insufficient (Voss et al., 2017). Moreover, because the components of wrist-worn activity trackers and mobile phones for the estimation of PA levels and steps are in continuous evolution by companies of wearable activity trackers, to analyze new models of them is desirable and recommended. Finally, Apple device validity studies were not found, or were scarce and centered in other parameters such as heart rate or energy expenditure, or measuring steps in a laboratory setting, as some previous synthesis studies have concluded (Fuller et al., 2020; Gorzelitz et al., 2020; LaMunion et al., 2020). However, they did not study unstructured free-living conditions, the measurement of PA intensities, nor the adequacy of its measurement to PA daily recommendations.

Consequently, the overall purpose of the present study was to examine the validity of the mobile apps Pedometer, Pacer, Google Fit for Android mobiles (Samsung); Pedometer, Pacer, and Apple Health for Apple mobiles (iPhone); and the wrist-worn activity trackers Samsung Galaxy Watch Active 2, Apple Watch Series 5, and Xiaomi Mi Band 5 for estimating PA in high school students during free-living conditions. Specifically, the purposes of this study were twofold: Study 1: to compare the criterion-related validity of the steps assessed by the nine wearable activity trackers under structured free-living conditions. Study 2: to compare the convergent validity of the daily steps, total PA, and MVPA scores assessed by the three wrist-worn activity trackers under unstructured free-living conditions (the comparability between them and their convergent validity with the accelerometer, considered as reference standard).

METHODS

The Guidelines for Reporting Reliability and Agreement Studies (GRRAS) have been taken into account in the present study (Kottner et al., 2011), and the protocol followed conforms to the Declaration of Helsinki statements (64th WMA, Brazil, October 2013) and was approved by the Ethical Committee for Human Studies at the University of Granada. Then, the principals of two state high schools (chosen by convenience) from an urban area situated in Granada (Spain) were contacted for Study 1 and 2. They were informed about the project and permission to conduct the study was obtained. After the school’s approval was acquired, all the students and their legal guardians were fully informed about the characteristics of the project. Signed written informed assents from the participants and signed written informed consents from their legal guardians’ were obtained before starting the study. School A was used for Study 1 (i.e., criterion-related validity under structured free-living conditions) and school B for Study 2 (i.e., convergent validity under unstructured free-living conditions). According to the center’s reports, all the students’ families had a middle socioeconomic level.

The present validity study followed a cross-sectional design. All the high school students (i.e., 70 and 75 for the school A and B, respectively) enrolled in the selected schools (i.e., 12–18 years old) were invited to participate in the present study. The high schools were located in an urban area of the city center of (deleted for anonymity reasons). For each study, the following inclusion criteria were considered: (a) being enrolled in the selected high schools (i.e., in the 7th to 12h grade); (b) being free of any health disorder that would make them unable to engage in PA normally; (c) presenting the corresponding signed written informed assents of the students, and (d) presenting the corresponding signed written informed consents of their legal guardians. The following exclusion criteria were considered: (a) not having completed and valid data from the three wrist-worn activity trackers (i.e., Samsung Galaxy Watch Active 2, Apple Watch Series 5, or Xiaomi Mi Band 5) or from the six mobile apps (Study 1) or from the three wrist-worn activity trackers (Study 2), and (b) not having completed and valid data from the video-based step counts (Study 1) or from the accelerometer (Study 2).

A priori sample size calculation was estimated with the Arifin’s web-based sample size calculator (Arifin, 2018). Based on steps values, parameters were set as follows: Study 1 and 2: Intraclass Correlation Coefficient (ICC), ρ0 = 0.70 (Nunnally, 1978); ρ1 = 0.86 (Voss et al., 2017), α = 0.05, 1 – β = 0.80, k = 2, dropout = 10% (Rowlands et al., 2018). Study 2: Kappa, k0 = 0.40 (Cicchetti, 2001); k1 = 0.85 (Mayorga-Vega et al., 2021), p = 0.25 (Guthold et al., 2020), α = 0.05, 1 – β = 0.80, k = 2, dropout = 10% (Rowlands et al., 2018). A final sample size of at least 49 students (minimum initial sample size equal to 55) was used in the final analysis. In addition to exceeding the minimum required sample size, a balance by grade and sex was taken into account for each study’s sample.

Measures

Demographic characteristics. Participants’ sex (males/females), age (in years), and non-dominant hand (left/right) information was self-reported.

Anthropometric. Participants’ body mass (kg) and height (cm) were first measured following the International Standards for Anthropometric Assessment (Stewart et al., 2011). Participants’ body mass and height were measured in shorts, T-shirts, and barefoot. For the body mass measure, participants stood in the center of the scale (Seca, Ltd., Hamburg, Germany; accuracy = 0.1 kg) without support and with their weight distributed evenly on both feet. For the body height assessment, participants stood with their feet together with the heels, buttocks and upper part of the back touching the stadiometer (Holtain Ltd., Crymmych, Pembs, United Kingdom; accuracy = 0.1 cm), and with the head placed in the Frankfort plane. Each measurement was performed twice and the mean was recorded (Stewart et al., 2011). Then, the body mass index was calculated as body mass divided by body height squared (kg/m2). Finally, students’ body weight status was categorized by sex- and age-adjusted body mass index thresholds as overweight/obese or non-overweight/obese (Cole et al., 2000). Body mass index and body weight status scores have shown high evidence supporting validity among high school students (Cole et al., 2000).

Activity wristbands and mobile apps. Participants’ PA levels and steps were estimated by the three wrist-worn activity trackers [i.e., Xiaomi Mi Band 5 (Xiaomi, Pekin, China); Samsung Galaxy Watch Active 2 (Samsung, Seul, South Korea); and Apple Watch Series 5 (Apple Park, California, USA)]; and from six mobile apps [i.e., Pedometer (ITO Technologies) and Pedometer Pacer Health for Android and iOS; and Google Fit app for Android (Samsung Galaxy S20+), and the Apple Health app for iOS (iPhone 11 Pro Max)]. Regarding the number of wrist-worn activity trackers, it was considered that three devices were a feasible number that did not interfere with the participants’ natural arm swing while walking, brisk walking, or running (Study 1), or daily PA (Study 2). Mobile phones (Study 1) were allocated in two belt bags that changed sides for each participant, which also did not interfere with the participants’ movements during the test. The total mass of the three wearable bands was not high (88 grams). According to the user manual of each device brand, the wrist-worn activity trackers were fit snugly on the top of participants’ wrist, close and above the wrist bone (7.6 cm width). Concerning the particular chosen wrist-worn activity trackers, the criteria were to study the most used worldwide display-based trackers, choosing the most advanced model (in that moment). Three wrist-worn activity trackers were chosen, one being a more low cost option (i.e., Xiaomi Mi Band 5 ≈ 25€) and the other two were smartwatches from the two main brands [Samsung belonging to Android (Samsung Galaxy Watch Active 2 that uses the Android Wear System) and Apple belonging to iOS (Apple Watch Series 5 that uses the WatchOS Apple System)] (based on the International Data Corporation’s Worldwide Quarterly Wearable Device Tracker reports from 2021, and Henriksen et al. 2018). Concerning the mobile apps, the following ones were selected: Pedometer (ITO Technologies) and Pedometer Pacer Health for Android and iOS, the Google Fit app for Android and the Apple Health app for iOS. After revising the previous literature, the criteria followed for the selection were (both for Android and iOS): (a) the most popular and used apps (due to the number of downloads and their user ratings); (b) free download apps; and (c) the included apps of the corresponding mobile phones (i.e., Samsung Google Fit for Android and Apple Health for iOS).

According to the user manual of each device brand, wrist-worn activity trackers were adjusted, in random order, on the participant’s wrist of their non-dominant hand. The three chosen devices are characterized to be small and lightweight based in tri-axial built-in accelerometers (Xiaomi Mi Band 5: 22 g, 1.5 x 1.6 x 4.0 cm; Samsung Galaxy Watch Active 2 2: 30 g, 1.09 x 4.4 x 4.4 cm; Apple Watch Series 5: 30.8 g, 1.07 x 4.0 x 4.4 cm). Each wrist-worn activity tracker and mobile phone has its own algorithmic equation to estimate the daily steps taken and the minutes engaged in each specific intensity-related PA. The wrist-worn activity trackers data and mobile phone apps were recorded immediately from the screen (for Study 1), and were also synchronized via Bluetooth for the three wrist-worn activity trackers to their specific applications in order to download and store data (for Study 2) (Xiaomi Mi Band 5: Mi Fit version 5.3.2 for Android; Samsung Galaxy Watch Active 2: Samsung Health version 6.19 for Android; Apple Watch Series 5: Apple Health version 7.6.2 for iOS).

Regarding the data scoring, steps (number) were registered as directly reported in the devices. For the intensity-related PA (minutes), scores were calculated as follows: for the Xiaomi Mi Band 5, the variables “slow walking” and “brisk walking” were calculated by adding the total time spent on all the stretches of “slow walking” and “brisk walking,” respectively. The variable MVPA was calculated by adding the total time spent on all the stretches of “moderate activity” and “vigorous activity”. Aside from the default variable “total activity”, the variable “walking” was also calculated by adding the total time spent on all the stretches of “slow walking” and “brisk walking”; for the Samsung Galaxy Watch Active 2, the variable total PA was calculated considering the default option “total active minutes” and; for the Apple Watch Series 5, the default option “exercise minutes” was considered as the variable MVPA.

Video-based steps count. Participants’ reference standard of steps under structured free-living conditions was determined by step counting the video recording in slow-motion, which is considered to be the golden standard of this measurement. Participants were asked to perform a 200-meter course in four different conditions. The 200-meter course was marked with cones and lines and performed inside the school on a non-slippery sport court with an oval shape and no tight turns. A digital video camera (Go Pro Hero 7, California, USA) with a tripod was situated in the middle of the sports court in order to easily record the participants’ lower limbs during the entire course from the sagittal plane. When the participant was at the starting line, the steps count from the wrist-worn activity trackers were recorded. Then, they were instructed to not move until they started walking/running. They also were asked to always start the course with the contralateral leg to the arm where the wrist-worn activity trackers were attached. For calculating the speed and step cadence of each condition, when the participant started walking/running the manual chronometer was activated and was stopped after he/she crossed the finish line. Participants were requested to stop immediately after the finish line, and a cone was situated five meters beforehand to remind them. Then, the steps counted by the wrist-worn activity trackers were registered. Participants performed the following four conditions: 1) slow pace walking; 2) normal pace walking (also known as self-pace walking); 3) brisk pace walking; and 4) jogging (slow running). Finally, the reference standard step count for each participant in each condition were performed independently by two researchers through the slow-motion video recording projected on a 15.6” screen. When disagreement occurred (8.3%), a third researcher evaluated it.

Accelerometer. Participants’ reference standard of daily steps and intensity-related PA under unstructured free-living conditions were determined by wGT3X-BT accelerometers (ActiGraph, LLC, Pensacola, FL, USA). The ActiGraph model wGT3X-BT is a small (4.6 x 3.3 x 1.5 cm), lightweight (19 g), tri-axial accelerometer. Accelerometers were adjusted on the participants’ right hips. Initializing, downloading, wear time validation and scoring (i.e., PA data filter) were performed using the ActiLife Lifestyle Monitoring System Software version 6.13.3 (ActiGraph, LLC, Pensacola, FL, USA), and were initialized with a sampling frequency of 30 Hz (Evenson et al., 2008; Trost et al., 2011). Data download was carried out with 15-second epoch (Evenson et al., 2008). According to previous literature when selecting a set of cut-points to estimate a variable from activity counts, it is recommendable to select the same filter that was used in the validation study for the cut-points employed (Migueles et al., 2017). Valid wear time was set as equal to or higher than 600 minutes per day (Migueles et al., 2017), with non-wear periods set as 60 minutes or more of consecutive zero-count epochs with up to a two minute spike tolerance (Oliver et al., 2011).

Regarding the data scoring, steps (number) were assessed by within-instrument processing of the number of cycles in the accelerometer signal or cycle counts. The time (minutes) engaged in MVPA (≥ 2,296 counts/ min), and total PA (≥ 101 counts/min) was calculated by the Evenson’s thresholds (Evenson et al., 2008). According to the cross-validation study performed by Trost et al. (2011), these cut-off points have demonstrated the best evidence supporting score validity for assessing intensity-related PA among high school students. Finally, participants’ steps and MVPA were dichotomized as achieving or not achieving the daily recommendation of at least 10,000 steps and 60 minutes of MVPA, respectively (Mayorga-Vega et al., 2021; WHO, 2020). ActiGraph accelerometer scores have shown high evidence supporting validity for assessing steps and intensity-related PA among high school students (Hickey et al., 2016; Lee et al., 2015; Romanzini et al., 2014).

Procedure

Study 1 and 2: Data collection was carried out by the same researchers, instruments, and protocols. Firstly, participants’ demographic characteristics, anthropometric, and self-reported habitual PA levels were registered. The three wrist-worn activity trackers were adjusted so they would not move (i.e., avoiding over-tightening and clearance), in a random order, on the participants’ wrists of their non-dominant hand. All devices were blocked in order to prevent participants from manipulating any functions and options they had, and to avoid influence in their habitual behavior.

Study 1: Evaluations were carried out during the afternoon in participants’ leisure time from Monday to Friday, and then data were downloaded and batteries charged during the morning. Due to the limitations of instruments, facilities, and time, an average of two or three participants per hour were evaluated one by one during each evaluation session. Apart from the three wrist-worn activity trackers, the two mobile phones were inside two belt bags allocated on each participants’ hip, preventing movement, approximately like carrying it in the pants pocket, and alternating sides for each participant. Participants were instructed to walk/run the 200-meter course in the four conditions specified above, at a continuous speed, and with a natural arm swing. Before starting, a demonstration in order to guide each participant was performed. During the five-minute rest between conditions, step count data from the three wrist-worn activity trackers and the mobile phone apps were recorded.

Study 2: The three wrist-worn activity trackers and the accelerometer were adjusted to one participant each day, Monday through Thursday. Then, on Friday, data were downloaded. Due to the limitations of material resources, waves of three participants per day were carried out. For each wave, participants were met at 8:00 a.m. in a room allocated in the school gym, so they could then go and start their school day at the regular time (i.e., 8:30 a.m.). Additionally to the wrist-worn activity trackers, an accelerometer was adjusted on the participants’ right hip using an elastic waistband. Participants were instructed to wear the wrist-worn activity trackers and the accelerometer for the whole day until bedtime maintaining their habitual PA levels, and they were asked to take them off only when they took a bath/shower, or to leave them in a plastic box inside their schoolbags just before going to bed. In the case of a participant reporting that he/she was going to engage in aquatic activities that day, he/she was requested to come the next day (and another participant was requested instead). In the morning of the following day, the wrist-worn activity trackers were collected and adjusted to the next three participants following the same protocol.

Statistical analysis

Descriptive statistics for all the variables of the included participants were calculated. Firstly, all the statistical tests assumptions were checked and met (e.g., histograms and Q-Q plots for normality). Furthermore, univariate (i.e., z ± 3.0) and multivariate outliers (i.e., Mahalanobis distance) were removed. Afterward, the agreement between the PA scores (i.e., steps, total PA, and MVPA considered as continuous variables) measured by the wearable activity trackers and the video (study 1)/accelerometers-wearables (study 2) were calculated as follows: Equivalence test with the Confident Interval method (90% CI) (Dixon et al., 2018); (b) Limits of Agreement (LOA) with its confident intervals (95% CI) (Bland and Altman, 1986); (c) Mean Absolute Error (MAE) (Willmott and Matsuura, 2005); (d) Mean Absolute Percentage Error (MAPE) (Johnston et al., 2021); and (e) Intraclass Correlation Coefficient (ICC), and its 95% CI, by a two-way random-effects model with absolute agreement and single measurement [also known as ICC(2,1)] (Koo and Li, 2016). Additionally, LOA plots, which are the individual participant differences between the two scores plotted against the respective individual means, were performed (Bland and Altman, 1986). Heteroscedasticity was also examined objectively by calculating the Pearson’s correlation coefficient (r) between the absolute differences and the individual means (Atkinson and Nevill, 1998). Based on Cohen’s (1992) benchmarks, a correlation coefficient > 0.50 was considered as indicative of heteroscedasticity. Finally, the agreement between the PA scores (i.e., steps and MVPA) dichotomized as achieving or not achieving the daily recommendations of 10,000 steps (Mayorga-Vega et al., 2021) and 60 minutes of MVPA (i.e., categorical variables) (WHO, 2020) measured by the wearable activity trackers and the accelerometers (Study 2) were calculated as the proportion of agreement [P = number of agreements/ (number of agreements + disagreements)] and kappa coefficient (k) (Hernaez, 2015). Agreement values were interpreted as follows: Equivalence test, the mean reference standard is within ± 15% of the mean activity trackers is considered acceptable (Dixon et al., 2018); MAPE, > 20.0% poor; 15.1-20.0% questionable; 10.1-15.0% acceptable; 5.1-10.0% good; and 0.0-5.0% excellent (Bai et al., 2021; Johnston et al., 2021). ICC, 0.00-0.49 unacceptable; 0.50-0.59 poor; 0.60-0.69 questionable; 0.70-0.79 acceptable; 0.80-0.89 good; and 0.90-1.00 excellent (Nunnally, 1978); k, 0.00-0.39 poor; 0.40-0.59 acceptable; 0.60-0.74 good; and 0.75-1.00 excellent (Cicchetti, 2001). Based on statistical inference, each ICC value was interpreted according to its 95% CI, that means, there was 95% chance that the true ICC value landed on any point between the 95% CI range (Koo and Li, 2016). Regarding kappa values, < 0.00 poor; 0.00-0.20 slight; 0.21-0.40 fair; 0.41-0.60 moderate; 0.61-0.80 substantial; 0.81-1.00 almost perfect (Landis and Koch, 1977). All statistical analyses were performed using the SPSS version 25.0 for Windows (IBM® SPSS® Statistics), except for the equivalence testing where the Jamovi version 2.3 (The Jamovi project, https://www.jamovi.org) was used. The statistical significance level was set at p < 0.05.

RESULTS
General characteristics

Figure 1 shows the flow diagram of the participants through the studies. An initial sample of 62 and 61 students agreed to participate and met the inclusion criteria in the Studies 1 and 2, respectively. Since some participants met at least one exclusion criterion, the final sample consisted of 56 (29 males and 27 females) and 51 (26 males and 25 females) participants for the Studies 1 and 2, respectively (i.e., non-compliance rate of 9.7% and 16.4%, respectively). Table 1 shows the general characteristics of the included participants.

Study 1: Validity of the wearable activity trackers during structured free-living conditions

The average speed (SD) in each condition was as follows: Slow pace walking = 1.1 (0.2) m/s [3.9 (0.5) km/h]; normal pace walking = 1.5 (0.1) m/s [5.2 (0.5) km/h]; brisk pace walking = 1.9 (0.2) m/s [6.8 (0.6) km/h]; and running = 3.1 (0.4) m/s [11.0 (1.3) km/h]. The average step cadence (SD) in each condition was as follows: Slow pace walking = 99.7 (7.3) steps/min; normal pace walking = 113.3 (5.0) steps/min; brisk pace walking = 128.7 (7.0) steps/min; and running = 167.2 (19.1) steps/min.

Table 2 shows the validity of the wearable activity trackers for estimating steps during structured free-living conditions. The agreement between the steps assessed by the wearable activity trackers and video-based count tended to be higher for slow pace walking, followed by running, normal pace walking, and brisk pace walking. Regarding the validity results of the steps based on the values of 90% CI of the equivalence test, the 90% confidence interval of all activity trackers scores were inside the equivalence region of reference standard. Similarly, based on the MAPE values, the validity results of the steps assessed by all five devices (three wrist-worn activity trackers and the two mobile phones with their respective apps) in the four conditions were excellent (i.e., < 5%). Note that due to the three Apple iPhone apps registering exactly the same parameters without distinction between them, only “Apple iPhone apps” has been reported in this Results section. Regarding the validity results of the steps based on the values of 95% CI of the ICC, similarly, the Samsung apps Pacer and Google Fit in all conditions ranged between good to excellent. Furthermore, the validity results of the steps assessed by the Apple iPhone and Apple Watch Series 5 were excellent, and by the Samsung Pedometer and Samsung Galaxy Watch Active 2 were good-excellent in the three conditions. However, their validity was worse in one condition: Apple iPhone in the slow pace walking condition was acceptable-excellent; Apple Watch Series 5 in the brisk pace walking was poor-excellent; Samsung Pedometer in the normal pace walking condition was questionable-good; and Samsung Galaxy Watch Active 2 in the brisk pace walking was questionable-excellent. Similarly, while the validity results of the steps assessed by the Xiaomi Mi Band 5 were excellent in the slow pace walking and running conditions, they were acceptable-excellent and poor-good in the normal and brisk pace walking conditions, respectively. Figure 2, Figure 3, Figure 4 and Figure 5 show the LOA plots. Pearson’s correlation coefficients showed that with all the instruments there was no heteroscedasticity on any walking/running condition (Table 3).

Study 2: Validity of the wrist-worn activity trackers during daily unstructured free-living conditions

Table 4 shows the validity of the wrist-worn activity trackers for estimating daily PA (i.e., steps, moderate-to-vigorous, and total physical activity) in high school students during unstructured free-living conditions.

Regarding the validity results of the steps based on the values of 90% CI of the equivalence test, the 90% confidence interval of the steps assessed by the Xiaomi Mi Band 5 and Samsung Galaxy Watch Active 2 were inside the equivalence region of reference standard. Similarly, based on the values of both the MAPE and 95% CI of the ICC, the validity results of the steps assessed by the Xiaomi Mi Band 5 were good and excellent, respectively; and the validity results of the steps assessed by the Samsung Galaxy Watch Active 2 were acceptable and excellent based on the MAPE and 95% CI of the ICC, respectively. However, as regards the Apple Watch Series 5, the validity results showed that it was not inside the equivalence region of the reference standard (i.e., equivalence test), was poor for the values of the MAPE and questionable-excellent for those with the 95% CI of the ICC. The validity results for the MVPA and total PA assessment for the three wrist-worn activity trackers were not inside the equivalence region of the reference standard (i.e., equivalence test), and were poor for the values of both the MAPE and 95% CI of the ICC (exceptionally was questionable-excellent for the values of 95% CI of the ICC assessed by the Apple Watch Series 5). Figure 6, Figure 7, and Figure 8 show the LOA plots. Pearson’s correlation coefficients did not show heteroscedasticity, except with the MVPA with the Xiaomi Mi Band 5 (Table 3).

Table 5 shows the comparability of the wearable activity trackers for estimating daily PA during unstructured free-living conditions. Based on the values of both the MAPE and 95% CI of the ICC, the comparability results showed that the steps scores of the Samsung Galaxy Watch Active 2 and Xiaomi Mi Band 5 were good-excellent. Although the Apple Watch Series 5 and Xiaomi Mi Band 5 also had an ICC acceptable-excellent value for steps, with the MAPE the value was questionable. Similarly, while the comparability results showed that the total PA scores of the Samsung Galaxy Watch Active 2 and Xiaomi Mi Band 5 (with the slow-brisk walking time) were good-excellent with the ICC, it was poor with the MAPE. For the rest of results (i.e., Samsung Galaxy Watch Active 2-Apple Watch Series 5 for steps; all the comparison with MVPA; and for total PA Samsung Galaxy Watch Active 2-Xiaomi Mi Band 5 and Xiaomi Mi Band 5 total PA time and slow-brisk walking time) the values were unacceptable-poor.

Table 6 shows the validity of the wrist-worn activity trackers for estimating the daily PA recommendations (i.e., 10,000 steps/day or 60 min. of MVPA) in high school students during unstructured free-living conditions. A total of 13.7% and 17.6% of high school students met the accelerometer-measured step- and MVPA-based recommendations, respectively. The validity results of the daily step-based recommendations assessed by the three wrist-worn activity trackers were excellent. Regarding the daily MVPA-based recommendation, while the validity results with the Apple Watch Series 5 were excellent, for the Xiaomi Mi Band 5 they were poor. Table 7 shows the comparability of the wrist-worn activity trackers for estimating the daily PA recommendations during unstructured free-living conditions. The comparability results of the daily step-based recommendations assessed by the three wrist-worn activity trackers were excellent. However, as regards the daily MVPA-based recommendation, the results with the three wrist-worn activity trackers were poor.

DISCUSSION

Regarding the general objective of the study referring to the validity of the nine wearable activity trackers for estimating PA in high school students during free-living conditions, it is necessary to differentiate the measurement of steps and other PA parameters in structured or unstructured free-living conditions. In regard to the measurement of steps, all wearable activity trackers were valid under the two conditions. However, to measure PA parameters in unstructured free-living conditions only the Apple Watch Series 5 was valued as excellent for estimating the compliance or non-compliance of the daily recommendations based on MVPA.

Previous research coincided with the present study in regards to steps counting, showing high validity outcomes in youths, both with other wrist-worn activity tracker brands such as Jawbone or Fitbit (Evenson et al., 2015; Kang et al., 2019) and with the ones used in this tudy, although with different models of wrist-worn activity trackers (e.g., Fuller et al., 2020; Hao et al., 2021). However, similar to this study, the measurement of PA intensities has shown inadequate validity results (e.g., Degroote et al., 2020; Evenson et al., 2015; Feehan et al., 2018; Fuller et al., 2020; Voss et al., 2017).

Specifically, regarding the aim of Study 1, all cases of the wearable activity trackers were inside the equivalence region of reference standard (i.e., equivalence test), were excellent for the four conditions evaluated, taking into account the criterion of MAPE values, and good to excellent taking into account the ICC criterion. Perhaps this is the only condition and setting where there is a higher consensus in previous literature regarding wrist-worn activity trackers and their validity for measuring steps accurately (e.g., Fuller et al., 2020; Fokkema et al. 2017; or Hao et al., 2021). There were no clear differences between data obtained among devices. Being the wrist-worn activity trackers more comfortable than mobile phones to wear, and particularly being the Xiaomi Mi Band 5 the cheapest one (around 25€), it seems logical to conclude that this could be the most recommended option for measuring steps under structured free-living conditions. Therefore, in the school setting the Xiaomi Mi Band 5 could provide Physical Education teachers with the best opportunity to measure the high school students’ steps in Physical Education-based health promotion programs accurately. In this sense, several aspects have to be highlighted: a) the cost of the wearables analyzed is similar or even less than other materials such as a gymnastics bench or mats which are very used in PE; b) many times, adolescents have their own mobile phones that can be used in the centers with the corresponding authorization. Therefore, it would not be necessary to buy more wearables, but rather to know which applications to use and to know if they are valid, thus being a very feasible way to promote physical activity among this population and in Physical Education classes without spending money; c) in Spain, high school centers have an annual budget distributed among all departments, and d) also, in Spain, external financing called “teaching innovation programs” exist which allow to obtain the necessary budget for buying, for example, enough wearables.

In relation to the mobile apps specifically, it is important to denote that the Apple iPhone 11 used the same algorithm for assessing PA parameters in the three evaluated apps, providing the same results for all of them. Consequently, the possibilities offered by the Apple Company related to PA parameters assessed by mobile apps are reduced in comparison with Android mobile phones.

In regard to the specific aim of Study 2, different results were obtained, and different conclusions could be deduced. First, for the measurement of daily steps, the Xiaomi Mi Band 5 obtained the best validity result (good-excellent; inside the equivalence region of reference standard); followed by the Samsung Galaxy Watch Active 2 (acceptable-excellent; inside the equivalence region of reference standard); and finally, the Apple Watch Series 5 (questionable-excellent; not inside the equivalence region of reference standard). Previous research also obtained good validity results of daily steps in unstructured free-living conditions with high school students (Schneider and Chau, 2016; ŠimŠ¯nek et al., 2019; Yang et al., 2019), although the first one studied the Fitbit activity wristband detecting an overestimated registration in comparison to the ActiGraph accelerometer; the second studied the Garmin 1 and 3 activity wristband models, obtaining good levels of accuracy in comparison to the Yamax pedometer as the reference standard; and the third one studied a previous version of Xiaomi than the one used in the present study in comparison to the ActiGraph accelerometer. Secondly, for the MVPA measurement, the best validity result was obtained by the Apple Watch Series 5 with questionable-excellent values. The Xiaomi mi Band 5 presented unsatisfactory results and similar to the Apple Watch Series 5 was not inside the equivalence region of reference standard (i.e., equivalence test), and the Samsung Galaxy Watch active 2 did not provide this parameter. Previous literature with high school students’ samples is scarce. On the one hand, recent and previous review studies agree with the fact that some specific models of activity wristbands have obtained adequate validity results for MVPA (Gorzelitz et al., 2020), although they only analyzed one study with high school students which examined the validity of the Fitbit activity wristband. On the other hand, a second study with high school students carried out by Yang et al. (2019) obtained inadequate validity results for MVPA. Therefore, further studies are needed in order to achieve a more consensus in this variable, which probably with the new models of activity wristbands that incorporate new algorithms they may provide a more accurate measurement of MVPA in the future. Third, for total PA, none of the wrist-worn activity trackers analyzed provided adequate validity results, and the Apple Watch Series 5 did not provide this parameter.

Finally, for the PA daily recommendations, and taking into consideration the criterion of 10,000 steps, all devices obtained excellent values of validity. In relation to the 60 min of MVPA criterion, while the Apple Watch Series 5 obtained excellent results, the Xiaomi Mi Band 5 obtained poor results (note that the Samsung Galaxy Watch Active 2 did not provide this parameter). Therefore, valuable outputs were obtained for future intervention programs because the three wrist-worn activity trackers were valid for accurately classifying high school students according to whether or not they met the recommended 10,000 steps per day or the 60 min of MVPA only in the case of Apple Watch Series 5.

Consequently, these wearables are also useful for reporting valid feedbacks for being used in high school students’ progressive challenges to achieve daily recommendations in health-promotion interventions, and can be used as a motivational strategy to promote PA due to their accurate measurement of PA. Wearables’ characteristics of being lightweight, easy to use and understand, and economically affordable (Parra Saldías et al., 2018), together with the results obtained in the present study, makes these devices feasible and applicable to consumers (researchers or not) for promoting better levels of PA in adolescents at any context. In this sense, recent meta-analysis of Casado-Robles et al. (2022) concluded that the use of these devices in PA promotion programs among adolescents are an effective strategy for improving their PA levels. Therefore, and specifically in the school setting, Physical Education teachers may use these wearables in order to: (a) have a better control of their students’ extracurricular and out-of-school time PA (i.e., letting students self-monitor their PA and compare it with the daily PA recommendations, and also allow teachers to control students’ PA via the wearables’ application); (b) individualize health intervention programs for improving all students’ PA levels, programming personalized alerts and daily aims throughout the personal wearable; (c) maintain the students’ physical fitness levels achieved in previous Physical Education programs; and (d) plan particular students’ PA programs for holiday periods, such as Christmas or summertime.

A strength of this study was that, to our knowledge, it was the first study to examine the validity of more advanced and used models of wearable activity trackers (at the moment of the data collection, International Data Corporation’s Worldwide Quarterly Wearable Device Tracker Report, 2021) in structured and unstructured free-living conditions and with high school students, analyzing both PA parameters as continuous variables and consensus daily PA recommendations criteria. Furthermore, since the main goal of wearable activity trackers is to assess high school students’ daily PA levels or to use them as a motivational tool during daily life, the evaluation in free-living conditions as in the present study are closer to reality and, therefore, they are meaningful and useful (Duncan et al., 2018). Taking into account that the majority of the previous studies have been performed in adults or special samples such as people with different health problems (e.g., Feehan et al., 2018 or Fokkema et al., 2017), this study expands and complements this area of research, especially among the high school student population in which the number of studies are very reduced, which is also applicable to practical interventions in order to improve health promotion and behavior change programs in regards to PA, both in Physical Education and in other health or sport contexts (e.g., Duncan et al., 2012; Strath and Rowley, 2018).

Regarding the limitations, this study used a non-probability and relatively small sample, which limits the generalizability of the obtained results to this particular context. Secondly, ActiGraph accelerometers have been shown as the most common and valid method for objectively assessing high school students’ PA levels during free-living conditions (e.g., Shephard and Tudor-Locke, 2016; Van Hecke et al., 2016; Hickey et al., 2016; Trost et al., 2011; Migueles et al., 2017), considering that there is not any golden standard for those PA levels in unstructured free-living conditions. However, some methodological issues have not achieved an evidence-based consensus, such as the use of specific cut-off points which might misclassify students PA levels because they are not relative to the individual students’ fitness status. Nevertheless, taking into account this issue, the best current evidence-based decisions were adopted in the present study (Migueles et al., 2017). Therefore, it may contribute to the variability of wearable activity trackers convergent validity obtained results. Finally, the third limitation is related to the models of wearable activity trackers evaluated and its associated consequences. Two repercussions could be mentioned: (a) more advanced models of wearable activity trackers are continuously appearing in the market, and scientific knowledge always lags behind this market progress; and (b) some Companies did not provide their algorithms to classify PA at different intensities (e.g., Xiaomi or Apple companies). In this sense, to overcome these difficulties, companies must be more open about the algorithms they are using to estimate PA intensities. Moreover, it would be useful to standardize the way of taking data and analyzing it, that is, the protocol to carry out the study. To do that, it might be useful to develop interdisciplinary collaborations and open source tools to allow these data to be collected. Likewise, it would be practical to standardize the way of the reports, in order to facilitate the understanding regarding the protocol followed to analyze the validity of specific wearables and how the results obtained have been interpreted. Because of all of these mentioned limitations, further studies should be performed in the future in order to improve the knowledge regarding the accuracy and valid registration of PA parameters by wearable activity trackers, both for consumers’ and researchers’ decisions of their personal use. Moreover, it would be interesting if future studies show a comparison between males and females in terms of the validity assessment and physical activity levels.

CONCLUSION

The wearable activity trackers tested in this study have shown adequate validity results in order to assess steps in both structured (i.e., wrist-worn activity trackers and mobile phone apps) and unstructured (i.e., wrist-worn activity trackers) free-living conditions for both continuous and dichotomous variables. However, in unstructured free-living conditions and for assessing MVPA, only Apple Watch Series 5 reported valid results regarding the compliance or non-compliance with the daily PA recommendations, but not for measuring total PA or MVPA as continuous variables. Therefore, depending on the user’s/researcher’s aim and context, one or another wearable activity tracker could be more adequate, mainly because of its valid measurements and its costs.

ACKNOWLEDGEMENTS

Authors thank Aliisa Hatten for the English revision. This study is part of the doctoral thesis of Carolina Casado-Robles, as well as of the WearFit project (Reference: B-SEJ-029-UGR18), funded by the FEDER/ Regional Government of Andalusia-Ministry of Economy and Knowledge. The experiments complied with the current laws of the country in which they were performed. The authors have no conflicts of interest to declare. The datasets generated and analyzed during the current study are not publicly available, but are available from the corresponding author who was an organizer of the study.

AUTHOR BIOGRAPHY
     
 
Jesús Viciana
 
Employment:Department of Physical Education and Sport, University of Granada, Spain.
 
Degree: PhD
 
Research interests: Measurement and evaluation, health-enhancing physical activity, sedentary behavior, Physical Education-based interventions, motivation toward physical activity.
  E-mail: jviciana@ugr.es
   
   

     
 
Carolina Casado-Robles
 
Employment:Department of Physical Education and Sport, University of Granada, Spain.
 
Degree: PhD
 
Research interests: Measurement and evaluation, health-enhancing physical activity, sedentary behavior, Physical Education-based interventions, motivation toward physical activity.
  E-mail: ccasado@ugr.es
   
   

     
 
Santiago Guijarro-Romero
 
Employment:Department of Didactic of Musical, Plastic and Corporal Expression, University of Valladolid, Valladolid, Spain
 
Degree: PhD
 
Research interests: Measurement and evaluation, health-enhancing physical activity, sedentary behavior, Physical Education-based interventions, motivation toward physical activity.
  E-mail: santigr93@gmail.com
   
   

     
 
Daniel Mayorga-Vega
 
Employment:Departamento de Didáctica de las Lenguas, las Artes y el Deporte, Facultad de Ciencias de la Educación, Universidad de Málaga, Málaga, España.
 
Degree: PhD
 
Research interests: Measurement and evaluation, health-enhancing physical activity, sedentary behavior, health-related physical fitness, Physical Education-based interventions, motivation toward physical activity.
  E-mail: dmayorgavega@gmail.com
   
   

REFERENCES
Adamakis M (2020) Criterion validity of wearable monitors and smarphone applications to measure physical activity energy expenditure in adolescents. Sport Sciences for Health 16, 755-763.
Arifin W (2018) A web-based sample size calculator for reliability studies. Education in Medicine Journal 10, 67-76.
Atkinson G., Nevill A. M (1998) Statistical methods for assessing measurement error (reliability) in variables relevant to sports medicine. Sports Medicine 26, 217-238.
Bai Y., Tompkins C., Gell N., Dione D., Zhang T., Byun W (2021) Comprehensive comparison of Apple Watch and Fitbit monitors in a free-living setting. Plos One 16, e0251975.
Bland J., Altman D (1986) Statistical methods for assessing agreement between two methods of clinical measurement. Lancet 1, 307-310.
Bossuyt P. M., Reitsma J. B., Bruns D. E., Gatsonis C. A., Glasziou P. P., Irwig L., Lijmer J. G., Moher D., Rennie D., de Vet H. C., Kressel H. Y., Rifai N., Golub R. M., Altman D. G., Hooft L., Korevaar D. A., Cohen J. F (2015) STARD 2015: An updated list of essential items for reporting diagnostic accuracy studies. British Journal of Sports Medicine 351, 5527.
Brickwood K. J., Watson G., O'Brien J., Williams A. D (2019) Consumer-based wearable activity trackers increase physical activity participation: Systematic review and meta-analysis. JMIR mHealth and uHealth 7, e11819.
Casado-Robles C., Viciana J., Guijarro-Romero S., Mayorga-Vega D (2022) Effects of consumer-wearable activity tracker-based programs on objectively measured daily physical activity and sedentary behavior among school-aged children: A systematic review and meta-analysis. Sports Medicine-Open 8, 18.
Cicchetti D (2001) The precision of reliability and validity estimates re-visited: Distinguishing between clinical and statistical significance of sample size requirements. Journal of Clinical and Experimental Neuropsychology 23, 695-700.
Cohen J (1992) A power primer. Psychological Bulletin 112, 155-159.
Cole T. J., Bellizzi M. C., Flegal K. M., Dietz W. H (2000) Establishing a standard definition for child overweight and obesity worldwide: International survey. British Journal of Sports Medicine 320, 1240-1243.
Cosoli G., Spinsante S., Scalise L (2020) Wrist-worn and chest-strap wearable devices: Systematic review on accuracy and metrological characteristics. Measurement 159, 107789.
Crossley S., McNarry M. A., Rosenberg M., Knowles Z. R., Eslambolchilar P., Mackintosh K. A (2019) Understanding youths’ ability to interpret 3D-printed physical activity data and identify associated intensity levels: Mixed-methods study. Journal of Medical Internet Research 21, e11253.
Da Silva M. P., Fontana F. E., Callahan E., Mazzardo O., De Campos W (2015) Step-count guidelines for children and adolescents: A systematic review. Journal of Physical Activity and Health 12, 1184-1191.
Degroote L., Hamerlinck G., Poels K., Maher C., Crombez G., De Bourdeaudhuij I., Vandendriessche A., Curtis R. G., DeSmet A (2020) Low-cost consumer-based trackers to measure physical activity and sleep duration among adults in free-living conditions: Validation study. JMIR mHealth and uHealth 8, e16674.
Dixon P. M., Saint-Maurice P. F., Kim Y., Hibbing P., Bai Y., Welk G. J (2018) A primer on the use of equivalence testing for evaluating measurement agreement. Medicine and Science in Sports and Exercise 50, 837-845.
Duncan M. J., Wunderlich K., Zhao Y., Faulkner G (2018) Walk this way: Validity evidence of iPhone health application step count in laboratory and free-living conditions. Journal of Sports Sciences 36, 1695-1704.
Duncan M., Birch S., Woodfield L (2012) Efficacy of an integrated school curriculum pedometer intervention to enhance physical activity and to reduce weight status in children. European Physical Education Review 18, 396-407.
El-Amrawy F., Nounou M. I (2015) Are Currently Available Wearable Devices for activity tracking and heart rate monitoring accurate, precise, and medically beneficial?. Healthcare Informatics Research 21, 315-320.
Evenson K. R., Catellier D. J., Gill K., Ondrak K. S., McMurray R. G (2008) Calibration of two objective measures of physical activity for children. Journal of Sports Sciences 26, 1557-1565.
Evenson K. R., Goto M. M., Furberg R. D (2015) Systematic review of the validity and reliability of consumer-wearable activity trackers. International Journal of Behavioral Nutrition and Physical Activity 12, 159.
Feehan L. M., Geldman J., Sayre E. C., Park C., Ezzat A. M., Yoo J. Y., Hamilton C. B., Li L. C (2018) Accuracy of Fitbit devices: Systematic review and narrative syntheses of quantitative data. JMIR mHealth and uHealth 6, e10527.
Fokkema T., Kooiman T. J., Krijnen W. P., Van Der Schans C. P., De Groot M (2017) Reliability and validity of ten consumer activity trackers depend on walking speed. Medicine and Science in Sports and Exercise 49, 793-800.
Fuller D., Colwell E., Low J., Orychock K., Tobin M. A., Simango B., Buote R., Van Heerden D., Luan H., Cullen K., Slade L., Taylor N (2020) Reliability and validity of commercially available wearable devices for measuring steps, energy expenditure, and heart rate: Systematic review. JMIR mHealth and uHealth 8, e18694.
Gal R., May A. M., Van Overmeeren E. J., Simons M., Monninkhof E. M (2018) The effect of physical activity interventions comprising wearables and smartphone applications on physical activity: A systematic review and meta-analysis. Sports Medicine-Open 4, 1.
Gorzelitz J., Farber C., Gangnon R., Cadmus-Bertram L (2020) Accuracy of wearable trackers for measuring moderate- to vigorous-intensity physical activity: A systematic review and meta-analysis. Journal for the Measurement of Physical Behaviour 3, 346-357.
Guthold R., Stevens G. A., Riley L. M., Bull F. C (2020) Global trends in insufficient physical activity among adolescents: a pooled analysis of 298 population-based surveys with 1.6 million participants. The Lancet Child and Adolescent Health 4, 23-35.
Hao Y., Ma X. K., Zhu Z., Cao Z. B (2021) Validity of wrist-wearable activity devices for estimating physical activity in adolescents: Comparative study. JMIR mHealth and uHealth 9, e18320.
Henriksen A., Haugen Mikalsen M., Woldaregay A. Z., Muzny M., Hartvigsen G., Hopstock L. A., Grimsgaard S (2018) Using fitness trackers and smartwatches to measure physical activity in research: Analysis of consumer wrist-worn wearables. Journal of Medical Internet Research 20, e110.
Hernaez R (2015) Reliability and agreement studies: A guide for clinical investigators. British Journal of Sports Medicine Gut 64, 1018-1027.
Hickey A., John D., Sasaki J. E., Mavilia M., Freedson P (2016) Validity of activity monitor step detection is related to movement patterns. Journal of Physical Activity and Health 13, 145-153.
International Data Corporation. (2021) European Wearable Market Showed Positive Growth in 1Q21, Says IDC. Available at: https://www.idc.com/getdoc.jsp?containerId=prEUR147995121 (Accessed: 15 January 2022).
Johnston W., Judice P. B., Molina García P., Mühlen J. M., Lykke Skovgaard E., Stang J., Schumann M., Cheng S., Bloch W., Brønd J. C., Ekelund U., Grøntved A., Caulfield B., Ortega F. B., Sardinha L. B (2021) Recommendations for determining the validity of consumer wearable and smartphone step count: expert statement and checklist of the INTERLIVE network. British Journal of Sports Medicine 55, 780-793.
Kang S., Kim Y., Byun W., Suk J., Lee J. M (2019) Comparison of a wearable tracker with actigraph for classifying physical activity intensity and heart rate in children. International Journal of Environmental Research and Public Health 16, 2663.
Koo T., Li M (2016) A guideline of selecting and reporting intraclass correlation coefficients for reliability research. Journal of Chiropractic Medicine 15, 155-163.
Kottner J., Audigé L., Brorson S., Donner A., Gajewski B. J., Hróbjartsson A., Roberts C., Shoukri M., Streiner D. L (2011) Guidelines for reporting reliability and agreement studies (GRRAS) were proposed. Journal of Clinical Epidemiology 64, 96-106.
LaMunion S. R., Blythe A. L., Hibbing P. R, Kaplan A. S., Clendenin B. J., Crouter S. E (2020) Use of consumer monitors for estimating energy expenditure in youth. Applied Physiology Nutrition and Metabolism 45, 161-168.
Landis J. R., Koch G. G (1977) The measurement of observer agreement for categorical data. Biometrics 33, 159-174.
Lee J. A., Williams S. M., Brown D. D., Laurson K. R (2015) Concurrent validation of the Actigraph gt3x+, Polar Active accelerometer, Omron HJ-720 and Yamax Digiwalker SW-701 pedometer step counts in lab-based and free-living settings. Journal of Sports Sciences 33, 991-1000.
Maher C., Ryan J., Ambrosi C., Edney S (2017) Users’ experiences of wearable activity trackers: A cross-sectional study. BioMed Central Public Health 17, 880.
Mayorga-Vega D., Casado-Robles C., López-Fernández I., Viciana J (2021) A comparison of the utility of different step-indices to translate the physical activity recommendation in adolescents. Journal of Sports Sciences 39, 469-479.
Mayorga-Vega D., Casado-Robles C., Viciana J., López-Fernández I (2019) Daily step-based recommendations related to moderate-to-vigorous physical activity and sedentary behavior in adolescents. Journal of Sports Science and Medicine 18, 586-595.
Migueles J. H., Cadenas-Sanchez C., Ekelund U., Delisle Nyström C., Mora-Gonzalez J., Löf M., Labayen I., Ruiz J. R., Ortega F. B (2017) Accelerometer data collection and processing criteria to assess physical activity and other outcomes: A systematic review and practical considerations. Sports Medicine 47, 1821-1845.
Nunnally, J. (1978) Psychometric theory. 2nd edition. New york: McGraw-Hill.
O'Neill B., McDonough S. M., Wilson J. J., Bradbury I., Hayes K., Kirk A., Kent L., Cosgrove D., Bradley J. M., Tully M. A (2017) Comparing accelerometer, pedometer and a questionnaire for measuring physical activity in bronchiectasis: a validity and feasibility study?. Respiratory Research 18, 16.
OECD. (2019) OECD Future of Education 2030. Making Physical Education dynamic and inclusive for 2030. International curriculum analysis. OECD Publishing.
Oliver M., Badland H. M., Schofield G. M., Shepherd J (2011) Identification of accelerometer nonwear time and sedentary behavior. Research Quarterly for Exercise and Sport 82, 779-783.
Paluch A. E., Bajpai S., Bassett D. R., Carnethon M. R., Ekelund U., Evenson K. R., Fulton J. E (2022) Daily steps and all-cause mortality: a meta-analysis of 15 international cohorts. The Lancet Public Health 7, e219-e228.
Parra Saldías M., Mayorga-Vega D., López-Fernández I., Viciana J (2018) How many daily steps are really enough for adolescents? A cross-validation study. Retos 33, 241-246.
Poitras V., Gray C., Borghese M., Carson V., Chaput J., Janssen I., Katzmarzyk P. T., Pate R. R., Connor Gorber S., Kho M. E., Sampson M., Tremblay M. S (2016) Systematic review of the relationships between objectively measured physical activity and health indicators in school-aged children and youth. Applied Physiology, Nutrition, and Metabolism 41, 197-239.
Redenius N., Kim Y., Byun W (2019) Concurrent validity of the Fitbit for assessing sedentary behavior and moderate-to-vigorous physical activity. BioMed Central Medical Research Methodology 19, 29.
Romanzini M., Petroski E. L., Ohara D., Dourado A. C., Reichert F. F (2014) Calibration of ActiGraph GT3X, Actical and RT3 accelerometers in adolescents. European Journal of Sport Science 14, 91-99.
Rowlands A. V., Harrington D. M., Bodicoat D. H., Davies M. J., Sherar L. B., Gorely T., Khunti K., Edwardson C. L (2018) Compliance of adolescent girls to repeated deployments of wrist-worn accelerometers. Medicine and Science in Sports and Exercise 50, 1508-1517.
Schneider M., Chau L (2016) Validation of the Fitbit Zip for monitoring physical activity among free-living adolescents. BioMed Central Research Notes 9, 448.
Shephard, R. and Tudor-Locke, C. (2016) The objective monitoring of physical activity: Contributions of accelerometry to epidemiology, exercise science and rehabilitation. 1st edition. Switzerland:Springer.
ŠimŠ¯nek A., Dygrýn J., Jakubec L., Neuls F., Frömel K., Welk G. J (2019) Validity of Garmin Vívofit 1 and Garmin Vívofit 3 for school-based physical activity monitoring. Pediatric Exercise Science 31, 130-136.
Stamm A., Hartanto R (2018) Feature extraction from mems accelerometer and motion tracking measurements in comparison with smart bands during running. Proceedings 2, 197-202.
Stewart, A., Marfell-Jones, M., Olds, T. and De Ridder, J. (2011) International standards for anthropometric assessment. New Zealand: International Society for the Advancement of Kinanthropometry.
Strath S. J., Rowley T. W (2018) Wearables for Promoting Physical Activity. Clinical Chemistry 64, 53-63.
Toth L. P., Park S., Springer C. M., Feyerabend M. D., Steeves J. A., Bassett D. R (2018) Video-recorded validation of wearable step counters under free-living conditions. Medicine and Science in Sports and Exercise 50, 1315-1322.
Trost S. G., Loprinzi P. D., Moore R., Pfeiffer K. A (2011) Comparison of accelerometer cut points for predicting activity intensity in youth. Medicine and Science in Sports and Exercise 43, 1360-1368.
Tudor-Locke C., Craig C. L., Beets M. W., Belton S., Cardon G. M., Duncan S., Hatano Y., Lubans D. R., Olds T. S., Raustorp A., Rowe D. A., Spence J. C., Tanaka S., Blair S. N (2011) How many steps/day are enough? for children and adolescents. International Journal of Behavioral Nutrition and Physical Activity 8, 78.
Vailshery, L. S. (2021) Wearables Statistics & facts. Availabe at: https://www.statista.com/topics/1556/wearable-technology (Accessed: 15 January 2022).
Van Hecke L., Loyen A., Verloigne M., van der Ploeg H. P., Lakerveld J., Brug J., De Bourdeaudhuij I., Ekelund U., Donnelly A., Hendriksen I., Deforche B (2016) Variation in population levels of physical activity in European children and adolescents according to cross-European studies: a systematic literature review within DEDIPAC. International Journal of Behavioral Nutrition and Physical Activity 13, 70.
Voss C., Gardner R. F., Dean P. H., Harris K. C (2017) Validity of commercial activity trackers in children with congenital heart disease. Canadian Journal of Cardiology 33, 799-805.
Westerterp K. R (2013) Physical activity and physical activity induced energy expenditure in humans: measurement, determinants, and effects. Frontiers in Physiology 4, 90.
Westerterp K. R (2017) Doubly labelled water assessment of energy expenditure: Principle, practice, and promise. European Journal of Applied Physiology 117, 1277-1285.
Willmott C., Matsuura K (2005) Advantages of the mean absolute error (MAE) over the root mean square error (RMSE) in assessing average model performance. Climate Research 30, 79-82.
World Health Organization. (2020) WHO guidelines on physical activity and sedentary behaviour. Geneva: World Health Organization.
Yang X., Jago R., Zhang Q., Wang Y. Y., Zhang J., Zhao W. H (2019) Validity and reliability of the wristband activity monitor in free-living children aged 10-17 years. Biomedical and Environmental Sciences 32, 812-822.
The International Data Corporation's Worldwide Quarterly Wearable Device Tracker reports (2021) Available from URL: https://www.idc.com/getdoc.jsp?containerId=IDC_P31315








Back
|
PDF
|
Share