The kinematic analysis of sports and clinical movements provides useful information to athletes and coaches for evaluating the technical performance during competitions and training sessions. In every sport, including swimming and other aquatic disciplines, this information can be used to optimize training activities (Ito and Okuno, 2010; Pogalin et al., 2007; Slawson et al., 2010). Kinematic analysis requires excellent accuracy and robustness of the methods used for data collection, as even little variations in kinematic indices can be important. Thus, there is a great interest in developing measuring techniques for a highly accurate and sensitive analysis of human movements. The majority of the methods for kinematic analysis are based on markers, attached or fixed to the human body that enable tracking of specific anatomical landmarks. In dry conditions, passive markers are commonly used, consisting of discs or spheres of different sizes covered by retro-reflective tape (Berthouze and Mayston, 2011; Cappozzo et al., 1995; Davis et al., 1991; Frigo et al., 1998; Knuesel et al., 2005). Recently, Kjendlie and Olstad, 2012 evaluated an automatic motion capture system designed to analyze human swimming using spherical markers with a diameter of 19 mm, reporting a 7% to 10% increase in the passive drag due to the resistance exerted by the 24 markers attached to the swimmer. Although further data are needed to support their findings, the formers may lead to the conclusion that the use of markers of non-negligible volume in the water is questionable. Indeed, an increased passive drag could negatively affect the performance of the swimmer. A possible approach to avoid increment of passive drag consists in replacing the spherical markers with bi-adhesives placed on the swimsuit, or with non-reflective markers drawn on the swimmer’s skin (Ceccon et al., 2013; McCabe et al., 2011; McCabe and Sanders, 2012). When this setup is used, movements are filmed through conventional underwater cameras, and the resulting video recordings are analyzed using specific software for tracking of features. Manual tracking represents the roughest solution to analyze movements performed in the water. However, this tracking method requires an extensive amount of time. For example, Psycharakis and Sanders, 2008 used manual digitation to analyze 19 anatomical landmarks for 4 stroke cycles in 10 swimmers performing a 200-m front-crawl trial. The mean stroke frequency was 0.74 Hz, involving a total duration of approximately 5.4 s for the examined fraction (4 cycles) of each swim. Six cameras at 50 frames per second were used, therefore, about 1620 frames digitized for each swimmer. Although not all the markers had to be digitized in each frame (some were not visible by one or more cameras), a well-trained operator would have reasonably used no less than one minute per frame, involving a total digitation time of 27 hours for each swimmer. Therefore, the availability of appropriate software for automatic tracking would represent a significant advance in the practical use of kinematic analysis in swimming and other aquatic sports. This approach would provide a quick feedback on the kinematic characteristics of swimming to swimmers and coaches. Several commercial computer software are available to track the markers, measure the kinematic variables, and present the data intuitively to coaches and athletes (Barris and Button, 2008). Although many of these software have tools to perform automatic tracking of markers, little is known about their algorithms and techniques of analysis. The awareness of this information is essential to optimize the automatic tracking procedure in every environments and conditions. While the process of tracking has been thoroughly established already, it is not necessarily easy to select features that can be tracked properly (Shi and Tomasi, 1994), therefore a wide variety of algorithms has been proposed that aim at developing the most robust algorithm with less computational time. Common tracking methods are the “point tracking”, based on the detection of points that represents objects in consecutive frames, and the “kernel tracking”, representing the objects as primitive shapes and computing their motion from frame to frame (Yilmaz et al., 2006). To the authors’ knowledge, only Figueroa et al., 2003 described in detail the algorithms used in a software program (DVideo) designed for automatic tracking of markers in human motion analysis. In that software, based on the kernel tracking, the motion is computed by template matching where a similarity measure, e.g. cross correlation, is used to search for the object template in each image. While this kind of approach has a number of advantages for the analysis of sports gestures under normal visibility conditions (Figueroa et al., 2003), it may not prove to be as much effective when dealing with underwater images. The analysis of video recordings performed in an underwater environment involves additional difficulties linked to the small target size, the low image quality and the presence of background clutters. With high probability, these aspects make underwater tracking process harder. An alternative tracking approach, that could prove to be useful in handling the typical difficulties of underwater motion analysis, is represented by optical flow techniques, such as the popular Kanade-Lucas-Tomasi (KLT) tracking (Lucas and Kanade, 1981; Tomasi and Kanade, 1991). Optical flow is a flexible representation of visual motion that is particularly suitable for computers analysing digital images. The algorithm explicitly optimizes the tracking performance by classifying a feature as appropriate if it can be tracked successfully. This technique is reported to be the most efficient and accurate among optical flow techniques in terms of average angular deviations from the correct space-time orientation (Barron et al., 1994). The method based on optical flow is complex, but it can detect the motion accurately even without knowing the background (Lu et al., 2007). In this context KLT had proven to be accurate and efficient in computing optical flow (Barron et al., 1994; Galvin et al., 1998; Liu et al., 1996;Lu et al., 2007). Barron et al., 1994 compared the accuracy of different optical flow techniques on both real and synthetic image sequences, and found the KLT the most reliable one. His findings were confirmed by Liu et al., 1996 who analyzed the accuracy and the efficiency trade-offs in various optical flow algorithms, and showed that KLT presents better accuracy with reduced computation time. Finally, Galvin et al., 1998 evaluated eight optical flow algorithms and concluded that the KLT method consistently produces accurate depth maps with a low computational time, showing good tolerance to the presence of noise. More recently, this technique has proven to be the most efficient in automatically estimating vehicle speed from video sequences acquired with a fixed mounted camera for its robustness in presence of noise (Shukla and Patel, 2013). Therefore, our hypothesis was that the KLT feature tracker could be the most appropriate technique for tracking underwater images. The aim of the present study was to evaluate the effectiveness of a software for automatic tracking of user-defined features in underwater conditions. The software has been developed starting from a free open-source implementation of the KLT feature tracker (Sinha et al., 2011), and was already used in a previous study that analysed the front crawl swimming (Ceccon et al., 2013). It is recognized that an important characteristic of an effective tracking algorithm is a limited amount of manual interventions required throughout the tracking process (Chiari et al., 2005; Figueroa et al., 2003). The percentage of manual interventions necessary to supervise the automatic tracking was, therefore, chosen as the criterion to evaluate the algorithm. |