Research article - (2006)05, 561 - 566 |
An Analysis of Ten Years of the Four Grand Slam Men’s Singles Data for Lack of Independence of Set Outcomes |
Graham Pollard1,, Rod Cross2, Denny Meyer3 |
Key words: Data analysis, independence in tennis, constant probabilities, psychological development |
Key Points |
|
|
|
Assuming without loss of generality that player A is the better player, the results of a best-of-five sets singles match can be recorded as WWW, WWLW, WLWW, LWWW, WWLLW, WLWLW, WLLWW, LWWLW, LWLWW, LLWWW, and LLL, LLWL, LWLL, WLLL, LLWWL, LWLWL, LWWLL, WLLWL, WLWLL and WWLLL where W represents a set won by player A, and L represents a set lost by player A. When we do not know who the better player is, a win in three sets for example (WWW or LLL above) is simply a win WWW to the winner of the match (not necessarily player A). Thus, when we do not know who the better player is, the above twenty outcomes reduce to the ten mutually exclusive outcomes WWW, WWLW, WLWW, LWWW, WWLLW, WLWLW, WLLWW, LWWLW, LWLWW and LLWWW where W represents a set won by the eventual winner of the match and L represents a set lost by the eventual winner. The data consisted of ten years of men’s singles grand slam results. There were 4883 matches in total, and spurious data such as matches where one player ‘retired’ (presumably injured) before the match was finished were omitted. The number of matches in each of the above categories was: WWW 2330; WWLW 503; WLWW 487; LWWW 609; WWLLW 151; WLWLW 135; WLLWW 186; LWWLW 138; LWLWW 156; LLWWW 188 The first model fitted involved a constant probability, p, of player A (the notionally or theoretically better player) winning each set. A short and simple search using a spreadsheet showed that the value of p which minimized Chi-Squared was 0.769, and the results are given in The value of Chi-Squared was 38.68 with 8 degrees of freedom, so the fit is a poor one. This is not surprising as a constant p-value for all matches is clearly unrealistic. It can be seen from the Obs-Exp column in In order to attempt to overcome the shortage of three and five sets matches expected under the above model, it was decided to model the data using two p values, one greater than 0.769 and the other less than it, and combine the results. The value greater than 0.769 would increase the proportion of three set matches, and the value less than 0.769 would increase the proportion of five set matches. Thus, for simplicity, the data was modeled as consisting of 2 types of matches-‘close’ matches (with p less than 0.769) and ‘not- so- close’ matches (with p greater than 0.769). Half the matches were assumed to be ‘close’, and half ‘not- so- close’. Symmetric values about 0.769, p1 and p2, were considered, and the two p values which minimized Chi-Squared were identified. These two values were p1 = 0.705 and p2 = 0.833. The results for this model are given in The value of Chi-Squared for this model was 35.59 with 7 degrees of freedom, so the fit is again a poor one. Whilst this is a better fit with respect to the proportion of three and five set matches, the number of LWWW matches is still underestimated under this model, as is the number of WLLWW and LLWWW matches. It is noted here as an aside that if we remove the restriction that exactly half of the matches have a p-value of p1 and half of them have the value p2 whilst keeping p1 = 0.705 and p2 = 0.833, a slightly smaller value of chi-squared can be obtained. The lowest Chi-Squared value obtained was 34.35 with 6 degrees of freedom when the proportion of matches with p1 = 0.705 was 0.53, and the proportion of matches with p2 = 0.833 was 0.47. Thus, for this model (and indeed for the others considered in this paper), modifying the proportion of ‘close’ and ‘not- so- close’ matches had negligible effect on the Chi-Squared values. For this reason, no further reports on this modification are given in this paper. It can be seen from It can be seen from Given that p1 and p2 are increased by D1 or D2 in certain situations, it seemed appropriate, in order to get a reasonable overall fit, to lower both their ‘starting’ values (ie, those for set1) from those in The value of Chi-Squared was 1.83 with 5 degrees of freedom, so the fit is a good one indicating that the model fits the data well. In order to carry out a simple check on the model, it was decided to break the data into two time periods (1995-1999 and 2000-2004), and check for consistency across the periods ( There appeared to be no evidence in the data that the weaker player could lift his game in situations where it would have been useful for him to do so. |
|
|
The 4883 completed men’s singles matches at grand slam tournaments for the period 1995-2004 have been analysed to test the hypothesis that the probability of winning a set within a match is constant. This hypothesis was rejected. A model which fits the data well has been found. It is a model in which the better player lifts his probability of winning a set in certain situations. These situations are (i)when he is behind in the set score, needs to lift his game, and lifts his probability of winning the next set by (on average) 0.035, (ii)when he has just won a set, is ‘on-a-run’’, and lifts his probability of winning the next set by (on average) 0.035, and (iii)when he has just lost two sets in a row, desperately needs to lift his game, and lifts his probability of winning each remaining set by (on average) 0.110. |
|
|
The results of this study are quite encouraging for the better player, but perhaps somewhat discouraging for the weaker player. The findings indicate that the weaker player needs to be ‘on his guard’ for a change in fortunes when the match is ‘going well’ for him. The results of the analysis in this paper show that often the better player can increase his probability of winning a set by quite a substantial amount when it is really necessary to do so in order to reduce his probability of losing the match. A set can often be won rather than lost by winning just one, two, or a few particular important points (Morris, Further studies might include whether women’s matches (although only best-of-three sets) have comparable characteristics or whether there are gender differences in this regard. It would appear that the methodology used in this paper has a range of sporting applications, particularly for the often occurring situation in which the better player or team does not always win a match, or the ‘best’ player or team does not always win a series of matches. Another area of application might be assessment in which the ‘best’ student (or persons being assessed) does not always come first. |
Conclusions |
The conclusion is that matches turn around in favour of the better player significantly more often than would be expected under the usual randomness/independence assumptions of probability. As each point is a ‘zero-sum’ situation for the two players, it is not strictly possible to tell from just the statistical records whether this ‘turn-around’ characteristic is because the better player lifts his play or because the weaker player lowers his play. Nevertheless, it is useful for both players to know of the existence of this phenomenon as any player (except the best player in the world) should sometimes be the better player and sometimes the weaker on the court. The better player can take advantage of it, and the weaker player needs to guard against it. |
AUTHOR BIOGRAPHY |
|
REFERENCES |
|