Revisiting Who's The Best. Should we use RMSE, MSE or Accuracy?
/A few years back, I authored a blog post in which I deftly presented the case for the superiority of MAE over Accuracy for identifying the most talented forecasters.
Read MoreA few years back, I authored a blog post in which I deftly presented the case for the superiority of MAE over Accuracy for identifying the most talented forecasters.
Read MoreLast season, we had the first partially player-based forecaster, MoSHPlay, which provided forecasts for game margins and game totals, and estimates of home team victory probabilities. It performed reasonably well in a year that was, in many ways, completely unlike those it had been trained on.
It used as inputs the margin and team score forecasts of MoSHBODS, and player ratings derived from historical SuperCoach scores.
In this blog I’ll take you through the process of reviewing the existing MoSHPlay forecasting equations, and investigate the efficacy of deriving player ratings from AFL Player Rating scores rather than from SuperCoach scores.
Read MoreIt feels slightly surreal that it’s only been a little over 12 months since last we did it, but it’s time to, once again, review the schedule for the men’s footy season ahead.
Read MorePerhaps naively in retrospect, I had hoped that the changes I made to the MoS twin algorithms in preparation for the 2020 season would be the last I’d need to make for a while. But, the unusual nature of 2020 fixturing highlighted some characteristics of both Systems that I thought needed redressing, so 2021 will see new versions of MoSSBODS and MoSHBODS providing key forecasts.
Read MoreEven as a casual punter, you’d probably be aware of the availability of same-game multis which, put simply, allow you to construct a wager out of multiple “legs” (individual bets), usually up to some limit.
Read MoreSometimes, data comes at you fast …
Read MoreThis week the topic of what characteristics of a game of men’s AFL football might be correlated with its attendance was discussed on Twitter, which caused me to review this blog from 2015 where I looked at the same topic.
Read MoreThe fine folk who brought us the fitzRoy R package have been diligently collecting historical AFL Player Rating data with a view to potentially including it in an upcoming version of the package, and asked me to take a look at what they have so far, which spans the period from 2012 to the end of 2019.
Read MoreIn the normal course of things, it would have taken me months to create a simulator that I was happy with, but the current situation has given me larger blocks of time to devote to the problem than would otherwise have been the case, so the development process has been, as the business world loves to say, “fast-tracked”.
The new version is somewhat similar to the one I wrote about in this earlier blog, but different in a number of fundamental ways, each of which we’ll address during the remainder of this blog.
Read MoreThis past week, in between some pieces of client work, I’ve been coming up with a workable methodology for creating a score-by-score men’s AFL simulator that will be as faithful as possible to the actual scoring behaviour we’ve observed in recent seasons.
Over the next few blogs I’ll be describing the process of building this simulator which, let me stress immediately, is not yet finished. From what I’ve been able to create so far, I think the general approach I’m following is viable, but we’ll only see just how viable when it’s done.
Read MoreOver the past couple of blogs, we’ve been analysing historical scoring progressions to come up with archetypical game types in terms of the ebb-and-flow of the game margin.
To do that, we treated the score progressions as time series data and today we’ll do something similar with teams’ season-by-season historical MoSH2020 Team Ratings for the period 1970 to 2019, inclusive.
Read MoreNow it might be that you’re indifferent about whether it was the home or the away team that was leading at any point, but instead care about the size of the margin from the point of view of the team that eventually won.
In today’s blog we’re going to revisit the analysis of the previous blog, using the same data, but looking at it from the viewpoint of the winning teams.
(Thanks to Daniel from InsightLane for this idea. You can find all of the score progressions used here on the ScoreWorm page of his website.)
Read MoreIt’s been a while since I’ve felt I’ve had much time to do more than post the outputs of any football analyses to Twitter, but sheltering-in-place and something of a lull in my consulting work has left me with a bit of time to redress that.
Read MoreThe MoS twins - MoSSBODS and MoSHBODS - have both performed well these past few seasons, but there are some aspects of them that I’ve wanted to improve for a while, and I’ve decided that 2020 is the time to do that.
Read MoreIn this blog post we'll use the same methodology but replace the static, start-of-season MoSHBODS data with the dynamic Ratings and VPVs that each team's opponents carried into the respective game to assess who, in hindsight, had easier or more difficult schedules than we assessed initially.
Read MoreThis year we’ll use the same methodology as we did last year to analyse the AFL Fixture for the upcoming season. Details of that methodology are in this post from 2015 and this one from the year before. We’ll again use the MoSHBODS Team Rating System to provide the estimates of relative team ability and venue effects.
Read MoreMomentum. It's something that you're almost guaranteed to hear mentioned at least once in any sporting commentary, and it's a topic that continues to pique my interest. Whether or not it exists remains a legitimate subject for debate, but detecting it, if it does exist, has been elusive.
Read MoreMoSHBODS has tipped the Tigers to win by about 16 points on Saturday, but Giants supporters shouldn’t be too concerned about that, because MoSHBODS’ record in Grand Finals since 2000 isn’t particularly impressive, as you can see from the table below.
Read MoreIn the previous blog, we created a quantile regression model that allowed us to estimate, in-running, a home team’s victory probability, and to create in-running confidence intervals for the home team’s final margin.
We evaluated that model based on a variety of performance metrics calculated using a 50% holdout sample from the original data set, which included games spanning the 2008 to 2016 period.
But nothing really measures a model’s performance better than a completely fresh data set from a non-overlapping time period, and in this blog we’ll be running the same metrics, but for games spanning the 2017 to 2019 period (up to and including the first week of the 2019 Finals). That’s 616 games entirely unseen by the model.
Read MoreMAFL is a website for ...