I should not have used the word "skill" in my description of this model chart because it is measured in absolute error which by definition is different than "skill" compared to climatology.
"Absolute error" tells us about the size of the forecast error based on the difference between the forecast conditions and what actually happens. Using temperature as an example, absolute error does not tell you whether the forecast was warmer or cooler than what actually happened; it only tells you about the size of the error. In other words, a forecast that is 5 degrees Fahrenheit too high would have the same absolute error as a forecast that is 5 degrees Fahrenheit too low.
"Skill" compared to climatology are forecasts that have more accuracy than a generic "climatology" forecast of 30-year normal conditions. If the forecast is less accurate than using a forecast of climatological normals, then the forecast has no skill and is essentially useless.
It would be interesting to chart the "skill" of the ensembles but for NWP, absolute error is easier for verification purposes.