|
|
|
![]() |
|
|||||||
|
||||||||
![]() |
|
|
Thread Tools |
Rating:
|
Display Modes |
|
|
|
#1
|
||||
|
||||
|
Re: Incorporating Opposing Alliance Information in CCWM Calculations
Quote:
Thanks, it turns out that we were using different WMPR values. I redownloaded Ether's attachment and it contains different values. Maybe the initial download was corrupted? I'm baffled. Anyways, I can confirm that after redownloading the correct WMPR values that I get your results. |
|
#2
|
||||
|
||||
|
Re: Incorporating Opposing Alliance Information in CCWM Calculations
Quote:
OPR A and b are posted here. Last edited by Ether : 26-05-2015 at 15:24. |
|
#3
|
||||
|
||||
|
Re: Incorporating Opposing Alliance Information in CCWM Calculations
And now for the data where the single testing point was removed from the training data, then the model was computed, then the single testing point was evaluated, and this was repeated 80 times. So the results below are for separate training and testing data.
Stdev of prediction residual of the winning margins: OPR: 34.6 CCWM: 30.5 WMPR: 30.4 (note that on the testing data from a few posts ago, WMPR had a Stdev of 15.9, so this is an argument that WMPR is "overfitting" the small amount of data available and that it could benefit from having more matches per team) # of matches predicted correctly (out of 79 possible) OPR: 63 CCWM: 55 WMPR: 58 So here, the WM-based measures are both better at predicting the winning margin itself but not at predicting match outcomes. CCWM and WMPR have almost identical prediction residual standard deviations but WMPR is slightly better at match outcome prediction in this particular example for some reason. Again, it would be great to test this on some 2014 data where there was more defense. Last edited by wgardner : 26-05-2015 at 11:20. |
|
#4
|
||||
|
||||
|
Re: Incorporating Opposing Alliance Information in CCWM Calculations
Quote:
Otherwise, if someone can provide the the 2014 qual match data in CSV format, I can quickly generate all the A and b for WMPR, OPR, and CCWM and post it here. |
|
#5
|
||||
|
||||
|
Re: Incorporating Opposing Alliance Information in CCWM Calculations
Quote:
For fun, I computed the standard deviation of the prediction residual of the testing data not in the training set using the WMPR solution, 0.9*WMPR, 0.8*WMPR, etc. The standard deviation of the prediction residual of the winning margin for the test data for this particular tournament was minimized by 0.7*WMPR, and that standard deviation was down to 28.4 from 30.4 for the unscaled WMPR. So again, more evidence that the WMPR is overfit and could benefit from additional data. This doesn't change the match outcome prediction that some folks are interested in, since scaling all of the WMPRs down doesn't change the sign of the predicted winning margin which is all the match outcome prediction is. |
|
#6
|
||||
|
||||
|
Re: Incorporating Opposing Alliance Information in CCWM Calculations
Attached are A, b, and T for 2015 OPR, CCWM, and WMPR. Also introducing a funky new metric, EPR, which uses 3 simultaneous equations for each match: 1) r1+r2+r3-b1-b2-b3 = RS-BS 2) r1+r2+r3=RS 3) b1+b2+b3=BS ... and solves all the equations simultaneously. |
|
#7
|
||||
|
||||
|
Re: Incorporating Opposing Alliance Information in CCWM Calculations
Quote:
How do you interpret these new EPR values? They look closer to an OPR than a WM-measure as #2 and #3 both only factor in offense. How will you measure its performance? Perhaps by comparing the overall residual of all 3 combined vs. other ways of predicting all 3 (e.g., using WMPR for #1 and standard OPR for #2 and #3)? |
|
#8
|
||||
|
||||
|
Re: Incorporating Opposing Alliance Information in CCWM Calculations
Quote:
I like this idea of solving all 3 equations simultaneously. WMPR is a good improvement over CCWM because winning margin depends on who you are playing against (except in 2015), but I think EPR is even better. I will adopt it after you guys finish your validation on how well it predicts outcome. I like this EPR because it is one number instead of two. It can replace both OPR and WMPR. The problem with OPR is similar to the problem with CCWM. It does not take into account of who the opponents were. If you play against stronger opponents, you may not be able to score as many points, especially in those years with limited game pieces. Equation 1 will take care of that. It will improve on the line fitting. To me, I would interpret EPR as how many points a team will be able to score with typical opponents on the field. This eliminates the error of match schedule strength due to luck of the draw. A team may have higher than normal score because they faced weaker opponents more often. That would skew the OPR numbers. I think EPR would be more accurate in predicting match scores. Would somebody like to test it out? Another reason I like EPR is that it is easier to compute without all that SVD stuff. I would prefer high school students to be able to understand and implement this on their own. |
|
#9
|
||||
|
||||
|
Re: Incorporating Opposing Alliance Information in CCWM Calculations
Quote:
| 2 2 2 -1 -1 -1 | |r1| = |2RS - BS| | 2 2 2 -1 -1 -1 | |r2| = |2RS - BS| | 2 2 2 -1 -1 -1 | |r3| = |2RS - BS| | -1 -1 -1 2 2 2 | |b1| = |2BS - RS| | -1 -1 -1 2 2 2 | |b2| = |2BS - RS| | -1 -1 -1 2 2 2 | |b3| = |2BS - RS| |
|
#10
|
|||
|
|||
|
Re: Incorporating Opposing Alliance Information in CCWM Calculations
|
|
#11
|
||||
|
||||
|
Re: Incorporating Opposing Alliance Information in CCWM Calculations
Quote:
x = pinv(A,tol)*b; pinv() is explained in detail here: http://www.mathworks.com/help/matlab/ref/pinv.html (well worth reading; explains the interesting difference between x1=pinv(A) and x2=A\b) |
![]() |
| Thread Tools | |
| Display Modes | Rate This Thread |
|
|