Win Probabilities and Their Role in Constructing Player Won-Lost Records
(1)   Given a Base-Out Transition Matrix, one can compute a Base-Out Probability Matrix.Steps 1 through 4 above are explained next.
(2)   Given a Base-Out Probability Matrix, one can compute a Run Probability Matrix.
(3)   Given a Run Probability Matrix, one can compute an Inning Probability Matrix.
(4)   Given a Run Probability Matrix and an Inning Probability Matrix, one can compute a Win Probability Matrix.
Base-Out Probability MatrixThe basic base-out transition matrix is 24 rows by 28 columns (24x28). The 24 rows identify the number of outs (0, 1, or 2) and the position of the baserunners (8 possibilities, identified here as 0, 1, 2, 3, 1-2, 1-3, 2-3, and 1-2-3) at the beginning of the event. The 28 columns identify the number of outs and the position of the baserunners at the end of the event. The four additional columns represent the additional possibility that this event results in the third out of the inning. The third out gets four columns to also reflect the number of runs which score on the play (0, 1, 2, or 3 - you couldn't score more than 3 runs while also making the third out). This breakdown of third-out plays is important in the next step, converting the base-out transition matrix to a run-probability matrix.
Run Probability MatrixThe second step in converting a base-out transition matrix into a win-probability matrix is to convert the base-out probability matrix into a run-probability matrix. That is, given the initial base-out state, what is the probability of scoring exactly zero runs over the remainder of the inning, what is the probability of scoring exactly one run, what is the probability of scoring exactly two runs, etc?
We begin with the initial state, two outs and the bases loaded, (2,1-2-3)The probability of scoring exactly zero runs is equal to the probability of transitioning from (2,1-2-3) to (3,0) (i.e., 3 outs, zero runs scored), because any other transition will necessarily involve at least one run scoring.
Next, consider the initial state, (2,2-3)The probability of scoring exactly zero runs is equal to the probability of transitioning from (2,2-3) to (3,0) PLUS the probability of transitioning from (2,2-3) to (2,1-2-3) (which produces no runs scored) times the probability of scoring zero runs from (2,1-2-3) (which was solved for above). Again, any other final base-out state except for these two will have involved at least one run scoring.
Continuing onward, we can work up to the initial state, (2,1) - two out and a runner on firstThe probability of scoring exactly zero runs is equal to the probability of transitioning from (2,1) to (3,0) plus the probability of transitioning from (2,1) to (2,1-2) times the probability of scoring zero runs from (2,1-2) plus the probability of transitioning from (2,1) to (2,1-3) times the probability of scoring zero runs from (2,1-3), ....
Inning Probability MatrixOnce the run-probability matrix is constructed, what I call an inning-probability matrix can be constructed, based upon the run-probabilities for the base-out state (0,0) - nobody on and nobody out. As with the run-probability matrix, the inning-probability matrix is constructed recursively.
Win Probability MatrixGiven an inning-probability matrix, giving the probability of winning given the score differential at the start of any inning, and a run-probability matrix, which gives the probability of scoring any number of runs through the end of the given inning given the current base-out state, one can build a complete win-probability matrix.
1. Independent EventsMost events can happen regardless of the base-out situation. One can strike out at any time, regardless of how many baserunners or outs there are. Similarly, a triple could happen at any time regardless of the number of baserunners. All batter results, except for double plays (which are base-state dependent), intentional walks, and bunts, fall into the category of independent events. Intentional walks and bunts are treated as purely contextual events, which are described below.
2. Base-State Dependent EventsSome events can only happen given certain baserunners or a certain number of outs. For example, one can only ground into a double play with at least one baserunner on and less than two outs. Any Player Game Points accumulated by a baserunner on third base can, of course, only be accumulated in a base-out state that includes a runner on third base.
3. Purely Contextual EventsWhile it is possible to remove much, if not all, of the context from most plays, there are certain plays which are, essentially, purely elective plays, and are therefore inextricably tied to the context in which they take place. In my opinion, it would be wrong to attempt to divorce these plays from their context.
On August 28, 2002, starting pitcher Odalis Perez hit a solo home run with two outs in the bottom of the fifth inning off of Arizona's Rick Helling for the only run in a 1-0 Dodgers win.From a context-neutral perspective, Perez's and LoDuca's home runs were exactly equal in value: 0.1448 wins, since both home runs took place in the same run-scoring environment - Dodger Stadium in 2002. From a prospective perspective, on the other hand, LoDuca's home run, which ended the game, was more than twice as valuable, 0.4088 wins, as Perez's home run, at 0.1814 wins. These values correspond to a WPA valuation system: Baseball-Reference.com, for example, reports the WPA of these home runs as being 37% for LoDuca and 17% for Perez.
On September 27, 2002, Paul LoDuca led off the bottom of the tenth inning with a home run off of San Diego's Jeremy Fikac to break a scoreless tie and give the Dodgers a walkoff 1-0 victory.
Player Wins | |||||||||
Prospective | Contexts | ||||||||
Date | Batter | Situation | eWins | Inter-Game | WPA (BB-Ref) | pWins | Inter-Game | Intra-Game | Combined |
---|---|---|---|---|---|---|---|---|---|
8/28/2002 | Odalis Perez | Two out, bottom of 5th inning |
0.1448 | 0.1814 | 0.17 | 0.2724 |
1.2527 | 1.5018 | 1.8813 |
9/27/2002 | Paul LoDuca | Leading off bottom of 10th |
0.1448 | 0.4088 | 0.37 | 0.3619 |
2.8231 | 0.8852 | 2.4991 |
On September 29, 2005, David Ortiz went 3-5 including a home run leading off the bottom of the 8th inning to tie the score 4-4 and a walkoff RBI single with one out in the bottom of the 9th inning. Baseball-Reference credits Ortiz with a WPA of 0.584 for the game. Obviously, those hits were huge for the Red Sox and Ortiz was rightly celebrated as the hero of that game.Take Ortiz's two RBIs off the scoreboard for the Red Sox in that September 29th game, and the Blue Jays would have won that game 4-3. Then again, if Ortiz made a (single) out in his final at-bat, Manny Ramirez would have come to bat with the potential winning run still in scoring position (albeit with two outs).
On April 26, 2005, the Yankees defeated the Los Angeles Angels of Anaheim (or whatever they were calling themselves that season) 12-4. The Yankees took a 3-0 lead in the bottom of the first inning and led 10-2 by the end of the 4th inning. Obviously, there weren't a lot of "clutch" situations in this game. It was over early. Do you know why it was over early? Because Alex Rodriguez hit a 2-out, 3-run home run in the bottom of the first inning to give the Yankees that 3-0 lead, he hit a 2-out, 2-run home run in the bottom of the third inning to extend the Yankees' lead to 5-2, and he capped it off with a 2-out grand slam in the bottom of the 4th inning to give the Yankees that aforementioned 10-2 lead. For all of that, Baseball-Reference only credits Alex Rodriguez with a WPA of 0.490 for that game.