Baseball Player Won-Loss Records
Home     List of Articles

Baseball Player Wins and Losses

The job of a Major League Baseball player is to help his team win games, for the ultimate purpose of making the playoffs and winning the World Series. Since the early history of Major League Baseball, pitchers have been credited with Wins and Losses as official measures of the effectiveness of their pitching. Of course, Pitcher Wins are a fairly crude measure of how well a pitcher did his job, as wins are the product of the performance of the entire team - batters, baserunners, and fielders, in addition to pitchers.

While the implementation of Pitcher Wins as a measure of pitcher effectiveness is less than ideal, nevertheless the concept is perfectly sound. The ultimate measure of a player's contribution - be he a pitcher, a hitter, a baserunner, or a fielder - is in how much he contributes to his team's wins.

Using play-by-play data compiled from Retrosheet, I have constructed a set of Player won-lost records that attempt to quantify the precise extent to which individual players contribute directly to wins and losses in Major League Baseball on the baseball field.

The purpose of this article is to provide a general overview of Player wins. I compare my Player wins to WAR in a separate article. I give an overview of my website in a third article.

Basic Calculations

The starting point for my construction of Player wins and losses is context-dependent player wins and losses - pWins and pLosses - and the starting point for constructing pWins and pLosses is Win Probabilities. The concept of Win Probability was first developed by Eldon and Harlan Mills in 1969 and published in their book, Player Win Averages.

The basic concept underlying win probability systems is elegantly simple. At any point in time, the situation in a baseball game can be uniquely described by considering the inning, the number and location of any baserunners, the number of outs, and the difference in score between the two teams. Given these four things, one can calculate a probability of each team winning the game. Hence, at the start of a batter’s plate appearance, one can calculate the probability of the batting team winning the game. After the completion of the batter’s plate appearance, one can once again calculate the probability of the batting team winning the game. The difference between these two probabilities, typically called the Win Probability Advancement or something similar, is the value added by the offensive team during that particular plate appearance (where such value could, of course, be negative).

If we assume that the two teams are evenly matched, then the initial probability of winning is 50% for each team. At the end of the game, the probability of one team winning will be 100%, while the probability of the other team winning will be 0%. The sum of the Win Probability advancements for a particular team will add up to exactly 50% for a winning team (100% minus 50%) and exactly -50% for a losing team (0% minus 50%). Hence, Win Probability Advancement is a perfect accounting structure for allocating credit for team wins and losses to individual players.

Changes in Win Probabilities are credited to the individual players responsible for these changes. These contributions are called Player Game Points here. Positive changes in Win Probabilities are credited as Positive Player Game Points, while negative changes in Win Probabilities are credited as Negative Player Game Points.

Player Game Points are assigned to both offensive and defensive players on each individual play. Anything which increases the probability of the offensive team winning is credited as Positive Points to the offensive player(s) involved and as Negative Points to the defensive player(s) involved. Anything which increases the probability of the defensive team winning is credited as Positive Points to the defensive player(s) involved and as Negative Points to the offensive player(s) involved. Within any individual game, the number of Positive Player Game Points by offensive players on one team will be exactly equal to the number of Negative Player Game Points by defensive players on the other team and vice versa. Similarly, the number of Positive Player Game Points collected by members of the winning team will exactly equal the number of Negative Player Game Points accumulated by the losing team (and, again, vice versa).

Player Game Points assigned in this way provide a perfect accounting structure for assigning 100% of the credit for all changes in Win Probability to players on both teams involved in a game.

I convert these Player Game Points into Context-Dependent Player Wins and Losses, which I call pWins and pLosses. I simultaneously construct Context-Neutral Player Wins and Losses, called eWins and eLosses as well, which can be compared to Context-Dependent Player Wins and Losses to identify the contextual factors affecting players’ performances and how those contextual factors affect the translation of Player Wins and Losses into team wins and losses.

For both Context-Dependent and Context-Neutral Player Games, two adjustments are made to these results to move from initial Player Game Points to Player Won-Lost records.

1. Normalizing Component Won-Lost Records to 0.500
A key implicit assumption underlying my Player Won-Loss Records is that Major League Baseball players will have a combined winning percentage of 0.500. While this is trivially true at the aggregate level, almost regardless of what you do, it should also be true at finer levels of detail as well.

So, for example, if Player Won-Loss records are calculated correctly, the total number of wins accumulated by baserunners on third base for advancing on wild pitches and passed balls should be exactly equal to the total number of losses accumulated by baserunners on third base for failing to advance on wild pitches or passed balls. Likewise, the total number of wins accumulated by second basemen for turning double plays on ground balls in double-play situations should be exactly equal to the total number of losses accumulated by second basemen for failing to turn double plays on ground balls in double-play situations.

To ensure this symmetry, therefore, I normalize Player Game Points to ensure that the total number of Positive Player Game Points is exactly equal to the number of Negative Player Game Points for every Component of Player Game Points as well as by sub-component, at the finest level of detail which makes logical sense in each case.

2. Normalizing Player Game Points by Game
The total number of Player Game Points accumulated in an average Major League Baseball game is around 3.3 per team. This number varies tremendously game-to-game, however, with some teams earning 2 wins in some team victories while some other teams may earn 6 wins in team losses. At the end of the day (or season), however, all wins are equal. Hence, in my work, I have chosen to assign each team one Player Win and one Player Loss for each team game. In addition, the winning team earns a second full Win, while the losing team earns a second full Loss. Ties are allocated as 1.5 Wins and 1.5 Losses for both teams. Context-neutral player decisions (eWins and eLosses) are also normalized to average three Player decisions per game. For eWins and eLosses, this normalization is done at the season level, rather than the game level, so that different numbers of context-neutral player decisions will be earned in different games.

Why 3 Player Decisions per Game?
The choice of three Player Decisions per game here is largely arbitrary. I chose three because the resulting Player Won-Lost records end up being on a similar scale to traditional pitcher won-lost records, with which most baseball fans are quite familiar. For example, expressed in this way, Bryce Harper led the major leagues in 2015 with 25.7 (Context-Dependent) Player Wins, while Jay Bruce led the majors with 21.1 losses.

In comparison, Jake Arrieta led all major league pitchers in 2015 with 22 wins (Arrieta amassed 17.7 Player Wins) while Shelby Miller (14.0 Player losses) led the major leagues with 17 losses. Over the entire Retrosheet Era, the most pWins accumulated by a player in a single season was 30.6 by Babe Ruth in 1927 (against 15.1 pLosses). The most single-season pLosses were accumulated by Julio Franco in 1985 with 23.6 pLosses (and 19.4 pWins).

Why Do Players Get Wins in Games Their Team Loses?
If one is interested in assigning credit to players for team wins or blame to players for team losses, one might think that it would make sense to only credit a player with Player wins in games which his team won and only credit Player losses in games which his team lost. I have chosen instead to give players some wins even in team losses and some losses even in team wins. I do this for a couple of reasons.

Most simply put, baseball players do tons of positive things in team losses and baseball players do tons of negative things in team wins. Throwing away all of those things based solely on the final score of the game leads, in my opinion, to too much valuable data simply being lost. It makes the results too dependent on context.

As I noted above, in the average major-league baseball game of the Retrosheet Era (1921 - 2018), the average team amasses 3.3 Player Game Points. The win probability for the winning team goes from 50% at the start of the game to 100% at the end, so that the winning team will amass exactly 0.5 more positive Player Game Points than negative Player Game Points by construction. This means that the players on an average winning team will amass a combined record of something like 1.9 - 1.4 in a typical game. That works out to a 0.576 winning percentage, or about 93 wins in a 162-game schedule (93-69). Put another way, more than 40% of all Player Game Points (1 - 0.576) would be zeroed out in a system that credited no Player wins in team losses (or Player losses in team wins). That's simply too much for me to be comfortable making such an adjustment.

There are two reasons why such a large percentage of plays do not contribute to victory. First, it is indicative, I think, of the fairly high level of competitive balance within Major-League Baseball. Put simply, even very bad Major-League Baseball teams are not that much worse than very good Major-League Baseball teams.

But the other reason why such a large percentage of plays do not contribute to victory, and why I assign player wins even in team losses and vice-versa, is because of the rules of baseball. Because there is no clock in baseball, the only way for a game to end (in a league with no slaughter rule) is for the winning team to do some things that reduce its chances of winning: it has to make 3 outs per inning for at least 8 innings (not counting rain-shortened games). Likewise, a losing team is guaranteed to do some things that increase its chance of winning: it must get the other team out 3 times per inning.

My system still rewards players who do positive things that contribute to wins more favorably than players who do positive things that lead to losses. As I noted above, an average team will amass a player winning percentage of approximately 0.576 in team wins (and 0.424 in team losses). By assigning 2 wins and only 1 loss in team wins, however, players will amass a 0.667 player winning percentage in team wins (and 0.333 in team losses). So, player wins that lead to team wins will still be more valuable than player wins that happen in team losses. The latter are simply not worthless.

Relationship of Player Decisions to Team Decisions
Under my system, to move from players’ team-dependent won-lost records (pWins and pLosses) to a team won-lost record, one can subtract out what I call “background wins” and “background losses.” One-third of a player’s decisions are background wins and one-third of a player’s decisions are background losses. Mathematically, then, if the sum of the team-dependent won-lost records of the players on a team is W wins and L losses, then the team’s won-lost record will be as follows:

Team Wins = W – (W + L) / 3;     Team Losses = L – (W + L) / 3

As some practical examples, a team of .500 players will be a .500 team (of course), but, for example, a team of .510 players (e.g., 248-238) will be a .530 team (86-76 in a 162-game season), and a team of .550 players (e.g., 267-219) will be a .650 team (105-57). At the other extreme, a team of .400 players (e.g., 194-292) will be a .200 team (32-130).

There are two implications to this relationship between player wins and team wins. First, the range of winning percentages for players is narrower than the range of team winning percentages. This is important in evaluating the concept of replacement level. In my work, team-level replacement level is a winning percentage between 0.340 and 0.350. But, player-level replacement level is closer to 0.450.

The second implication is that player wins and losses do not have a purely additive effect on team wins and losses; instead, the effect is somewhat more multiplicative. In an average game, the players on the winning team will amass a (context-neutral) winning percentage of approximately 0.576 - not all that much above 0.500. Having players who are a little bit better than average will translate into a team that is a lot better than average. In fact, a team of 0.576 players would win well over 100 games in a 162-game season. The reverse is true of below-average players. A team of slightly below-average players will lose far more often than they win. For example, the players on the 2011 Houston Astros amassed a pWinning percentage of 0.449. In fact, that number has already been adjusted to reflect the Astros' team record of 56-106, and hence understates the raw context-neutral performance of the Astros' players. In terms of raw context-neutral numbers, with no adjustments, the combined performance of the players on the 2011 Houston Astros was a player winning percentage of 0.481. In other words, in this case, a team of 0.481 players became a 0.346 team.

Players' final won-lost records will be pushed away from 0.500 depending on exactly how their performance translates into team wins and losses. So, the final Player records of the Astros' players fell from 0.481 to 0.449 because the players' losses contributed more to losses than the players' wins were able to contribute to team victories. By tying to team wins and losses, pWins and pLosses for a player will be dependent on the context in which they take place. Part of that context is the quality of a player's teammates.

But even beyond the actual context of pWins and pLosses, this tendency of player records to push away from 0.500 also affects eWins and eLosses for a player as well. We can expect players with context-neutral won-lost records over 0.500 to have their record translate into (slightly) more wins than might be implied by their raw record, and players with context-neutral won-lost records below 0.500 to have their record translate into (slightly) more losses than their raw record.

This (expected) effect is stronger the more concentrated a player's record is within a game. Because of this, this "expected team win adjustment" is stronger for pitchers, especially starting pitchers, who concentrate their performance more heavily in the games they play. Because of this, pitching accounts for 31.2% of total player decisions, but pitchers earn 43.1% of pWins over replacement level.

Basic Results: pWins
Player wins are calculated such that the players on a team earn 3 Player decisions per game. I calculate two sets of Player wins: pWins are tied to team wins - the players on a winning team earn 2 pWins and 1 pLoss, the players on a losing team earn 1 pWin and 2 pLosses - while eWins attempt to control for the context in which they were earned, as well as controlling for the abilities of a player's teammates.

Player wins end up being on a similar scale to traditional pitcher wins: 20 wins is a good season total, 300 wins is an excellent career total.

There are a total of 76 major-league players who have accumulated 300 or more pWins over games for which Retrosheet has released play-by-play data (1921 - 2018). They are shown below.

300 pGame Winners of the Retrosheet Era*
Player pWins pLosses pWin Pct. pWOPA pWORL
Hank Aaron491.8372.10.56942.277.9
Barry Bonds462.0314.90.59561.091.4
Willie Mays460.1334.70.57949.882.0
Pete Rose Sr.437.3394.60.52611.344.2
Rickey Henderson427.2353.60.54728.359.1
Carl Yastrzemski426.4363.10.54017.850.7
Stan Musial420.8311.50.57540.169.6
Frank Robinson394.8304.00.56530.459.8
Dave Winfield392.8343.00.53414.644.4
Cal Ripken383.5350.70.52227.555.9
Al Kaline377.4301.90.55624.652.7
Mel Ott375.2272.50.57939.266.2
Alex Rodriguez374.0297.20.55742.169.1
Joe L. Morgan373.8292.10.56146.472.8
Derek Jeter369.5322.40.53432.959.6
Ted Williams368.9253.10.59344.469.7
Robin Yount367.5345.70.51518.145.7
Reggie Jackson366.5297.70.55225.554.0
Craig Biggio358.5325.40.52417.944.7
Adrian Beltre357.5317.90.52917.443.5
Nolan Ryan356.8328.00.52119.750.7
Brooks Robinson354.6310.50.53317.043.7
Warren Spahn353.9293.40.54740.868.5
Roberto Clemente352.1298.50.54113.539.8
Mickey Mantle351.7230.40.60450.373.9
Andre Dawson347.0313.40.5257.733.7
Albert Pujols346.2255.30.57633.958.9
Eddie Murray345.4284.90.54818.644.9
Lou Brock343.4333.00.508-8.618.6
Gary Sheffield341.8289.70.54117.743.6
Ken Griffey Jr.339.2300.40.53016.441.9
Steve Carlton337.9303.90.52627.654.5
Mike Schmidt337.5257.50.56734.056.8
Chipper Jones336.7258.50.56633.857.0
Ozzie Smith335.8310.40.52021.245.9
Luis Aparicio332.5326.50.50412.639.2
Phil Niekro331.8320.90.50814.342.8
Billy Williams329.4281.30.53911.136.6
Carlos Beltran328.5292.50.52914.138.8
Greg Maddux328.5271.30.54839.968.5
George Brett328.1274.90.54422.447.3
Dwight Evans326.4274.50.54319.443.4
Omar Vizquel322.4334.00.4912.828.2
Vada Pinson322.4300.80.5170.826.0
Rusty Staub322.0307.10.512-2.624.7
Eddie Mathews320.9240.80.57134.357.0
Don Sutton320.8295.30.52120.547.2
Paul Waner318.9273.40.53811.035.8
Manny Ramirez318.3250.50.56026.650.1
Roger Clemens318.1228.10.58246.374.6
Paul Molitor318.0272.70.53819.747.4
Tony Gwynn Sr.317.9287.80.5254.427.8
Luis 'Gonzo' Gonzalez317.7293.50.5202.626.8
Gaylord Perry317.7289.20.52320.347.6
Roberto Alomar314.5280.90.52820.644.0
Sammy Sosa313.8281.10.5276.430.4
Tim Raines Sr.313.5276.20.53210.834.1
Dave Parker312.3275.80.5318.733.0
Ernie Banks310.4279.10.52711.935.8
Ichiro Suzuki309.5303.80.505-2.620.9
Early Wynn309.2284.10.52121.947.2
Steve Finley309.1291.90.5144.528.1
Graig Nettles309.0270.40.53316.239.1
Tom Seaver308.9256.70.54634.458.5
Bobby Abreu307.6266.30.53612.234.8
Rafael Palmeiro307.5267.00.5358.733.0
Tony Perez307.0256.40.54514.837.7
Lou Whitaker306.1260.80.54028.250.0
Babe Ruth306.0177.80.63254.474.2
Willie Davis305.0278.60.5234.728.2
Enos Slaughter303.9253.60.54515.938.8
Torii Hunter303.4292.40.5093.326.3
Luke Appling303.1281.10.51914.538.4
Darrell Evans302.6259.10.53915.037.6
Jeff Kent302.5264.80.53319.341.5
Pee Wee Reese301.5239.10.55835.357.2

*Play-by-play data are missing for many games prior to 1937.

Accumulating 300 pWins is certainly an accomplishment. But it's fairly clear looking at the above list that the list of the top players in pWins is not necessarily a list of the best players, period. Just to pick out two examples: Omar Vizquel actually has a career winning percentage under 0.500 and Rusty Staub was (slightly) below average over the course of his career.

Don't get me wrong: Omar Vizquel and Rusty Staub both had fine, noteworthy major-league careers. But did they have better careers than, say, 5-time Cy Young winner Randy Johnson, who "only" amassed 281.2 pWins in his illustrious career?
Comparing Players across Positions: Wins over Positional Average (WOPA)
Player won-lost records are an excellent overall measure of player value. When context and the effects of teammates are controlled for, Player won-lost records can also, in my opinion, serve as an excellent starting point for measuring player talent. As a means of comparing players who play different positions, however, raw Player won-lost records are not necessarily an ideal comparative tool.

In constructing Player wins and losses, all events are measured against expected, or average, results across the event. Because of this, fielding Player Won-Lost records are constructed such that aggregate winning percentages are 0.500 for all fielding positions. Hence, one can say that a shortstop with a defensive winning percentage of 0.475 was a below-average defensive shortstop and a first baseman with a defensive winning percentage of 0.510 was an above-average defensive first baseman, but there is no basis for determining which of these two players was a better fielder – the below-average fielder at the more difficult position or the above-average fielder at the easier position.

From an offensive perspective, batting Player Won-Lost records are constructed by comparing across all batters, not simply batters who share the same fielding position. In the National League, this means that offensive comparisons include pitcher hitting, so that, on average, non-pitcher hitters will be slightly above average in the National League, while, of course, because of the DH rule, the average non-pitcher hitter will define the average in the American League.

In order to compare players across positions, it is therefore necessary to normalize players' records relative to an average player at the position(s) a player played. Doing this, we can see, in the above table, that while Rusty Staub amassed a better career winning percentage (0.512) than Omar Vizquel (0.491), Vizquel was above-average for the position(s) he played (mostly shortstop), with 2.8 wins above "positional average", while Staub was below-average for the position(s) he played (mostly RF), with -2.6 wins "above" positional average. Positional averages for every season for which I calculate Player won-lost records can be found here.

The top 50 players in career pWOPA over the Retrosheet Era (1921 - 2018) are shown in the table below.

Top 50 Players in Wins over Positional Average*
Player pWins pLosses pWin Pct. pWOPA pWORL
1Barry Bonds462.0314.90.59561.091.4
2Babe Ruth306.0177.80.63254.474.2
3Mickey Mantle351.7230.40.60450.373.9
4Willie Mays460.1334.70.57949.882.0
5Joe L. Morgan373.8292.10.56146.472.8
6Roger Clemens318.1228.10.58246.374.6
7Lefty Grove254.8176.50.59144.763.9
8Ted Williams368.9253.10.59344.469.7
9Hank Aaron491.8372.10.56942.277.9
10Alex Rodriguez374.0297.20.55742.169.1
11Warren Spahn353.9293.40.54740.868.5
12Lou Gehrig265.8160.10.62440.358.0
13Stan Musial420.8311.50.57540.169.6
14Greg Maddux328.5271.30.54839.968.5
15Mel Ott375.2272.50.57939.266.2
16Joe DiMaggio281.9194.30.59238.357.6
17Pee Wee Reese301.5239.10.55835.357.2
18Tom Seaver308.9256.70.54634.458.5
19Jimmie Foxx289.1195.10.59734.454.8
20Eddie Mathews320.9240.80.57134.357.0
21Randy 'Big Unit' Johnson281.2220.90.56034.259.9
22Mike Schmidt337.5257.50.56734.056.8
23Albert Pujols346.2255.30.57633.958.9
24Chipper Jones336.7258.50.56633.857.0
25Mariano Rivera126.660.80.67632.945.6
26Derek Jeter369.5322.40.53432.959.6
27Yogi Berra245.3182.30.57432.650.1
28Bob Gibson263.9217.60.54831.752.5
29Jim Palmer242.8186.00.56631.250.7
30Pedro J. Martinez193.6137.40.58531.148.5
31Frank Robinson394.8304.00.56530.459.8
32Juan Marichal232.1187.10.55429.747.8
33Whitey Ford210.3164.30.56129.045.3
34Rickey Henderson427.2353.60.54728.359.1
35Lou Whitaker306.1260.80.54028.250.0
36Steve Carlton337.9303.90.52627.654.5
37Cal Ripken383.5350.70.52227.555.9
38Barry Larkin292.1249.40.53927.048.2
39John Smoltz239.7201.10.54426.648.5
40Manny Ramirez318.3250.50.56026.650.1
41Bobby Doerr263.7215.60.55026.345.5
42Duke Snider268.7201.00.57226.145.4
43Jackie Robinson195.9140.10.58326.139.7
44Bob Feller257.0217.60.54226.147.2
45Mike Mussina224.2173.10.56425.947.5
46David Ortiz250.9188.30.57125.551.5
47Reggie Jackson366.5297.70.55225.554.0
48Clayton Kershaw144.7104.60.58025.037.0
49Tom Glavine279.4249.50.52824.850.1
50Bill Dickey170.3127.10.57324.637.1

*Play-by-play data are missing for many games prior to 1937.

Focusing on players' wins above average helps to highlight players who had relatively short but brilliant careers, players like Pedro Martinez, whose 193.6 career pWins rank a fairly low 445th in the Retrosheet Era, while his 31.1 pWOPA rank a much more impressive 30th, or Mariano Rivera, whose 126.6 pWins rank even lower than Pedro's (1154th) but who ranks 25th in career pWOPA with 32.9.

Wins over Replacement Level: WORL
Replacement Level is the level of performance which a team should be able to get from a player who they can find easily on short notice – such as a minor-league call-up or a veteran waiver-wire pickup. The theory here is that major league baseball players only have value to a team above and beyond what the team could get from basically pulling players off the street. That is, there’s no real marginal value to having a third baseman make routine plays that anybody who’s capable of playing third base at the high school or college level could make, since if a major-league team were to lose its starting third baseman, they would fill the position with somebody and that somebody would, in fact, make at least those routine plays at third base. This is similar to the economic concept of Opportunity Cost.

For my work, I define Replacement Level as equal to a winning percentage one weighted standard deviation below Positional Average, with separate standard deviations calculated for pitchers and non-pitchers. Unique standard deviations are calculated in this way for each year. These standard deviations are then applied to the unique Positional Averages of each individual player. Overall, this works out to an average Replacement Level of about 0.456 (0.465 for non-pitchers, and 0.440 for pitchers). A team of 0.456 players would have an expected winning percentage of 0.368 (60 - 102 over a 162-game season).

The top 50 players in career pWORL over the Retrosheet Era (1921 - 2018) are shown in the table below.

Top 50 Players in Wins over Replacement Level*
Player pWins pLosses pWin Pct. pWOPA pWORL
1Barry Bonds462.0314.90.59561.091.4
2Willie Mays460.1334.70.57949.882.0
3Hank Aaron491.8372.10.56942.277.9
4Roger Clemens318.1228.10.58246.374.6
5Babe Ruth306.0177.80.63254.474.2
6Mickey Mantle351.7230.40.60450.373.9
7Joe L. Morgan373.8292.10.56146.472.8
8Ted Williams368.9253.10.59344.469.7
9Stan Musial420.8311.50.57540.169.6
10Alex Rodriguez374.0297.20.55742.169.1
11Warren Spahn353.9293.40.54740.868.5
12Greg Maddux328.5271.30.54839.968.5
13Mel Ott375.2272.50.57939.266.2
14Lefty Grove254.8176.50.59144.763.9
15Randy 'Big Unit' Johnson281.2220.90.56034.259.9
16Frank Robinson394.8304.00.56530.459.8
17Derek Jeter369.5322.40.53432.959.6
18Rickey Henderson427.2353.60.54728.359.1
19Albert Pujols346.2255.30.57633.958.9
20Tom Seaver308.9256.70.54634.458.5
21Lou Gehrig265.8160.10.62440.358.0
22Joe DiMaggio281.9194.30.59238.357.6
23Pee Wee Reese301.5239.10.55835.357.2
24Chipper Jones336.7258.50.56633.857.0
25Eddie Mathews320.9240.80.57134.357.0
26Mike Schmidt337.5257.50.56734.056.8
27Cal Ripken383.5350.70.52227.555.9
28Jimmie Foxx289.1195.10.59734.454.8
29Steve Carlton337.9303.90.52627.654.5
30Reggie Jackson366.5297.70.55225.554.0
31Al Kaline377.4301.90.55624.652.7
32Bob Gibson263.9217.60.54831.752.5
33David Ortiz250.9188.30.57125.551.5
34Carl Yastrzemski426.4363.10.54017.850.7
35Nolan Ryan356.8328.00.52119.750.7
36Jim Palmer242.8186.00.56631.250.7
37Tom Glavine279.4249.50.52824.850.1
38Yogi Berra245.3182.30.57432.650.1
39Manny Ramirez318.3250.50.56026.650.1
40Lou Whitaker306.1260.80.54028.250.0
41John Smoltz239.7201.10.54426.648.5
42Pedro J. Martinez193.6137.40.58531.148.5
43Barry Larkin292.1249.40.53927.048.2
44Juan Marichal232.1187.10.55429.747.8
45Gaylord Perry317.7289.20.52320.347.6
46Mike Mussina224.2173.10.56425.947.5
47Paul Molitor318.0272.70.53819.747.4
48George Brett328.1274.90.54422.447.3
49Fergie Jenkins286.7250.80.53323.647.3
50Don Sutton320.8295.30.52120.547.2

*Play-by-play data are missing for many games prior to 1937.

Measuring against replacement level instead of average helps to weed out pure compilers such as Rusty Staub while showing a mix of short excellent careers (e.g., Pedro Martinez) together with long, more modestly above-average careers, such as Don Sutton.

Contextual Factors
As explained above, I calculate two measures of Player won-lost records: [pWins & pLosses] and [eWins & eLosses]. Comparing the results for these two sets of Player records, it is possible to isolate and identify the specific contextual factors that affect how player performance translates into team wins.

My pWins and pLosses are tied to team wins: the players on a team earn a total of 2 pWins and 1 pLoss in every team win, and 1 pWin and 2 pLosses in every team loss. These records are highly contextual. That is, hitting a grand slam with two outs in the bottom of the ninth inning with your team trailing by three runs will earn more pWins than hitting a solo home run leading off the top of the 8th inning with your team trailing 13-1. Positive events that contribute to wins are more valuable than positive events that end up going for naught in team losses. I believe that a good case can be made that pWins and pLosses do the best possible job of truly capturing player value - which is an inevitable function of the context in which it occurs. Nevertheless, calculating Player wins and losses in this way leads to player value being due, at least in part, to factors outside of a player's control: the quality of his teammates, the timing of his performance.

Because of this, I also calculate a set of Player won-lost records which attempt to control for the quality of a player's teammates and the context in which he performed. I call these expected Player won-lost records, or eWins and eLosses.

Most sabermetric measures - e.g., Linear Weights, bWAR, fWAR, WARP, et al. - are designed to be context-neutral, and are therefore most comparable to my eWins and eLosses. Bill James's Win Shares do tie to team wins, but the linkage of team wins to player Win Shares is done via an across-the-board adjustment based on end-of-season data, rather than linking to team wins on a game-by-game basis, like my pWins and pLosses. Context does come into play for some subsets of players for some statistics. For example, both Baseball-Reference and Fangraphs incorporate leverage into their WAR statistics for relief pitchers.

There are two ways in which [pWins & pLosses] might differ from [eWins & eLosses], which I call "context" and "win adjustments".

Context refers to the importance of a specific play in terms of determining team victories relative to a play of average importance. Differences in context will affect the total number of player decisions, so that, for example, a player who performed in an above-average context (>1) will earn more context-dependent player decisions (pWins + pLosses) than context-neutral player decisions (eWins + eLosses).

Win Adjustments measure differences in a player's player winning percentage across different situations, i.e., the increase in a team's probability of victory relative to the average increase in win probability associated with a particular event. So, for example, a player who hits better in the clutch than at other times may have a higher winning percentage when measured using pWins and pLosses than based on eWins and eLosses. The player's "win adjustment" would be the difference between these two winning percentages.

Context and Win Adjustments can both differ across two dimensions: inter-game or intra-game.

Inter-game refers to differences in the relative importance of situations within a single game.

Intra-game refers to differences in the relative importance of situations across different games.

Here are a few specific examples of players whose pWins differ from their eWins and why.

Trevor Hoffman, for example, earned 9.2 more pWins than eWins in his career (10%) mostly because the actual context in which he performed was 0% greater than his expected context.

Derek Jeter earned 13.4 more pWins than eWins in his career largely because he had the good fortune to play with above-average teammates, so that Jeter's positive contributions have been more likely to contribute to victories than would have been expected. This is reflected in what I call Jeter's intra-game win adjustment, which has raised his pWinning percentage by 0.019 (to 0.534) over Jeter's expected winning percentage (accounting for 13.4 additional wins).

David Ortiz earned 1.4 more pWins than eWins in 2005 mostly because of the timing of his hits (i.e., Big Papi was a great clutch hitter that year). The timing of Ortiz's hits, which I call his inter-game win adjustment, increased his winning percentage by 0.041 that year, adding 1.3 wins to Ortiz's record.
The choice between pWins and eWins will likely depend on one's purposes in putting together a list. One could think of pWins as measuring what actually happened, while eWins perhaps measure what should have happened. Personally, I think both of these measures provide us with useful and interesting information.

Components of Player Wins and Losses
Player Wins and Losses are calculated using a nine-step process, each step of which assumes average performance in all subsequent steps. Each step of the process is associated with a Component of Player Wins and Losses (Player Decisions). These nine components are outlined briefly below. There are four basic positions from which a player can contribute toward his baseball team’s probability of winning: Batter, Baserunner, Pitcher, and Fielder. Player Decisions are allocated to each of these four positions, as appropriate, within each of the following nine Components.

Component 1: Basestealing

Player Decisions are assessed to baserunners, pitchers, and catchers for stolen bases, caught stealing, pickoffs, and balks.

Component 2: Wild Pitches and Passed Balls

Player Decisions are assessed to baserunners, pitchers, and catchers for wild pitches and passed balls.

Component 3: Balls not in Play

Player Decisions are assessed to batters and pitchers for plate appearances that do not involve the batter putting the ball in play: i.e., strikeouts, walks, and hit-by-pitches.

Component 4: Balls in Play

Player Decisions are assessed to batters and pitchers on balls that are put in play, including home runs, based on how and where the ball is hit.

Component 5: Hits versus Outs on Balls in Play

Player Decisions are assessed to batters, pitchers, and fielders on balls in play, based on whether they are converted into outs or not.

Component 6: Singles versus Doubles versus Triples

Player Decisions are assessed to batters, pitchers, and fielders on hits in play, on the basis of whether the hit becomes a single, a double, or a triple.

Component 7: Double Plays

Player Decisions are assessed to batters, baserunners, pitchers, and fielders on ground-ball outs in double-play situations, based on whether or not the batter grounds into a double play.

Component 8: Baserunner Outs

Player Decisions are assessed to batters, baserunners, and fielders on the basis of baserunner outs.

Component 9: Baserunner Advancements

Player Decisions are assessed to batters, baserunners, and fielders on the basis of how many bases, if any, baserunners advance on balls in play.
For components where Player decisions are shared across multiple players (e.g., pitchers and fielders in Component 5), I divide credit between players based on the extent to which player winning percentages within the particular component persist over time.

The distribution of Player Wins and Losses by Component varies across seasons and across leagues, depending on the exact distribution of plays. The average distribution of Player decisions by Component across all seasons for which I have calculated Player won-lost records is as follows.

Breakdowns of Player Game Points by Component: 1921 - 2018
Distribution of Player Decisions

Percent of Offensive/Defensive Component Decisions Allocated to Player Decisions
Percent of Total Batters Baserunners Pitchers Fielders
Component 1: Stolen Bases, etc.2.2%0.0%100.0%51.3%48.7%
Component 2: Wild Pitches, Passed Balls1.3%0.0%100.0%73.2%26.8%
Component 3: Balls Not in Play14.7%100.0%0.0%100.0%0.0%
Component 4: Balls in Play34.8%100.0%0.0%100.0%0.0%
Component 5: Hit vs. Out33.0%100.0%0.0%29.1%70.9%
Component 6: Single v. Double v. Triple3.5%100.0%0.0%27.1%72.9%
Component 7: Double Plays2.2%86.8%13.2%16.2%83.8%
Component 8: Baserunner Outs2.3%40.5%59.5%0.0%100.0%
Component 9: Baserunner Advancements6.0%46.8%53.2%0.0%100.0%
Total Offensive/Defensive Decisions91.7%8.3%62.5%37.5%
Total Player Decisions45.8%4.2%31.2%18.8%

Value vs. Talent

Sabermetricians often distinguish between two measures of player performance: value and true talent. My basic Player won-lost records, pWins and pLosses, are purely the former, a value measure. Unfortunately, as anybody who has ever read an MVP debate knows, the word "value" can have different definitions to different people.

The first link above defines value thusly: "A player's value is his contributions to his team based upon his on-field performance (hitting, running, fielding and pitching) in a neutral context." I would define value slightly differently. My definition of value would be this: A player's value is his contributions to his team's on-field success. Player value is a retrospective evaluation, which quantifies what happened in the past.

True talent, on the other hand, is defined in the latter link above as the "probabilistic expectations of a player’s output at a given point in time, given that we know everything there is to know about that player." In other words, "true talent" is a prospective measure of expected performance, which predicts what will happen.

As I said, Player won-lost records are a measure of player value, by which I mean a player's (on-field) contributions to his team's on-field performance, measured in wins and losses. Value, defined in this way, is highly dependent on context. Several key types of context which affect player value include the following.

1.    Run-Scoring Environment
Runs are more valuable in a lower run-scoring environment. Scoring one run is more likely to lead to winning in an environment where 1-0 victories are fairly common than in an environment where the average final score is 8-6. This is why Player won-lost records control for the run-scoring environment, both for the season and league in which the game took place as well as for the ballpark in which the game was played.

2.    Timing of Events
The timing of events within a game can affect the value of those events. Hits which drive in runners on base can be viewed as more valuable than hits with the bases empty which do not produce runs. Home runs are more valuable in tie games than when the score is 15-0 (in either direction).

3.    Retrospective Context
The value of a win is greater than the value of a loss. Retrospectively, one can argue that this means that the value of events are greater if they contribute to a win than if they contribute to a loss.

I have to concede at this point that value is ultimately subjective and, hence, my Player won-lost records are ultimately subjective. The main point of subjectivity is the value of a win versus the value of a loss. I value team wins at 2 pWins and 1 pLoss, and I value team losses at a pWin-pLoss record of 1-2. I explained and attempted to defend that choice above.

There is also some inherent subjectivity in the assignment of value to specific players. I have attempted to make these assignments as objectively as possible. Again, my choices in this respect are explained briefly above, and in somewhat more detail elsewhere. Note, however, that given the overall value of team wins and team losses, the total value for a team is fixed, which means that, to the extent one assigns too much value to one player on a team it must be at the expense of assigning too little value to one of his teammates.

Player won-lost records, as I calculate them, represent a complete accounting of all value accumulated within a major-league baseball game. Note that this means that "luck" has to be accounted for somewhere, regardless of whether we think the accumulation of that "luck" was the result of any skill, whether any such skill "persists", or whether there is any predictive ability associated with such events.

Value versus True Talent
So what is the difference between "value" and "true talent"? The key difference, as I see it, is that "value" can be directly observed, while "true talent" can only be inferred. Going one step farther, "true talent" can only be inferred from value. Hence, to my mind, measuring value is a necessary first step to being able to assess true talent.

Unfortunately, I think that too often there is confusion between value and true talent, where "true talent" measures make their way into what are intended to be "value" measures. For example, in his Win Shares system, Bill James increases the fielding Win Shares for third basemen if they played for a team with a below-average number of innings pitched by left-handed pitchers. The rationale for this is that left-handed pitchers allow more balls hit toward the third baseman (because LHP face more RHB).

I assume that this is true, but, even if it is true, that would simply mean that third basemen are less valuable with right-handed pitchers on the mound than with lefties pitching. This is also a good example of why a single-number value system can be misleading, although Bill James has corrected for this by adding Loss Shares to his Win Shares system.

Another example of a "value" system that slips in some "true talent" into its calculations is Fangraphs' calculation of WAR (Wins above Replacement). For pitchers, Fangraphs calculates WAR based on FIP (Fielding Independent Pitching). Rather than considering the actual number of runs allowed by a pitcher, FIP calculates how many runs a pitcher would be expected to allow given his walks, strikeouts, and home runs allowed. As such, FIP doesn't explain what did happen, it explains what would be expected to have happened.

Now, there's an argument to be made for using FIP and it's baked right into the name: it controls for the fielders behind the pitcher. The fielders are then valued based on their fielding (using UZR). The problem is that UZR controls for the hardness of the balls-in-play, for the hit types, for the handedness of the pitcher and hitter, etc. In other words, for a bunch of things that are NOT captured in FIP. Which leaves those things uncaptured at all. So we're left with WAR measuring what we would have expected players to be worth, not what they really were worth.

So What's the Point of Context-Neutral Wins and Losses (eWins, eLosses)?
So, if context is a necessary condition of measuring player value, then what is the point of the context-neutral wins and losses that I calculate, eWins and eLosses? By constructing wins and losses that are stripped of context, it becomes possible to distinguish the value of what players do (eWins, eLosses) from when players do these things via the Contextual Factors that relate eWins and eLosses to pWins and pLosses.

In this way, value can be divided into its myriad sub-components, not simply batting versus baserunning versus pitching versus fielding, or basestealing versus baserunner outs versus baserunner advancement, but also inter-game context versus inter-game win adjustments versus the impact of one's teammates on one's fielding, etc. In this way, I believe that Player won-lost records can serve as something of the Platonic ideal of baseball statistics, with everything expressed in the same units - wins and losses - and with everything accounted for in a way which ties back perfectly to what actually happened on the baseball field.

I hope you enjoy reading about and playing with Player won-lost records as much as I enjoyed creating them. To help you get the most out of my Player won-lost records, a map of my website can be found here. Thank you for visiting the site and please come back often!

All articles are written so that they pull data directly from the most recent version of the Player won-lost database. Hence, any numbers cited within these articles should automatically incorporate the most recent update to Player won-lost records. In some cases, however, the accompanying text may have been written based on previous versions of Player won-lost records. I apologize if this results in non-sensical text in any cases.

Home     List of Articles