The Z Factor - Numberphile

2024 ж. 31 Қаң.
114 895 Рет қаралды

Featuring Tom Crawford. Jane Street skyscraper puzzle (and info on the AMP program) at bit.ly/numberphile-amp --- More links & stuff in full description below ↓↓↓
Jane Street’s Academy of Math and Programming is now accepting applications for their summer 2024 program, which will run from June 29th-August 2nd in NYC... Or you can just check out the puzzle for fun too - bit.ly/numberphile-amp
Tom Crawford's website, with links to his work and other outreach: tomrocksmaths.com
More Tom videos on Numberphile: bit.ly/Crawford_Videos
Tom on the Numberphile Podcast: • The Naked Mathematicia...
Patreon: / numberphile
Numberphile is supported by Jane Street. Learn more about them (and exciting career opportunities) at: bit.ly/numberphile-janestreet
We're also supported by the Simons Laufer Mathematical Sciences Institute (formerly MSRI): bit.ly/MSRINumberphile
Our thanks also to the Simons Foundation: www.simonsfoundation.org
NUMBERPHILE
Website: www.numberphile.com/
Numberphile on Facebook: / numberphile
Numberphile tweets: / numberphile
Subscribe: bit.ly/Numberphile_Sub
Video by Brady Haran and Pete McPartlan
Numberphile T-Shirts and Merch: teespring.com/stores/numberphile
Brady's videos subreddit: / bradyharan
Brady's latest videos across all channels: www.bradyharanblog.com/
Sign up for (occasional) emails: eepurl.com/YdjL9

Пікірлер
  • I think the best way to interpret the Z-score here is 'how much better than the competition were they in their time'. If you use the Z-score to directly compare two teams, you could argue that someone in an amateur league with a better Z-score is a better team than a professional team.

    @idonno87@idonno873 ай бұрын
    • Correct. We use IQ as an example in my classes because it is *always* M = 100 and SD = 15, because it is “normed” to the current population. What this ignores is what’s called the Flynn Effect, which is that IQ is constantly increasing (or we’re getting better at taking tests). So someone with an average IQ now (100) is smarter than someone with that same average IQ score 50 years ago.

      @plackt@plackt3 ай бұрын
    • Yes there isnt an allowance for the whole league being stronger over time. You can kind of get away with it in this example, but its the 'Babe Ruth vs Modern Baseball or Muhammed Ali vs Modern Boxer' questions that get posed where it falls apart, never mind the 'who is the greatest motorsport driver' questions where the actual demands and rules of the sport change heavily over time.

      @wobaguk@wobaguk3 ай бұрын
    • yeah, the point about somehow getting rid of the comparative/relative nature of the data irritated me pretty much, you cannot turn the information "how many times did they win/lose AGAINST OTHER TEAMS" into a datapoint that gives us information independant of the teams they played against.

      @davejacob5208@davejacob52083 ай бұрын
    • That's what he says in the video.

      @GlacialScion@GlacialScion3 ай бұрын
    • @@wobaguk Yeah for a lot of sports that won't work. For some events like swimming, the sport has changed only a little, but times have improved a lot. Evidently, the average professional swimmer today is faster than the best professional swimmers from 50 years ago. So a Z-score comparison wouldn't be very helpful there.

      @EebstertheGreat@EebstertheGreat3 ай бұрын
  • To answer the question at the end, the lowest z-score to win in the end theat i found was 1900 VFL Melbourne team who won from 6th with 24 points (league average of 28). They had a z-score of -0.33072.

    @seanbucket@seanbucket3 ай бұрын
    • You got the result for a regular system, where it's just 1st place points = 1 place, not like Belgium/Australia?

      @TimmehTRP@TimmehTRP3 ай бұрын
    • 4 points for a win, 2 for a draw, 0 for a loss

      @seanbucket@seanbucket3 ай бұрын
  • 20:50 Yeah, just a 6.5, no big deal. For those who don't know, this has been cited as the most impressive statistics in all of sports.

    @mad_vegan@mad_vegan3 ай бұрын
    • Didn't the guy say he'd never seen one higher than 4? Then we end the video with a 6.5? Heck, the whole video maybe should have been about Bradman.

      @esotericVideos@esotericVideos3 ай бұрын
    • I plot Z-scores every day and all the axes cut off at 4. I've only seen a couple players barely above 4, 6.5 is ridiculous!

      @matthiasgreen4042@matthiasgreen40423 ай бұрын
    • @@esotericVideos He never saw higher than 4 for team games won in a season. Bradman data is not a team winning games

      @Kleyguerth@Kleyguerth3 ай бұрын
    • I sat through the whole video waiting them to get to Bradman.

      @CheeseAlarm@CheeseAlarm3 ай бұрын
    • He was beyond 6 sigmas away from the mean. There is a GE executive somewhere who just orgasmed talking about this.

      @joshuaychung@joshuaychung2 ай бұрын
  • That bradman fact at the end. He was just totally god tier. 20:49

    @djsmeguk@djsmeguk3 ай бұрын
    • A 1-in-25Billion performance! Edit: If there have been 117Billon humans in history, then only 4-5 humans have ever perfected their craft to this level.

      @Lashb1ade@Lashb1ade3 ай бұрын
  • Don Bradman at Z=6.5 is insane!

    @Mark_Williams.@Mark_Williams.3 ай бұрын
  • I spent half the video wondering about Don Bradman’s Z-score. Had a feeling it would be huge. Thanks for including that.

    @kiltysalter2966@kiltysalter29663 ай бұрын
  • If Matt Parker isn't already using this to comedically compare F1 driver Lewis Hamilton's career against NASCAR driver Jimmie Johnson's in order to troll as many people as possible, I bet he is after reading this.

    @cliftonchurch6039@cliftonchurch60393 ай бұрын
  • 7:45 No. It doesn't remove the issue at all, it just ignores it. It remains true that the same Z factor from different sets may mean totally different things as far as which team is better.

    @jursamaj@jursamaj3 ай бұрын
    • Yeah, it sounds like he has it backwards. You can't compare historical teams directly, but Z scoring lets you compare how they played relative to their competition.

      @davidgustavsson4000@davidgustavsson40003 ай бұрын
    • And in the NBA it's virtually impossible because of how dramatically teams can change from one year to another. In other sports you can use an ELO rating idea to compare teams from different times. But the further apart they are in time, the harder it is to draw a sound conclusion.

      @jonpaxman@jonpaxman2 ай бұрын
  • As a football data analyst, I can confirm that z-scores are used everywhere, for instance comparing metrics that are measured differently (i.e. goals scored and possession %). They also allow us to create compound metrics which are more reliable and meaningful. Normalizing data without losing the details of how it's distributed (as you would with percentile ranking) is fantastic.

    @matthiasgreen4042@matthiasgreen40423 ай бұрын
    • Why would you lose that data with a percentile ranking? A z-score can be trivially converted to a fractile and back, after all.

      @therealax6@therealax63 ай бұрын
    • @@therealax6 if your data perfectly follows the normal distribution that is true, but not with real data. For example the top competitor's percentile ranking will always be 100%, no matter how far ahead of the others they are. Z-scores however reflect that gap.

      @matthiasgreen4042@matthiasgreen40423 ай бұрын
    • @@matthiasgreen4042 That makes sense! I always assumed you'd just adjust fractiles for discrete data so that they represent the midpoint of a continuous range (e.g., you have four data points: use fractiles .5/4, 1.5/4, 2.5/4 and 3.5/4 instead of 0/3, 1/3, 2/3, 3/3), but that adjustment is probably too inaccurate to adequately capture the edges of the underlying distribution.

      @therealax6@therealax63 ай бұрын
  • Somewhere, Michael Jordan is taking this video personally...

    @derektrombley589@derektrombley5893 ай бұрын
  • Don Bradman having a Z Factor of 6.5 is insane

    @spudmanwsmb@spudmanwsmb3 ай бұрын
  • I just want to say that I recently have gotten a job tutoring math to fellow college students - something that i would have never even considered before stumbling upon Numberphile! math used to be a subject I lamented, now it's my favorite.

    @RetroMoviesAndGames@RetroMoviesAndGames3 ай бұрын
    • Throughout the years I've come to realise that people don't hate math; they hate math class. Math at general education levels (primary school, secondary school, entry-level courses at university, etc.) is taught in a way that makes it about as uninteresting and uninspiring as a subject can be. Students are taught to pointlessly memorise formulas that they then have to echo back in the correct order without knowing why or what for. Students are almost never exposed to actual problems that let them exercise their problem-solving skills, sadly.

      @therealax6@therealax63 ай бұрын
    • Same here!! I love getting opportunities to show people that math isn't so bad.

      @grahamwilson8843@grahamwilson88433 ай бұрын
    • @@therealax6yes, I was very lucky to have parents who were huge math nerds and multiple teachers who made the subject engaging… and even then, I still had huge math anxiety and took computer science instead of math at university to fulfill the requirement.

      @emilyrln@emilyrln2 ай бұрын
  • I am actually quite unsure if you can use the analogy of z-score to portray greatness. An underlying assumption is that the observations are i.i.d. (independent and indentically distributed) and they are clearly not independent. The total number of wins is equal for each season - if one wins, another one looses. And because you compare how much of an outlier you are relatively to the seasonal distribution, you end up just comparing how much your win share is different from the others - while their win distribution are dependent on eachother - which could lead to some weird constructs of high z-scores occuring more often than they actually should. I think a much better estimate is the combinatorical calculation of how likely that win number (with all the other teams win number also happening) is to occur for a season!

    @elkjaer28@elkjaer283 ай бұрын
    • I completely agree, using the Z-score of win totals across different sports leagues makes no sense. I don't think it's even very useful to compare across eras of a single sports league as many others have said. The wins aren't independent, the league average win rate is absolutely necessarily 41 wins in an 82 game season, and there is a limit on the standard deviation that's related to the number of games too. So comparing the biggest outlier for wins in the NBA vs the Premier league vs the MLB doesn't really work. Much less comparing Z-scores of wins in these leagues to independent statistics. My back of the napkin math says it's mathematically impossible for an NBA team's "win rate" Z-score to reach the Bradman batting score of 6.5 that's quoted. But that's not to discount that Bradman score as an incredible outlier. Use Z-score to show me how far above league average Jordan's midrange efficiency was and compare that to Curry's insane 3 point shooting and I'll listen, because you can use independent data points.

      @bberk1286@bberk12863 ай бұрын
    • Also, the Warriors literally lost the championship that season. They couldn't even win the most important game of the season against their contemporaries. Just saying.

      @B3Band@B3Band3 ай бұрын
    • @@B3Bandbro this is a MATH video

      @trademarked2476@trademarked24763 ай бұрын
  • The Z Factor sounds like a game show where teen contestants face each other in a battle royale-style TikTok dance-off.

    @unvergebeneid@unvergebeneid3 ай бұрын
  • Not comfortable with three aspects of the assumptions. First we take discrete data (not a particularly large dataset) and fit it to a continuous function. It is not possible to win 13.2 games of basketball and there is no possibility of winning 200 games in a season even though the normal distribution says there is a very small but finite probability. Second, the wins could be distributed very differently in different seasons. In one extreme the overall winners could win every game with all other teams winning just above or just below half of their games. In another extreme, the winning team could win by the narrowest of margins (overall points scored and same number of wins as the team that came second. Third, the winning margins etc could be identical but it could be that all teams in the second dataset are ALL either much better or much worse than in the comparison dataset.

    @marksteers3424@marksteers34243 ай бұрын
    • There are discrete distributions that could be used as well.

      @soloban81@soloban813 ай бұрын
    • @@soloban81 Agreed - though not mentioned in this video

      @marksteers3424@marksteers34243 ай бұрын
    • Yes, but the normal distribution is a close approximation to the discrete distribution (in this case probably a binomial) given large enough sample size and p not too close to 0 or 1. Agree with the other two points though :)

      @AbelShields@AbelShields3 ай бұрын
  • Very nice. I can understand this math and it is also put into context of "how exceptional" performances were.

    @bertblankenstein3738@bertblankenstein37383 ай бұрын
  • This approach is used all the time in physics. Theoretical physicists commonly plot in terms of dimensionless parameters to distill the physics without influence from individual parameters

    @ultramarathonman100@ultramarathonman1003 ай бұрын
  • Beautiful as always!

    @aaronb.5185@aaronb.51853 ай бұрын
  • It’s cute to see how Tom really knows some (mostly math) stuff but clearly also has no clue about other (mostly sports) stuff. :-)

    @MrConverse@MrConverse3 ай бұрын
  • I was expecting an ELO model video. I believe this is the best way to compare teams and performance across different leagues, sports, etc. Even adding some modifiers to the model can take into consideration home and away factors, best results against certain opponents and so

    @user-tp2tu8jp2x@user-tp2tu8jp2x3 ай бұрын
  • Assumption is that rules, athleticism, and technical skills have remained the same.

    @turmericgarage8509@turmericgarage85093 ай бұрын
  • The 1995-96 Bulls, actually.

    @fernandocarrazzoni@fernandocarrazzoni3 ай бұрын
    • Both 95-6 and 96-7 Bulls. Easily, too.

      @misterkefir@misterkefir3 ай бұрын
  • Would love to see the Z factors around Wayne Gretzky. Every statistical fact I've read of him seems unreal.

    @Iztari@Iztari3 ай бұрын
  • What about the '95/'96 Bulls? That was supposed to be the best year for the Bulls.

    @agmessier@agmessier3 ай бұрын
    • Also, the Warriors literally lost the championship that season. They couldn't even win the most important game of the season against their contemporaries. Just saying.

      @B3Band@B3Band3 ай бұрын
    • It was an error. He meant to use the 95/96 team.

      @B3Band@B3Band3 ай бұрын
  • yeah, you can argue which team is better in the context of their season. but you can't say which is more likely to win in a match on the grounds of that, for example it is easy to imagine the Chicago bulls data set coming from a wheel chair basket ball league team, identical numbers, but that team would lose badly against the actual Chicago bulls unless by some miracle wheelchairs beat legs. the direct comparison cannot be made between the teams, because the conclusion is a function of the dataset not the teams ability to play. it just means they were better in their time, playing the teams they did :).

    @monkerud2108@monkerud21083 ай бұрын
    • Also, the Warriors literally lost the championship that season. They couldn't even win the most important game of the season against their contemporaries. Just saying.

      @B3Band@B3Band3 ай бұрын
  • That's pretty interesting. Thank you. I feel like e's in the standard deviation there somewhere.

    @frankharr9466@frankharr94663 ай бұрын
  • Another interesting wrinkle in all this is that most sports statistics are competitive. Like, The 2015/16 Warriors getting so many wins reduced the number of available wins for all the other teams, because all of them played (and, mostly, lost) games against GSW. This makes it harder to meaningfully compare across eras, because your sense of how strong the competition is is, in part, self-correcting, so just dividing out the standard deviation doesn't necessarily account for it entirely. (For a toy example, consider a tournament where matches are decided by rolling a 6-sided die, but then one year they add a rule where every player adds 4 to their result. The win distribution should have exactly the same standard deviation, but even mediocre players from the +4 year will obviously have a huge advantage in hypothetical head-to-head matches against top players from other years.) This comes up a lot in baseball discussions: Modern pitching has advanced so far that it's pretty questionable whether Babe Ruth could even get to play in the major leagues today, whereas a replacement-level modern hitter sent back to his time would likely be a superstar because they'd trained against better opponents. I don't think any of this discounts Tom's actual point (in fact, I think it supports it) but it's interesting to think about how these statistics are shaped.

    @12tone@12tone3 ай бұрын
  • I really like the clarification of what you can actually compare using this methodology. And that there are a whole host of unlisted factors which contribute to a win in a head to head matchup. Love the video!

    @realdreamerschangetheworld7470@realdreamerschangetheworld74703 ай бұрын
  • So, applying this technique, who wins then? The 1994 brazilian world champion football team, Mike Tyson or the Nigerian bobsled team they made that movie about? In a more serious note: Does the quality of what makes a sport change little enough over time that these comparisons can be made? Is Basketball played primarily for 3 pointers the same sport as Basketball played mostly for 2 pointers?

    @pedroyochinori8371@pedroyochinori83713 ай бұрын
    • Really??? Nigerian bobsled team? Completely wrong continent dude

      @levibernard1838@levibernard18383 ай бұрын
    • @@levibernard1838 actually Jamaican, found it after searching. Movie name is Cool Runnings 😅

      @pedroyochinori8371@pedroyochinori83713 ай бұрын
    • The man compared two basketball teams in two different eras! The 3 points were there in the 1990s! While you are ridiculously trying to dismiss him by talking about three different sports! .. go away!

      @hus390@hus390Ай бұрын
    • Dismiss 😅? It's just a question. A reply with yes or with no and maybe some reasoning are both fine. I guess you can also try to dunk (get it?) on other people for internet brownie points too though. Cheers! 🎉

      @pedroyochinori8371@pedroyochinori8371Ай бұрын
  • Revisit this in 100 years please someone, see what has changed..

    @goldnutter412@goldnutter4122 ай бұрын
  • Nice Work & Video 👍

    @VINTERIUM..EXPLORIUM.1@VINTERIUM..EXPLORIUM.13 ай бұрын
  • Amateur NBA statistical analyst here! This concept is exactly what i use to compare player statistics across eras. I don't like using it for team comparisons, however, because throughout nba history the number of teams in the league has changed a bunch of times, and when theres fewer teams, one particular team's dominance counts against them more than when theres more teams. They play opposing teams more often, and so if theyre really good, opposing teams will naturally lose more of their games. For instance, in the 1967 nba season, the Philadelphia 76ers set an NBA record in winning 84% of their games (68-13). In the 1960s is was infrequent for the best teams in the league to win more than even 60 games, let alone 68. There were only 10 teams in the league at the time, however, so the bottom teams that year, like the Baltimore Bullets, who went 20-61, might seem like an left hand outlier in our distribution, but they played the 76ers 9 times and went 1-8 against them, whereas nba teams today with 30 teams play each other between 2 and 4 times. For this reason you have to exclude the team in question from the distribution. Doing so, the '67 Sixers have a z score of 2.82, compared to the 16 warriors score of 2.5 What i use z-score for is individual player statistics, and doing "inflation adjustments" between eras. I think this works much better because this means there are way way more data points and so you dont run into the problem of one outlier player inflating the standard deviation of the distribution, working against their own chances of producing a high z score

    @allank8497@allank84972 ай бұрын
  • Wait a second... so what happens when the stddev -> 0?? (i.e. all the teams tie, and the final winner is from a league-wide-tie-breaker)? That team, that simply won the tie-breaker, would have A Z-FACTOR THAT EXPLODES.. no..??

    @empmachine@empmachine3 ай бұрын
  • So I have often wondered about "adding dimensions" to this process. Like, if you wanted to examine two different but connected measures (say total wins AND points differential), could you just slap the second measure down as an orthogonal axis and glean anything useful from that? And if so, can you just kind of Linear Algebra + MV Calculus your way to more robust conclusions? It naively feels like it should be an option, but I can't even begin to conceive of the methodology you'd use to ... do it.

    @ANunes06@ANunes063 ай бұрын
  • Normally I have no clue wtf they’re talking about on numberphile. Today however, is intro to statistics stuff. A class I’ve taken 3 times. 😅

    @swill128@swill1283 ай бұрын
  • I'd like a tour of the office.

    @maverator@maverator3 ай бұрын
  • When you look into salary cap changes, you get a better feel for just how strong MJ was. There was two years where he got paid over 100% of the bulls normal salary cap, and all the other players signed for tiny contracts that were fully in the luxury tax area. If you gave Lebron James or Steph Curry 100% of a teams salary cap, and only put players around them that would sign for bare minimums, they would literally never win a single match.

    @trygswyrmwoodside3229@trygswyrmwoodside32293 ай бұрын
  • That skyscraper solve at the end was dire.

    @ThePeadar2211@ThePeadar22113 ай бұрын
    • "The numbers outside the grid indicate how many skyscrapers are visible in that direction" has nothing to do with height - I was wondering if that makes the outside numbers completely meaningless!?

      @maxid87@maxid873 ай бұрын
    • @@maxid87 It's a standard puzzle type but the way it's presented here is really confusing. The number indicates how many absolute increases are in that row or column. Or how many times a new height record is broken from that direction. So when the 6 is outside the grid that should indicate that there are 6 increases in that direction which is forced to be 1 2 3 4 5 6. But in this solve they've put a 6 in the first position so it can't increase any further.

      @ThePeadar2211@ThePeadar22113 ай бұрын
    • Yeah, putting a 6 inside the grid next to the 6 outside makes it clear they're not actually attempting to solve the puzzle. Also, they have two 2s in that left column, which breaks the "Latin Square" rule..

      @rmsgrey@rmsgrey3 ай бұрын
  • Pele mentioned, Brazil mentioned. Let's go!

    @alexandrecolautoneto7374@alexandrecolautoneto73743 ай бұрын
  • This is why statistics is the best 😄 Great video!

    @mathewlegge639@mathewlegge6393 ай бұрын
  • The biggest problem with this (in my mind) is that the two teams were playing under different sets of rules. The NBA changed rules over this time, so how are you accounting for the different sets of rules? Would the 1990s Bulls performed better under the new rules that promoted offense?

    @peregrinef3203@peregrinef32033 ай бұрын
    • Indeed. It would assist that one could use the same mathematics to compare, say, a basketball team and a javelin thrower, because everything becomes a unitless Z-score

      @phyphor@phyphor3 ай бұрын
    • Heck, you could compare a basketball team to a singer (where the number and length of time at number 1 in the charts is used as the metric), or to anything else that you can rank. I'm not sure I'm ok with that, though.

      @phyphor@phyphor3 ай бұрын
    • Also, the Warriors literally lost the championship that season. They couldn't even win the most important game of the season against their contemporaries. Just saying.

      @B3Band@B3Band3 ай бұрын
    • @@phyphorthat’s a lame way to dismiss the video! Dude compared same sports and same league but different eras. Not any meaningful changes in the rules. I still put my eggs on Bulls though!

      @hus390@hus390Ай бұрын
  • Chicago bulls with mj, any time, always. No math needed.

    @aonpl@aonpl3 ай бұрын
  • The Bulls’ 72-10 season was in 1995-1996 and not 1996-1997.

    @clarkaustin6294@clarkaustin62943 ай бұрын
  • Maths Schmaths - Lovely t-shirt!

    @collapsibletank@collapsibletank3 ай бұрын
  • Goes to show just how impressive it was that Cleveland beat GS that year!

    @thatguyintheback1656@thatguyintheback16563 ай бұрын
  • The Z-Score compares unique data sets assuming the competition level is equal. It doesn’t account for modern teams being more developed from decades of improvement in sports science, conditioning, & play-calling. It might work for 2 teams 5-10 years apart with similar strength leagues but over time most sports drift. At 14:55 he does mention that it doesn’t directly say who would win in a modern head to head game.

    @RobertJordan7@RobertJordan73 ай бұрын
  • The rules were also a bit different back then, so it’s difficult to compare.

    @ItachiUchiha-ns1il@ItachiUchiha-ns1il3 ай бұрын
    • Now I kind of want to see Sachin Tendulkar against a bodyline bowler.

      @DeGuerre@DeGuerre3 ай бұрын
    • Also, the Warriors literally lost the championship that season. They couldn't even win the most important game of the season against their contemporaries. Just saying.

      @B3Band@B3Band3 ай бұрын
    • @@B3Bandno meaningful changes in the rules. I still would put my eggs on the bulls’ basket

      @hus390@hus390Ай бұрын
  • Thank you so much

    @gameofquantity96@gameofquantity963 ай бұрын
  • Fascinating video! Backing warriors any-day though😅

    @hypejuice1321@hypejuice13213 ай бұрын
  • I would love to see how to correctly combine multiple types of data into one graph. Something like point per match and free throw conversion and win by how many points and so on... My first instinct is to transform everything in percentages... but its just wrong lol

    @CaioTrinchinato@CaioTrinchinato3 ай бұрын
  • Bradman practiced with a gold ball and a wicket as the bat Genius

    @goldnutter412@goldnutter4122 ай бұрын
  • Integrating from minus infinity to x1 makes sense for the mathematical function f(x) but not for the distribution of people's heights.

    @richardmellish2371@richardmellish23712 ай бұрын
  • This is a big thing in baseball stats - IE OPS+, ERA+ are all similar type metrics but just divide by the average instead of taking the z score. It would be the same ranking if you assume that the in season personal variance is not useful

    @dontich@dontich3 ай бұрын
  • So cool!

    @FedeDragon_@FedeDragon_3 ай бұрын
  • Is Steve Kerr playing or coaching then?

    @patrese993@patrese9933 ай бұрын
  • Is there anything invalid about finding the Z-scores for multiple data points and then averaging them to create sort of a "z-index"? I know often times doing things like that has unintended effects lol, but on the surface it seems like it might provide an even more "informed" comparison between the teams.

    @heeerrresjonny@heeerrresjonny3 ай бұрын
    • Depends on if you care about the result being correct. If you just want a result then it's fine

      @NaR00W@NaR00W3 ай бұрын
  • now I am interessting how strong Verstappen was in the view of the Z-factor

    @basieluxanno7909@basieluxanno79093 ай бұрын
  • Soooo - my question. Is there a way of more directly comparing teams from different years by trending the z-factors of ALL the teams, to create a lineage from team from era A to team from era B?

    @laz001@laz0012 ай бұрын
  • You might want to look into the ELO rating system. Its been in use for over 60 years in rating chess players, and probably will do a better job of comparing across different time periods.

    @awaedin@awaedin3 ай бұрын
  • I would like to see an average of the Z factors between several stats, from important like wins to seemingly unimportant like hand size and team average height. Also Pele’s 3.8 is amazing but what about Michael Phelps and his Gold medals?

    @clayspillars@clayspillars3 ай бұрын
  • Thank you! Finally a video on a normal topic!

    @zzzaphod8507@zzzaphod85073 ай бұрын
  • You could argue that the strongest team was the one that won the season with the lowest standard deviation.

    @christianellegaard7120@christianellegaard71203 ай бұрын
    • I think that doesn't account for the strength of the winner. Something like the ratio of Z-score to standard deviation might be better.

      @davidhasen7983@davidhasen79833 ай бұрын
    • Also, the Warriors literally lost the championship that season. They couldn't even win the most important game of the season against their contemporaries. Just saying.

      @B3Band@B3Band3 ай бұрын
  • I would love to see F1 comparision. Senna, Shumacher, Prost, Hamilton...

    @EliezerGrawe@EliezerGrawe3 ай бұрын
  • Baseball is the ultimate statistical nerd sport. Sheer number of data points is endless.

    @markconrad9619@markconrad96193 ай бұрын
  • Liverpool's Z score the year they won the league. Nice.

    @IrishEye@IrishEye3 ай бұрын
  • My half baked thought is how does fitting a standard distribution to the Scottish premier league work - afaik the SPL normally has 2 super teams (Rangers & Celtic) that win almost every game they play and then 'the rest' who are scrapping amongst themselves for the lower positions.

    @joinedupjon@joinedupjon3 ай бұрын
  • Please please do a part 2 focusing on various z-scored stats for Messi v. Ronaldo

    @gustrindade@gustrindade3 ай бұрын
  • I assume that if you simply ranked using win percentage you'd get the same ranking as using the z factor?

    @awaedin@awaedin3 ай бұрын
  • Can you please share datasets I can use to test this out?

    @magnus425@magnus4252 ай бұрын
  • Would be curious for point differential and goal differential

    @bmar1784@bmar17842 ай бұрын
  • Tom, can we set sigma not just as real value. But as Divisor function(perfect number?)

    @mykolanikolayev1714@mykolanikolayev17143 ай бұрын
  • I think the real champions are our dads when we were in primary school

    @unnamed7225@unnamed72253 ай бұрын
    • nope! lots of dads are not even remotely thought of in a positive light by their primary school aged children. There are even lots of examples in literature.

      @WrongParadox@WrongParadox3 ай бұрын
  • Just to be especially contrarian a month after my opinion would have been welcome: while I'm no basketball expert, my understanding is that the rule changes since Michael Jordan retired from basketball have also made the defense's job harder than it was when Jordan had to get in to score points. Back when the defense had more options for positioning, he was still doing incredible numbers each game. So you need to factor in the fact that the 2016 Golden State Warriors also had less obstacles to scoring than the Bulls did when Jordan played, meaning if he were playing under 2016 rule sets, he'd likely be scoring more than he was when he was playing professionally.

    @Yggdraseed@YggdraseedАй бұрын
  • As an engineer, you'll see some very impressive Z factors from manufacturing lines. Particularly in industries where performance is literally life or death. For example, in aviation engineering, a manufacturer may achieve Z factors of 7 or 8 for a given part made in large quantities. Anything above 6 is equivalent to a process making a faulty part less than one in a million times, hence there's an entire branch of engineering rigour known as Six Sigma.

    @Dr_Tim_Sidnell@Dr_Tim_Sidnell3 ай бұрын
  • This doesn't count rule changes. I think it depends on what rule set the twi teams have to play with.

    @likenem@likenem3 ай бұрын
  • He already explained clearly that this is just about one parameter (number of wins in the season) among peers of their period. If you want to level up the complexity, you could add more parameters to consider then get their combined Z scores. For “changing rules” adding complexity alone: you need to establish how you would have gotten that rule-changing factor in a numeric scale… that’s math for some other days. Again, you can’t ignore the bravery of them bringing this topic up while telling others his favorite Premier League team… and the comment with the most upvote proves people went out of their way to be emotional for just, one, parameter’s Z-factor analysis… (that’s in the field of psychology on team sports)

    @laggyy3987@laggyy39873 ай бұрын
  • For me, with this formula, you can prove that a kids team from a regional tournament can beat a professional team

    @CristianFalcas@CristianFalcas3 ай бұрын
    • It's meant more to be a fun maths thing. Everyone likes thinking about things like this with their favorite teams. But I mean I guess you can be a turd in the punch bowl about it too, if you want.

      @1104Tea@1104Tea3 ай бұрын
  • However, there cannot be a normal distribution of values whose distribution is fundamentally asymmetric. For example, the height of people. If it had a normal distribution, it would mean that, although very rare, there are people with negative height. The central limit theorem states that standard distributed are the *mean* value of peoples' height measured by different researchers or taken from various reference books. Not the heights of different people in any - even as big as UK - set.

    @NataliaBazj@NataliaBazj3 ай бұрын
  • Could you do the NFL by breaking down the games played into each individual play? For example, could you measure the offense's performance in terms of American Yards gained on the field per play against the Yards prevented by the defense on each play? Obvious pitfalls here is it doesn't account for long drives from one end of the field toward the goal that does *not* result in a score, but I think it would help to more accurately determine the output of the team's performance in term of their athletic ability?

    @MadMan123654@MadMan1236543 ай бұрын
  • My statistics brain is a bit rusty... Does it make a difference that data points for "Win %" are very much not independent? In that, for one value to go up, some one else's value *must* go down? Compared to say, "Total Points" which could be higher or lower for a given team without necessarily affecting another team's outcome.

    @blinblue@blinblue3 ай бұрын
  • This is very flawed unless you intended to compare the relative percormance compared to their respective fields. To me it sounded like you wanted to compare who would win in a match between the two basketball teams if they were to play against one another.

    @DekarNL@DekarNL3 ай бұрын
  • Right on time 😂❤

    @realdreamerschangetheworld7470@realdreamerschangetheworld74703 ай бұрын
  • Doing this for Gretzky's stats and comparing to other sports' "greats" is always hilarious.

    @ErixTheRed@ErixTheRed3 ай бұрын
  • Me, an elder millennial: *Nods along, understanding, and agreeing. The video: ...so Michael Jordan's Bulls weren't quite as good as The Curry Warriors. Me: This is such bullshit.

    @thecompletejake@thecompletejake3 ай бұрын
  • A lot happened in '96, so this all depends on whether Jordan has met and trained with the Cartoon All-Stars and saved the planet from the Monstars already. You'd also have to ban his world-destroying Chaos Dunk, for obvious reasons.

    @batlrar@batlrar3 ай бұрын
  • As someone who grew up in Chicago-land in the 90's, it's obviously the Bulls.

    @srwapo@srwapo3 ай бұрын
  • If you were talking about the actual best team, you would have to team up the 90's team with the 2020's teams, and in that case I think the old teams would lose every match. If you look at a more interesting and much more popular sport, like juggling for instance, almost no-one could even do 7-clubs juggling in the 90's, but now "everyone" does it. I believe this massive improvement is probably occuring in lesser sports like football and basketball as well.

    @AndersHaalandverby@AndersHaalandverby3 ай бұрын
  • Acknowledging, as you said yourself, a limitation of this analysis is "based on level of skill at that time," let's just assume that the average level of competition in different times is equal, then maybe you could do the following statistical calculation: Given z1 = 2.06 (Bulls) and z2 = 2.54 (Warriors), what is the probability that Warriors would win over Bulls? Or for another way of looking at it, if the two teams would play 100 games, what's the most likely split of wins between them?

    @kevinstewart2572@kevinstewart25723 ай бұрын
  • This method allows to compare competitivity but it cannot answer the question as stated. Because it cannot estimate technicity (or team and personal performances, each time you try, you'll confront their data to some sets of numbers of their adversaries in their era). Each generation its own. Young pele at his peak couldnt shine in today's football the same. However you choose your dataset, choosen numbers are anchored to their generational context. So after such a time gap, it's mostly possible competitors became stronger in their art. Adversity grows, competitivity oscillates (it's also circumstantial), and technicity grows. Thus statistics are super important and powerful on medium periods.

    @kirkanos771@kirkanos7713 ай бұрын
  • Could you check the Z factor of all teams in the league for each season and then measure who had better win records against the rest of their respective leagues' Z factors? That could determine a more answer the question better.

    @GhettoFabulousLorch@GhettoFabulousLorch3 ай бұрын
  • I always wondered how the positive values like human height can be normally distributed.

    @ccoderproject@ccoderproject3 ай бұрын
    • it's not, it's really close because the mean is high and the standard deviation is small, so the non-negative effect is negligible, it's just another instance of central limit theorem

      @oldcowbb@oldcowbb3 ай бұрын
  • I wonder how Max Verstappen's '23 season looks on the Normal Standard.

    @00blaat00@00blaat003 ай бұрын
  • 0:12 MJ's 1996-97 Bulls by 10. Easily. And 1995-96 team was even better.

    @misterkefir@misterkefir3 ай бұрын
  • Isn't it possible that ALL the other teams in 2001 were really bad, but the same amount of bad as each other? The distribution being tighter doesn't have to mean that the competition was tougher. In fact, I don't see any relation between those two things.

    @steeevealbright@steeevealbright3 ай бұрын
  • Is Ole Einar Bjørndalen the King of Biathlon? I wonder if there is a calculation of biathletes z-factor per year and since they are individual athletes, whether the evolution of the z-factor for all athletes can allow us to do two things: normalize an athletes performance over time and hence make an actual inter-generational comparison (without the mess of players changing teams). Trouble is, I would have to work with ranks in races not wins...

    @SubtleForces@SubtleForces2 ай бұрын
  • Probabilities are chaos. Keep working.

    @777anarchist@777anarchist3 ай бұрын
  • Z Factor sounds way too cool for something so mundane

    @oldcowbb@oldcowbb3 ай бұрын
  • put the like. let's watch the video now.

    @_ilsegugio_@_ilsegugio_3 ай бұрын
  • I would like to have the first brown paper...

    @hidalgobuscato7957@hidalgobuscato79573 ай бұрын
  • Who's better Ninja Turtles or Street Sharks. Figure that out math boy

    @NaR00W@NaR00W3 ай бұрын
    • He ran out of paper so had to scribble on his arms.

      @flickingbollocks5542@flickingbollocks55423 ай бұрын
    • ​@@flickingbollocks5542 On his arms and probably elsewhere too, but I'm not sure I want to know...

      @R_V_@R_V_3 ай бұрын
KZhead