BuzzerBeater Forums

BB Global (English) > Skill cap testing

Skill cap testing

Set priority
Show messages by
This Post:
00
155261.38 in reply to 155261.36
Date: 9/3/2010 8:00:22 AM
Overall Posts Rated:
155155
I would also like to see a comparison between your approach and others, with the same data.


I think I am pretty much considering all possible options at the moment. I mean, I have never been a believer in potential sub-levels, but I am trying to make sure they are considered in the model. And I am allowing for the possibilty that the formula varies by position. If there is anything else that I am missing, I would appreciate it if you could tell me. ;-)



Personally, I am fond of the method of Josef-Ka, and what we could do with how8 (now, you just need to add +6% because of the reforms of salary, but it still is nicely accurate).


This assumes that salary is directly linked to potential. Maybe that is the case, I don't know. But this model also allows for that possibility.


For skills, you have to trust the fact that players from the data you received have truly capped quite enough.
In my opinion, there is truly a risk that it won't give tools accurate enough.


Indeed, there is a risk that the data is garbage. So what do you propose? No study at all? Actually, my major concern is that if there are sub-levels, then finding any equation to fit the data becomes very difficult and perhaps impossible.


In any case, it obliterates for example the idea that skills have different coefficients, according to their proportion and the position (like with how8).
So, I truly hope that there will be a comparison between both methods.


I don't follow you here, since the main model I am using is in line with Joesph Ka's formulas. I am open to there being a different formula for skill caps (perhaps a more linear one, as Joseph Ka guessed in another topic). So I am testing different things. But to be honest, the data is just not there at the moment to say anything about possible different models.



Otherwise, for your theory, I think that SF with a low or medium potential would be really among the most interesting ones to analyze, as they would be the ones that would permit the most directly to precise the strength or the limits of a relationship between the position and the potential cap, basing on your hypothesis that there is not.


If you see my reply to Joseph Ka above, you will see that I am testing both possibilities (no position vs position in the model). But actually, I do have a couple of these low potential SFs thanks to some of my friends. Although, I could always use more. But I would not just pick on SFs, any player with a potential outside of allstar would really help.



But the most interesting application will be to help trainers to foresee more precisely when a player will be truly close to the cap, and how much will be his salary.


Well, to me, the most interesting thing would be to know if it is actually salary that determines the cap. Because if it is not salary, then in theory you could take a low potential player and train him longer, provided you knew what you were doing. Now, such a player might not be so desirable. For example: I am sure not many people would want an all shot blocking player. But who knows. ;-)



So anyway, I do not only hope that there will be a comparison. I also hope that you will go on in your own way by interesting yourself to salaries, later ;)


In the end, I have realized that it does not matter if I include salaries in the model. Because if indeed the cap is determined directly by salary, what I should end up with is just Joseph Ka's coefficients back again (or something close to them, at any rate). But if I end up with something different enough, well, I will let you draw your own conclusions. ;-) In any case, I am gathering salary whenever possible. So it is certainly something I will look at.

Last edited by HeadPaperPusher at 9/3/2010 8:02:19 AM

Run of the Mill Canadian Manager
This Post:
00
155261.40 in reply to 155261.38
Date: 9/3/2010 8:36:45 AM
Overall Posts Rated:
102102
Indeed, there is a risk that the data is garbage. So what do you propose? No study at all?


Absolutely not. On the contrary.
I will follow this study. I consider the project interesting and trying to understand the way skills work is part of the game pleasure.
I also try to lead a study about skills in french communauty and I know how hard it may be to get data.

About the precision of data, I just think that it might be interesting to distinguish players according to 2 categories :
- those that truly didn't cap since many weeks (6-7 perhaps)
- those that are really slowing down since 6-7 weeks, but that have upped.

the first ones would be not numerous but they might be considered as value spots (i'm not sure if that' the right term in english)
Other ones, already classified per potential, would be also used according to your model, but the possibility that they didn't really cap would be informed, and would also permit to consider the evolution of skill cap according to the way a player seems to approach from it.

Just an idea for organizing data you have right now by considering their quality, before you really get a critical amount of data.

Good luck


Last edited by Dunker Joe at 9/3/2010 8:45:09 AM

BBF, le forum francophone : = (http://buzzerbeaterfrance.forumpro.fr/)
This Post:
11
155261.41 in reply to 155261.40
Date: 9/3/2010 9:42:50 AM
Overall Posts Rated:
155155


the first ones would be not numerous but they might be considered as value spots (i'm not sure if that' the right term in english)


As I mentioned before, I am keeping track of which data points I consider to be more precise (ie: a person who is able to give me a better history of their player's training and who has definitely trained their player long enough without a pop). I guess I could put an "accuracy" class variable in the model. Although at the moment, I have been simply doing two runs: one with and one without these data points. Which I guess amounts to the same thing.


Good luck


Thanks. If you have other ideas, let me know. In particular, I am still waiting for better ideas to model in the presence of error. Joseph Ka's idea was simply to get a lot of data. If it were only that easy. ;-)

Run of the Mill Canadian Manager
This Post:
22
155261.42 in reply to 155261.39
Date: 9/3/2010 10:54:52 AM
Overall Posts Rated:
102102
Lets make ID/IS/RB and other stats invisible then, it'll make the game more challenging.


That's your comeback? Okay, let me do the same to you and reduce your argument to the absurd. How about everytime we log in, we have a pop-up telling us what to do? "Click here to win in BB." Would that make it less hard for you?

We already have guidelines on which approximate salary range a player might most likely cap.

Last edited by kLepTo at 9/3/2010 10:55:11 AM

This Post:
22
155261.43 in reply to 155261.33
Date: 9/4/2010 4:19:36 AM
Overall Posts Rated:
4040
I think that the idea is that who is active and is searching for the information, will have kind of advantage in comparison with these which just play a game as it is. From the rules, if you are reading carefully, is evident, that they are incomplete for purpose.

This Post:
00
155261.44 in reply to 155261.4
Date: 9/5/2010 4:33:45 AM
Overall Posts Rated:
225225
So far, with 20 data points, I have a model that fits very well. An r-squared of almost .99. It says the following very weird results:

-The BB defined position does not seem to matter for capping purposes.
-The model works much better without an intercept (.99 vs .46 r-squared).
-Jump shot and rebounding matter a lot for skill capping. The rest of the skills have some contribution, but are pretty minor in comparison. Some skills are even so minor they are hardly worth mentioning.

If you just toss the position on the right-hand side of the equation, and with so few data points, it's pretty much a foregone conclusion that the variable will be meaningless. If you would really like to investigate positions, run separate regressions for each position when you get enough data points.

I don't see how having a better fit without a constant is weird. This simply states that a player that has capped at zero skills has zero potential (that's assuming that you keep the potential level on the LHS).

If potential capping is in any way similar to salary, then there is no wonder that some skills return insignificant coefficients. Skill weights are different per position, so you're getting significant coefficients for the position that occurs most often in your small sample.

There is a lot to explore in this, but a lot more data is needed. I'd say 50-100 datapoints per position, in order to be able to read anything from the results.

"I don't know half of you half as well as I should like; and I like less than half of you half as well as you deserve."
This Post:
00
155261.45 in reply to 155261.44
Date: 9/5/2010 2:00:35 PM
Overall Posts Rated:
155155

If you just toss the position on the right-hand side of the equation, and with so few data points, it's pretty much a foregone conclusion that the variable will be meaningless. If you would really like to investigate positions, run separate regressions for each position when you get enough data points.


I did this and it does generate different results. But it still does not prove anything. If there are enough data points by position, the class variable should be significant (ie: add something to the model) or not. Just getting different coefficients by position proves nothing, especially with so little data. I could take a number of variables, significant or not, out of the model (inside d, shot blocking, etc) and it would still change the coefficients.


Skill weights are different per position, so you're getting significant coefficients for the position that occurs most often in your small sample.


Maybe, I don't know. But suffice it to say that I will test everything, although as you said the data is rather limiting at the moment.


There is a lot to explore in this, but a lot more data is needed. I'd say 50-100 datapoints per position, in order to be able to read anything from the results.


Even 50 datapoints would imply at least 250 observations. So even if true, it seems like it will be a long time before this study will finish. There was a lot of excitement when the study started but now the data is only barely trickling in.

So, your concern is noted. And while I appreciate that you have a good deal of knowledge with numbers, what you could help more with is a solution to the possible issue of sub-levels on the cap. For example: how to prove they exist (or not) and if they do, how to model appropriately.

Run of the Mill Canadian Manager
This Post:
22
155261.46 in reply to 155261.45
Date: 9/5/2010 3:37:54 PM
Overall Posts Rated:
225225
So, your concern is noted. And while I appreciate that you have a good deal of knowledge with numbers, what you could help more with is a solution to the possible issue of sub-levels on the cap. For example: how to prove they exist (or not) and if they do, how to model appropriately.

The trick in figuring out potential potential sublevels is making sure you don't confuse them with skill sublevels. My best guess would be using the "current" salary level (or DMI in proficient game shape) as a regressor to control for unobserved sublevels in skills. If the per-position regressions produce significant differences between predicted and observed values of potential (or high error terms), this indicates differences within potential levels.

The problem with this approach is a certain collinearity on the right-hand side, since salary is also a product of skill levels. This may be addressed by running a regressin using just the salary levels (this will test against the assumption that the potential "value" and salary are linearly proportional).

This is mostly brainstorming, and I am sure there are at least a handful of users here who know how to work data, and can toss in their own ideas (Coco?).

All in all, it's not that I have concerns, per se. I do think that it is great that someone is willing to do this type of "reverse-engineering", since it's long overdue. I don't mean to come through as condescending, just trying to provide some pointers about data work, since I do have some experience there.

"I don't know half of you half as well as I should like; and I like less than half of you half as well as you deserve."
This Post:
00
155261.47 in reply to 155261.46
Date: 9/5/2010 8:14:09 PM
Overall Posts Rated:
155155
Actually, thanks to your message I was thinking about it today and realized that you are indeed right. You are right that one variable by position is not the right approach. I actually need 5 indicator variables (one for each position). Or I could just do one model per position, but that leaves no way to test for the significance of position.

As for regressing using current salary as a regressor, this again leaves potential as a y-variable. And in that case, if there are sub-levels on potential (ie: error), it still leads to a biased model.

Run of the Mill Canadian Manager
Advertisement