EU Team Elo is back
posted in Projects
#31
[quote=Caeli]how was this game not taken into account for our team??
http://logs.tf/1640993[/quote]

7 players were used on the other team. It may have been a legit scrim with a sub or the other player may have been logged by mistake when they didn't actually play, but the system has to have simple limits to be feasible. Catering for every corner case would increase the workload massively.
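A minimal sketch of the kind of simple limit described here, assuming a 6v6 log is only accepted when each side fields exactly six players. The `is_rankable` helper and the dict layout are made up for illustration; this is not the site's actual code or the logs.tf format.

```python
from typing import Dict, List

ROSTER_SIZE = 6  # assumption: 6v6, so a 7th logged player disqualifies the match


def is_rankable(teams: Dict[str, List[str]], roster_size: int = ROSTER_SIZE) -> bool:
    """Accept a match only if both teams logged exactly `roster_size` distinct players."""
    if len(teams) != 2:
        return False
    return all(len(set(players)) == roster_size for players in teams.values())


# Hypothetical player IDs: Blue has a 7th player logged, so the match is skipped.
match = {"Red": list("abcdef"), "Blue": list("ghijklm")}
print(is_rankable(match))
```

A rule this blunt will throw away some legitimate scrims with a sub, which is the trade-off against handling every corner case.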

#32

I don't know if it's of any use, but just in case, it happened again
http://beta.tf2playerrankings.com/TeamRank/Match/Detail/2721
(This time we won tho :p But still, the overall maths gets flawed)

#33
[quote=Ombrack]I don't know if it's of any use, but just in case, it happened again
http://beta.tf2playerrankings.com/TeamRank/Match/Detail/2721
(This time we won tho :p But still, the overall maths gets flawed)[/quote]

I haven't given different tiers their own Elo because I don't know what it should be. If I don't record games between them I will never know. There is also no other mechanism that could leak Elo or Trueskill points into the upper tiers. I'll adjust the Elo of the various tiers when I get enough data to make a judgement. These games will continue to be recorded and processed.

#34

I've run an update on Elo and Trueskill to reflect the tiers teams play in. Open has the lowest Elo and Trueskill, Mid is unchanged, High has been given a boost and Prem has also been given a boost.

It's possible that these changes are a little strong. I'm hesitant to have all High teams rated above Mid teams when some of them are certainly weaker, but I have to update the tier as a whole, and a comparison of overall win/loss rate between the two divisions justifies the change.

This now gives teams in lower tiers more of a ranking point incentive to take on teams in a higher tier.

Trueskill is a more complicated consideration because of its uncertainty element. I've boosted those ranks, but we'll have to see how that artificial change plays out over time.

#35

nunya confirmed bottom 3 high team by Trueskill.

#36
[quote=Setsul]nunya confirmed bottom 3 high team by Trueskill.[/quote]

The (probably not really) interesting thing about Trueskill is that it's a two-part rating system: the rating itself and a level of uncertainty about that rating, from which you can generate a conservative rating figure (shown on the site).

Trueskill only gains confidence about a team's rating when it has both wins and losses to look at. Because Nunya lose so many games, Trueskill is not really sure where the bottom of their rating is, so its conservative estimate of their position is really low despite the boost they received.

By the same token, 7's rating is far higher, but they never lose, so Trueskill isn't really sure how high their skill level is compared to the field, particularly given their relatively low volume of scrims.
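To make the two parts concrete: a Trueskill rating is a mean (mu) plus an uncertainty (sigma), and a common conservative figure is mu minus three sigma. A small sketch under those assumptions; the multiplier of 3 is the usual convention rather than something confirmed for the site, and the numbers are invented, not real team ratings.

```python
from dataclasses import dataclass


@dataclass
class Rating:
    mu: float     # estimated skill
    sigma: float  # uncertainty about that estimate


def conservative(r: Rating, k: float = 3.0) -> float:
    """Conservative skill figure: the mean minus k standard deviations."""
    return r.mu - k * r.sigma


# Invented numbers: same mean, different uncertainty.
settled = Rating(mu=25.0, sigma=2.0)
unsettled = Rating(mu=25.0, sigma=6.0)
print(conservative(settled))    # 19.0
print(conservative(unsettled))  # 7.0
```

Two teams with the same mean can sit far apart on the displayed figure purely because the system is less sure about one of them.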

#37

I know.
It was a joke.
God damn mathematicians.
You've destroyed the nunya elo memes before they even started.

You can make it up to me if you show me some stats. :D

#38
[quote=Setsul]You can make it up to me if you show me some stats. :D[/quote]

99.9% of everything collected is output to the stream or shown on the ranking site.

#39

Did you change anything about how Elo works today? Last night when I checked, my team had just above 1100 Elo and now we're suddenly at 950. Everyone's Elo in Open div seems to have been lowered as well.

[spoiler]muh elo ;([/spoiler]

#40
[quote=Racso]Did you change anything about how Elo works today? Last night when I checked, my team had just above 1100 Elo and now we're suddenly at 950. Everyone's Elo in Open div seems to have been lowered as well.

[spoiler]muh elo ;([/spoiler][/quote]

read above?

#41
[quote=Racso]Did you change anything about how Elo works today? Last night when I checked, my team had just above 1100 Elo and now we're suddenly at 950. Everyone's Elo in Open div seems to have been lowered as well.

[spoiler]muh elo ;([/spoiler][/quote]

Read post #34 above

#42

#40 and #41: my bad, and thanks

#43
[quote=GentlemanJon]It's just for fun at the moment but as the ratings become more accurate over time the results should become more interesting[/quote]

Maybe they become more accurate on whether or not the team in question will win or lose, but the scoreline prediction always has the winner getting 5 rounds (or 6, in case of golden cap).

#44
[quote=Collaide][quote=GentlemanJon]It's just for fun at the moment but as the ratings become more accurate over time the results should become more interesting[/quote]

Maybe they become more accurate on whether or not the team in question will win or lose, but the scoreline prediction always has the winner getting 5 rounds (or 6, in case of golden cap).[/quote]

Elo has nothing to say about the number of rounds that will be scored, only how close it thinks the game will be. 5 is chosen arbitrarily as a reasonable proxy value for the winning team's rounds.
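For reference, standard Elo only yields an expected score between 0 and 1 for each side. The sketch below shows that formula, plus one hypothetical way to turn it into a scoreline with the favourite pinned at 5 rounds. The `proxy_scoreline` mapping is an assumption made up for illustration, not the method the site uses.

```python
from typing import Tuple


def expected_score(rating_a: float, rating_b: float) -> float:
    """Standard Elo expected score for A against B (0.5 = evenly matched)."""
    return 1.0 / (1.0 + 10.0 ** ((rating_b - rating_a) / 400.0))


def proxy_scoreline(rating_a: float, rating_b: float, winner_rounds: int = 5) -> Tuple[int, int]:
    """Hypothetical mapping: favourite gets `winner_rounds`, underdog is scaled by the odds."""
    e_a = expected_score(rating_a, rating_b)
    e_fav = max(e_a, 1.0 - e_a)
    underdog_rounds = round(winner_rounds * (1.0 - e_fav) / e_fav)
    if e_a >= 0.5:
        return winner_rounds, underdog_rounds
    return underdog_rounds, winner_rounds


print(expected_score(1500, 1500))   # 0.5
print(proxy_scoreline(1500, 1500))  # (5, 5)
```

Whatever mapping is used, the 5 is fixed up front; only the other side's number carries any information from the ratings.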

#45

With the classic ESEA ruleset, 5 rounds for the winning team would actually be a fairly reasonable assumption. So with that ruleset the system might actually get you the score reliably, +- a round here and there obviously.

#46
[quote=Setsul]With the classic ESEA ruleset, 5 rounds for the winning team would actually be a fairly reasonable assumption. So with that ruleset the system might actually get you the score reliably, +- a round here and there obviously.[/quote]

It would probably need some work on Elo to make it more responsive to the result, like scaling the K factor by a log of the winning margin. 7 would be further ahead than they currently are if that was done, for example.
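A rough sketch of what scaling K by a log of the winning margin could look like. The base K of 32, the `1 + ln(margin)` multiplier and the function names are all assumptions for illustration; nothing here is taken from the site's code.

```python
import math
from typing import Tuple


def expected_score(rating_a: float, rating_b: float) -> float:
    """Standard Elo expected score for A against B."""
    return 1.0 / (1.0 + 10.0 ** ((rating_b - rating_a) / 400.0))


def update_elo(rating_a: float, rating_b: float,
               rounds_a: int, rounds_b: int,
               base_k: float = 32.0) -> Tuple[float, float]:
    """Return new (rating_a, rating_b) with K scaled by the round margin."""
    if rounds_a > rounds_b:
        score_a = 1.0
    elif rounds_a < rounds_b:
        score_a = 0.0
    else:
        score_a = 0.5
    margin = abs(rounds_a - rounds_b)
    # Assumed multiplier: 1 + ln(margin), so a 1-round win keeps the base K
    # and a 5-round win gets roughly 2.6x; draws also keep the base K.
    multiplier = 1.0 + math.log(margin) if margin >= 1 else 1.0
    delta = base_k * multiplier * (score_a - expected_score(rating_a, rating_b))
    return rating_a + delta, rating_b - delta


print(update_elo(1500, 1500, 5, 4))
print(update_elo(1500, 1500, 5, 0))
```

With a multiplier like this a team that wins 5-0 every time pulls away faster than one scraping 5-4 wins, which is the effect described for 7. The exact curve would need the trial and error any K tuning does.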

#47

Yeah, obviously. The more interesting conclusion is the opposite though. With windifference 5 you'd have to account for both maps and teams on top of Elo to even get a basic idea of rounds played.
I wouldn't want to do that, whereas adding scaling to K takes a few seconds and then a bit of trial and error.

So I'm not sure if "expected rounds" really adds anything to the stats. Most of the time it's going to be wrong. Winning chance seems like more than enough information.

Or do you plan on making the expected rounds more accurate? Imho it would be a bit overkill.

#48
[quote=Setsul]Or do you plan on making the expected rounds more accurate? Imho it would be a bit overkill.[/quote]

No, I don't track the data that would even allow me to start making precise estimates of scores.

#49

7 only lost 1 game all season, against champ.gg, but they were playing Stark on medic. Arctic Foxes took Elo off them a couple of times, but only by getting draws.

#50
[quote=GentlemanJon]7 only lost 1 game all season, against champ.gg, but they were playing Stark on medic. Arctic Foxes took Elo off them a couple of times, but only by getting draws.[/quote]

I'm pretty sure nR and ams' team beat them in scrims at least once

#51

Will this be a thing for s27?

#52

^ yeah would really appreciate if this could be updated for season 27 :p

#53

yes please

#54

My understanding is that ETF2L are doing their own version. Adding a div has meant it needs a load of changes and I don't have time to put them in place
