Algorithm of 400 Clan Ranking Algorithm

by **jpcloet** on Fri Mar 25, 2011 9:23 am

danryan wrote:A lot of very odd looking results in that output...

Yes but at some point you say clan x is overrated and clan y is under rated and they should battle, but that is not happening. In theory it should sort itself out. The same principle applies to the leap. Speaking of that, I have put an updated model in the CLA for them to review/discuss.

by **Leehar** on Fri Mar 25, 2011 9:24 am

danryan wrote:A lot of very odd looking results in that output...

Not really, it's mostly understandable how it came out.
Maybe if you add some restrictions like minimum wars played or only privies/medals using ones it could weed out some of the more wonky ones. (Or it could just be that I'm looking forward to seeing Nem at #5 :lol:

)

I'm actually trying to just discount the 'z's completely since they're no longer existing clans I presume

by **jpcloet** on Fri Mar 25, 2011 9:26 am

Leehar wrote:I'm actually trying to just discount the 'z's completely since they're no longer existing clans I presume

yes, the z-clans are dead clans. The z was put in front so that we could display better in the clan results thread.

by **Dako** on Fri Mar 25, 2011 9:32 am

Also, if you add criteria with 3 wars completed - then the results should be much better.

by **jpcloet** on Fri Mar 25, 2011 9:38 am

Dako wrote:Also, if you add criteria with 3 wars completed - then the results should be much better.

In one of the ELO versions I had, I used a display factor of (.5 for wars of 1 or 2, .75 if 3 wars and 1.00 if 4 or more wars) You could do something like that. I called it a Reliability Rating.

by **FarangDemon** on Fri Mar 25, 2011 10:53 am

danryan wrote:
FarangDemon wrote:I should add some sort of filter to only include clans that have played a minimum number of wars in the ranking.
It is only meaningful after a certain amount of data.

I'd say 3 wars or a total of 100 games played as a minimum amount of data, then it would make sense. The trouble is, some decay needs to be in there as well as clans' strength change over time. This is a hard nut to crack.

Yes, I think we'd want a 100, or maybe 150 or 200 game minimum for a clan to appear in the ranking.

As for decay, I agree with you. That's why I created a Data Window parameter. Using the default parameter (1 year), I intended for only the past 1 year of games to be included to determine the performance rating. Either jpcloet increased this value or there is a bug I need to fix.

jpcloet, which value did you use for Data Window?
Also, please provide me the data files you used so I can investigate.

by **jpcloet** on Fri Mar 25, 2011 10:58 am

FarangDemon wrote:jpcloet, which value did you use for Data Window?

I used all of time, so I set it to 5 years.

by **FarangDemon** on Fri Mar 25, 2011 6:19 pm

jpcloet wrote:
Dako wrote:Also, if you add criteria with 3 wars completed - then the results should be much better.

In one of the ELO versions I had, I used a display factor of (.5 for wars of 1 or 2, .75 if 3 wars and 1.00 if 4 or more wars) You could do something like that. I called it a Reliability Rating.

This is definitely a good idea. I appreciate you testing out this metric and your feedback, as I know you have a lot of experience doing this.

I came across the algorithm of 400 by playing chess online on Gameknot. The way they implemented it, if you played under 10 games, your score was computed the same as everybody else, but it was noted as being "provisional". Perhaps I could add option to include/exclude provisional clans from the ranking list.

jpcloet wrote:I used all of time, so I set it to 5 years.

It's not necessary to set it to 5 years if you want to take 5 year-old data into consideration. It's a bit confusing. It will help if I explain a little of what exactly the program does.

The way the program works, it starts at the first challenge, no matter what the Data Window value, and adds challenge by challenge, recomputing each clan's rating as it goes.

In order for this to work, every time I calculate the rating for a clan, I must pass a time parameter. The time parameter specifies what point in time I am calculating the rating. This is the date used to establish the data window that determines which challenges are counted.

Each clan war provides one data point, which is either +/- 400 of the opponent clan's rating at the time of the challenge. For ties, the opponent's rating is used unaltered. Data points are dated to the time of conclusion of the challenge.

For example, when I calculate THOTA's rating for Mar 25, 2007, it will only be looking at challenges that happened prior to that date. If Data Window is set to 1 year, the rating will be computed by games-weighted averaging the data points (i.e. results from challenges) between Mar 25, 2006 and Mar 25, 2007. However, given that one of these THOTA challenges (against Z-Nihilists) concluded on June 6, 2006, the value of that data point would be based on Z-Nihilists rating at that time (June 6, 2006), meaning it would have been computed by averaging data points of the Z-Nihilists' challenges between June 6, 2005 and June 6, 2006. If the Z-Nihilists played against Z-Beelzebubs on July 14, 2005, it would then need to calculate rating of Z-Beelzebubs on July 14, 2005. This required looking at all of Z-Beelzebubs' challenges between July 14, 2004 and July 14, 2005. And so on and so forth until the beginning of time.

So since it starts at the beginning, you can be assured that all 5 years of data will be used as the algorithm computes and recomputes ratings over the past 5 years no matter what the Data Window setting.

by **FarangDemon** on Sat Mar 26, 2011 1:34 am

I've updated the script.

http://www.killersapp.com/algorithmof400.htm

Decay Factor

There is now a decay factor. If the Data Window is set to 2 years (default), then a war today is weighted 100%, a war 6 months ago is 75%, 12 months ago is 50% and 18 months ago is 25%.

decay factor = Elapsed time from start of data window to clan war record / total length of data window

The weight changes continuously, day by day, so that over time, a clan's score will will not perceptibly spike up or down except immediately after conclusion of a new clan war.

Min Weight to Display in Ranking List

Restricts display of clans in the ranking list to those with minimum weight total. Total weight = Sum of products (decay * games) for each clan war record within data window. This ensures reliability of ranking.

by **shocked439** on Sat Mar 26, 2011 1:52 am

JP why are results from 3 years ago important? Clans are fluid and the skill of the clan 3 or four years ago don't matter. A true ranking should show who you are now. It should have a beginning and an end to the ranking period and take into account the results during that period. This system has merit if used appropriately but the results of the first ever clan war don't matter now, maybe in a historical ranking but does the 1927 Yankees winning the world series have any impact on the 2011 Yankees winning it? Nope. So define a clan year and base the rankings on challenges begun during that year.

by **jigger1986** on Sat Mar 26, 2011 1:58 am

This is some interesting info and I thank you for it. According to FD's initial calculations there we alot of surprises in the league.

by **Qwert** on Sat Mar 26, 2011 6:51 am

farang can you give results start from 2011.
If i understand correct-every clan start with 1000 pt.
Can you tell me if THOTA play against T4c- how many pts will get thota if win,and how many pts will get T4C if win?

by **jpcloet** on Sat Mar 26, 2011 7:17 am

shocked439 wrote:JP why are results from 3 years ago important? Clans are fluid and the skill of the clan 3 or four years ago don't matter. A true ranking should show who you are now. It should have a beginning and an end to the ranking period and take into account the results during that period. This system has merit if used appropriately but the results of the first ever clan war don't matter now, maybe in a historical ranking but does the 1927 Yankees winning the world series have any impact on the 2011 Yankees winning it? Nope. So define a clan year and base the rankings on challenges begun during that year.

You have to look at the history of clans, some are not as active as the others. I simply used all the data to start with. Various other ELO's I used removed older data as we went along. 2 Year RPI was the last 24 months when it was in use.

by **FarangDemon** on Tue Mar 29, 2011 3:18 am

The rankings presented below are based on all clan wars I got from jpcloet and all Clan League (1-3) that had finished at time of posting.

http://killersapp.com/Algorithmof400.htm

Note: I'm not advocating that the particular model with these specific parameters is the best model to use to compare clan performance. There are a lot of parameters that could be tweaked to result in higher accuracy. I used rough estimates for end dates of CL1 and 2. So consider this post as an illustrative example of how to use my script to create and evaluate your own models. More details below.

I've updated the script to:

Graph the top clans' performance over specified time window
List all clan wars, in order of decreasing weight, which contributed to each clan's final rating (Display Basis of Each Clan's Rating)
Compute accuracy of the model, in terms of how often the higher ranked clan actually won
Change parameters to allow rating for a clan war to increase/decrease based on win margin. The model I have provided is just an example. You guys can:
- Add/adjust Data
- Play around with the parameters

by **FarangDemon** on Wed Mar 30, 2011 1:04 am

Dako wrote:Also, if you add criteria with 3 wars completed - then the results should be much better.

I forgot to implement this! So what I've implemented now, if a clan has under 3 wars, any query on their score will result in the average of 1000 and the otherwise regularly computed score. This is what gameknot did, though they may have used a different cutoff value.

So if a clan has only completed 1 war against a clan with 1000 points, they are considered a 1200 instead of a 1400 at that time. This prevents clans from skyrocketing positive or negative by playing just 1 or 2 games, which adversely affected TOFU in the example in my previous post.

This is what it looks like with this fix and using 600 points max delta (if one clan wins/loses 100% of games in a war), 200 points min delta (if a clan barely wins/loses).

(All screenshots from the same model)

Click image to enlarge.

Click image to enlarge.

Click image to enlarge.

As you can see above, this model is a lot more forgiving to higher-ranked clans that take on lower-ranked clans (as all clans that participated in CL3 did). Using the standard implementation of algorithm of 400, a higher ranked clan is automatically penalized even for winning against a clan less than 400 points below them. With this model, they can make up for the point differential by having a high win margin.

Example, in this model TOFU is penalized 200 points for the 80% win against T4C. In my opinion, this is fairer than the 400 point penalty resulting from previous posts' min=400, max=400 standard model (with top-of-post-mentioned fix for clans with less than 3 wars).

Click image to enlarge.

For the accuracy calculation, I've made the interval length a user-defined parameter, so you can play around with different values that you think provide more meaningful results.

This table shows that according to this model, any clan that is about 22-66 points ahead of another has a 67% chance to beat or tie the lower-ranked clan. For an 67-110 point lead, the chance is 83%.

by **Dako** on Wed Mar 30, 2011 2:36 am

Wow, great job. I will look at it again to reread it cause there are a lot of info here.

by **danryan** on Wed Mar 30, 2011 8:38 am

I have to say, this is a very credible model from what I'm seeing so far. For one, it clearly shows what effect your input data has. Secondly, it removes all incentive for "farming" a clan. Lastly, it penalizes a clan (but not too severely) for inactivity, due to the decay factor. I'd like to hear jpcloet's take on it since he has put a lot of work into a lot of different models over the years to try and come up with a mathematical model.

by **jpcloet** on Wed Mar 30, 2011 9:33 am

I have a few question on the final rankings to help clarify.

1) TSM is due to lack of challenges correct?
2) MM seems to be the biggest outlier in my mind, considering the overall quality of opponents. Is S2 playing a part? This is the only real anomaly for me.
3) Is the O&H loss by LOW keeping LOW from the top 3?
4) What does the top 10 look like with a KORT win over THOTA in CCup1?

by **josko.ri** on Wed Mar 30, 2011 12:44 pm

I very like this system of calculation, results are very reliable I think. decay idea is great. only one complaint, you put 32-28 result in TOFU vs KORT challenge in database, which couldnt be valid. for achieving that result TOFU used forbidden bonus according to the official rules of the challenge, so it shouldnt be validated as regular result.

by **Gold Knight** on Wed Mar 30, 2011 12:52 pm

Dako wrote:Wow, great job. I will look at it again to reread it cause there are a lot of info here.

Agreed, someone finally put some colorful pictures up so us peasants can understand... :lol:

It does seem like a pretty close representation of what the clan landscape is IMO. Where the hell have you been in the CLA the past year while we've been bickering over 40 different ranking systems?

by **Leehar** on Wed Mar 30, 2011 1:19 pm

josko.ri wrote:I very like this system of calculation, results are very reliable I think. decay idea is great. only one complaint, you put 32-28 result in TOFU vs KORT challenge in database, which couldnt be valid. for achieving that result TOFU used forbidden bonus according to the official rules of the challenge, so it shouldnt be validated as regular result.

It's already been validated under clan results so I doubt this is really the place for that argument

by **Leehar** on Wed Mar 30, 2011 1:42 pm

jpcloet wrote:I have a few question on the final rankings to help clarify.

1) TSM is due to lack of challenges correct?
2) MM seems to be the biggest outlier in my mind, considering the overall quality of opponents. Is S2 playing a part? This is the only real anomaly for me.
3) Is the O&H loss by LOW keeping LOW from the top 3?
4) What does the top 10 look like with a KORT win over THOTA in CCup1?

I was wondering more along the lines of if cl3 results were taken into account or something for tsm to be so far down?
I'd think the O&H loss would be too far back to have that much waiting, and it's still a surprise that IA is in the top 3 after losing to Low in their most recent full-scale challenge?

by **FarangDemon** on Thu Mar 31, 2011 12:22 am

Admittedly, I'm not the data guy and I don't want to be the data guy. If someone wants to maintain a data file(s) I can link to it from my site.

All you have to do to change/add data on your own:

1) Click the link to the data file from my site (http://www.killersapp.com/algorithmof400.htm)
2) Open it in Excel or notepad
3) Change existing war data / add new entries
4) Paste it into the input box of my site

by **FarangDemon** on Thu Mar 31, 2011 2:57 pm

jpcloet wrote:I have a few question on the final rankings to help clarify.

1) TSM is due to lack of challenges correct?

You can see the basis of the rating for every clan by checking "Display Basis".

Tieing Grim Reapers and losing to BSS and Gen 1 gives TSM an 800 weighted at 17%, so that's what is bringing them down the most. I'm guessing that was CL3, so it's recent. The weight that is applied to each challenge is the number of games in the challenge times the decay factor (a number between 0%-100% based on how long ago the challenge took place). They don't have as much total weight as other clans do, so these CL3 losses/ties affect their score more than a clan with more wars and hence more weight.

I did not include their recent win in CL3 Quarterfinals as the final score was undecided when I put the data file together. This should give them a boost.

jpcloet wrote:2) MM seems to be the biggest outlier in my mind, considering the overall quality of opponents. Is S2 playing a part? This is the only real anomaly for me.

Looks like 39% of their rating weight (losses to BSS, AOD, EMPIRE, Devils Brigade) averages below 800. Three out of four of these losses were brutal, winning 28-33% of games. In this model (min delta = 200, max delta = 600), clans that lose by a larger margin are penalized more.

jpcloet wrote:3) Is the O&H loss by LOW keeping LOW from the top 3?

That O&H loss is weighted at just 5%, but at 599, it is still quite damaging.
Losses to THOTA, KoRT and TOFU weighted at combined 27% are also lowering them a little from where they otherwise would be based on other, lower weighted results (perhaps from past).

jpcloet wrote:4) What does the top 10 look like with a KORT win over THOTA in CCup1?

What margin should I use? It would probably end up #1 KoRT, #2 TOFU.

leehar wrote:I'd think the O&H loss would be too far back to have that much waiting, and it's still a surprise that IA is in the top 3 after losing to Low in their most recent full-scale challenge?

The IA loss to LOW accounts for 12% of IA's rating. It drags us down the most, but we are buoyed by our strong performance in CL3, which is very recent and accounts for a ton of games (90 games in regular season).

by **jpcloet** on Thu Mar 31, 2011 3:34 pm

FarangDemon wrote:Tieing Grim Reapers and losing to BSS and Gen 1 gives TSM an 800 weighted at 17%, so that's what is bringing them down the most. I'm guessing that was CL3, so it's recent.

Yes that was CL3, that mean's this is now a fully inclusive measure. Some will argue the league should not be included. One of the things I did before was to have the final league score against a generic team so that it got some weight as the divisions. More so for CL2.

Eg.

TSM 55 CL3A 45

Algorithm of 400 Clan Ranking Algorithm

Re: Algorithm of 400 Clan Ranking for CL3

Re: Algorithm of 400 Clan Ranking for CL3

Re: Algorithm of 400 Clan Ranking for CL3

Re: Algorithm of 400 Clan Ranking for CL3

Re: Algorithm of 400 Clan Ranking for CL3

Re: Algorithm of 400 Clan Ranking for CL3

Re: Algorithm of 400 Clan Ranking for CL3

Re: Algorithm of 400 Clan Ranking for CL3

Re: Algorithm of 400 Clan Ranking for CL3

Re: Algorithm of 400 Clan Ranking for CL3

Re: Algorithm of 400 Clan Ranking for CL3

Re: Algorithm of 400 Clan Ranking for CL3

Re: Algorithm of 400 Clan Ranking for CL3

Re: Algorithm of 400 Clan Ranking for CL3

Re: Algorithm of 400 Clan Ranking for CL3

Re: Algorithm of 400 Clan Ranking for All Wars (NEW Graphs!)

Re: Algorithm of 400 Clan Ranking for All Wars (NEW Graphs!)

Re: Algorithm of 400 Clan Ranking for All Wars (NEW Graphs!)

Re: Algorithm of 400 Clan Ranking for All Wars (NEW Graphs!)

Re: Algorithm of 400 Clan Ranking for All Wars (NEW Graphs!)

Re: Algorithm of 400 Clan Ranking for All Wars (NEW Graphs!)

Re: Algorithm of 400 Clan Ranking for All Wars (NEW Graphs!)

Re: Algorithm of 400 Clan Ranking for All Wars (NEW Graphs!)

Re: Algorithm of 400 Clan Ranking for All Wars (NEW Graphs!)

Re: Algorithm of 400 Clan Ranking for All Wars (NEW Graphs!)

Who is online