How to Distribute Players Between Teams – Evenly

Share

Facebook
Twitter
LinkedIn

In April 2017, Shenricus, posed a question in the Chandoo.org Forums:

“I have 24 people who each have their own score. I’ve been trying to figure out how I can divide these names into 3 even teams – or as close as possible.”

I answered with a Solver Based solution, and Bosco Yip also added to my solution with a slightly different approach.
This caused me to reconsider my first attempt and finally I posted a Final Solution, which was also a Solver based solution, but was a much more robust solution than my original solution or Bosco Yip’s solution.

This post will examine the thought process used to derive the solution and then implement that using solver.

As always a Sample file is provided so you can follow along: Download Sample File here.

 

Approach

Shenricus gave us a list of 24 players and a score for each player.

The players are Ranked from Best to Worst.

We have no other information as to the Sport or Score.

The question posed by Shenricus is to distribute the players into teams so that each team is “As even as possible”.

Considering that we have 24 players and need to put them into 3 teams, we will assume each team has the same number of players and hence requires 8 players.

My initial though was to setup a Delta or Difference between each Players Score and the Mean (Average of all scores).

First calculate the Average of All the Scores

Then calculate the Differences between the each players Score and the Average

Next we need to distribute each player into one of 3 teams.

Solver will put a value of 1 when a Player is in a Team, and a 0 when the player is not in a Team.

 

Next add a Formula to Calculate the Sum of the Variations from Mean for each Team

and Finally Sum these up

We should be able to get Solver to Minimise this value.

So lets look at how Solver is setup.

 

How Do Use Solver?

Solver is found in the Data, Analyze Tab.

Your screen may look different to mine depending on which version of Excel you are using and if you have your Excel window at a maximum size or not.

If you cannot see it, you may not have Solver Loaded.

 

How Do We Install Solver?

Right Click on any part of the Ribbon

Select Customize the Ribbon

Select Add-ins on the Left menu and

Manage Excel Add-ins in the Manage Dialog and press Go

 

Finally Select Solver and Ok

Solver will now be visible in the Data, Analyze Tab

 

How Do We Setup Solver?

Click anywhere in the model
Goto the Data, Analyze Tab
Select Solver

The Solver Dialog is show as:

Lets look at each of the highlighted sections first and I will discuss this first as a plain English and then I will discuss how it is implemented in Solver

Solver is asking us to Set our Objective, to a Minimum, Maximum or Value, by changing some cells, Subject to some constraints.

Set Objective

Solver is asking what our objective is?
In our Even Teams example we want to minimise the variance in the average Team Scores

By Changing variable Cells

We want to achieve our objective by setting Each Player to be a Member of 1 team
That is Each player must have a 1 in a Column of Team 1, Team 2 or Team 3

Subject to the Constraints

We have a Number of Constraints that our model will be subject to

Each player must have a 1 in a Column of Team 1, Team 2 or Team 3
Each Team must have 8 players
All 24 Players must be used only once each
Each player can only be in a Team, he can’t be shared between teams

Solver operates using a number of techniques to Solve the above problem.
Simplistically it iterates values into the Variable Cells, subject to meeting the constraints.
It measures the output and re-iterates until a better solution is reached.

In Solver Speak

Lets look at how our model is setup in Solver

Objective

The Objective is to Minimise the Sum of the Team Scores
That is to Minimize Cell E27

Variable Cells

We will be changing the allocation of players into each team.

This is the Variable Cells $E$2:$G$25

Subject to the Constraints

The variable cells will be changed by Solver subject to meeting our 4 criteria defined above
a. That each team has 8 players, each cell in $E$29:$G$29 is 8
b. That each player only plays in 1 team, that is cells $E$2:$G$25 can only be 0 or 1 (binary)
c. That all 24 players are used, ie: $H$26 = 24
d. That all 24 players are used only once, each cell in $H$2:$H$25 = 1

We haven’t yet setup Conditions C or D above in our model yet

So add a Column H

H2: =Sum(E2:G2) and copy that down to Row 25

This will add the Total of each Team per Player and should be 1

 

And add up the total of these in H26, This is the Total of all allocated Players and should be 24

H26: =Sum(H2:H25)

In solver setup each of these sections then click Solve

 

After a Minute or so, Solver will return to tell you that it has found a Solution

Select Keep Solver Solution

Lets check things

Firstly we can see that

1. The sum of the Team Scores, E27, is a very small number, as we requested
2. Each player was only used once Column H, True
3. All 24 Players were used H26, True
4. Each Team has 8 players, E29:G29, True
5. Each player is not split between teams, E2:G25, True

So all our Criteria are met, however if we start to look at the solution in more detail we can see that Team 3 has been assigned the Best 8 players, where as Team 1 has mostly the worst players, Team 2 is in the middle.

Solver has solved our problem, but our problem obviously hasn’t been correctly specified.
Solver has setup 2 teams with Low Negative Scores to Offset Team3 with a High Positive score, with the overall result being a low average Team Score

If we look at the Total Scores for each Team, E31:G31


We can see that the Total Team Scores vary between 7.705 and 7.891
A spread of 0.186

What we actually need to specify is that the Variation in these Total Team Scores is Minimised. That is the spread between the 3 scores is minimised.

There are Statistical Measurements called Variance and Standard Deviation
Without going into too much detail, each is a measure of how far a set of numbers are spread out from their average value.
Refer Wikipedia Wikipedia VarianceWikipedia Standard Deviation

Luckily we can easily calculate these using Excel

In cell E33 =STDEV.P(E31:G31)
Excel displays 0.078969

So the Standard Deviation of these 3 Team Scores is 0.0789

However we need to re-run the Solver Model with a new Objective

Firstly, reset all the players to 0, ie Players are not assigned to any Team
Select E2:G25 and type 0 Ctrl Enter

Click anywhere in the model,
Goto the Data, Analyze Tab
Select Solver

Set the Objective to $E$33

The Variable Cells and Constraints remain unchanged

Now Click Solve

After a minute or so, Solver will announce it has a New Solution
Accept that as before

Lets check things

Firstly we can see that

1. The sum of the Team Scores is a very small number, as we requested, Ok
2. Each player was only used once Column H, Ok
3. All 24 Players were used H26, Ok
4. Each Team has 8 players, E29:G29, Ok
5. Each player is not split between teams, E2:G25, Ok

If we look at the solution in more detail we can see that
The three Teams now have a spread of both good and not so good players

But the important thing to notice is that the Standard Deviation of the 3 Team Scores is now 0.001699, or 2.1% of the previous Standard Deviation.

This shows the teams are much more “Evenly” matched

Solver has solved our problem.

Bosco’s Solution

During the thread Bosco proposed an alternative, algebraic solution.

It involved distributing players according to simple rules

The team who got the Best player also took the worst player,

The next team who got the Second best player also took the second worst player

The next team who got the Third best player also took the third worst player, etc

This is shown:

We can see that it also meets all of the constraints of the model, but has a Standard Deviation 0.00368, that isn’t as low as the Solver solution 0.001699.

 

What are these Other Solving Methods?

When you were setting up Solver you may have noticed a dialog asking, Select a Solving Method:

The best discussion I have found on these alternative Solver Techniques is shown on the link below

http://www.engineerexcel.com/excel-solver-solving-method-choose/

 

Closing

We can see how Solver has been used to distribute players according to player ratings and even out teams.

Unfortunately, Shenicus never came back to the forums and so we don’t know how his teams went ?

How have you distributed players or anything else ensuring things are even ?

Let us know in the comments below:

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

Facebook
Twitter
LinkedIn

Share this tip with your colleagues

Excel and Power BI tips - Chandoo.org Newsletter

Get FREE Excel + Power BI Tips

Simple, fun and useful emails, once per week.

Learn & be awesome.

Welcome to Chandoo.org

Thank you so much for visiting. My aim is to make you awesome in Excel & Power BI. I do this by sharing videos, tips, examples and downloads on this website. There are more than 1,000 pages with all things Excel, Power BI, Dashboards & VBA here. Go ahead and spend few minutes to be AWESOME.

Read my storyFREE Excel tips book

Overall I learned a lot and I thought you did a great job of explaining how to do things. This will definitely elevate my reporting in the future.
Rebekah S
Reporting Analyst
Excel formula list - 100+ examples and howto guide for you

From simple to complex, there is a formula for every occasion. Check out the list now.

Calendars, invoices, trackers and much more. All free, fun and fantastic.

Advanced Pivot Table tricks

Power Query, Data model, DAX, Filters, Slicers, Conditional formats and beautiful charts. It's all here.

Still on fence about Power BI? In this getting started guide, learn what is Power BI, how to get it and how to create your first report from scratch.

42 Responses to “Prevent Duplicate Data Entry using Cell Validations”

  1. Jair says:

    Hi Chandoo, I need you help in the following problem.
    I'm trying to get a direccion from a found result. With this dirreccion I will want the before cell value. For example, If result of a find is 38 localized in cell $C$2, I need to get previus value (cell $B$2 ), maybe Andrés.

    Do you know some way to do that?

    Thank you for you help.

  2. Lincoln says:

    Hi Chandoo

    Thanks for this. One thing though: In my pre-2007 version of Excel, the COUNTIF function doesn't recognise a semicolon (;), but requires a comma.

    Is the semicolon an Excel 2007 thing?

  3. Chandoo says:

    Jair... I am not sure I understand what you want. what do you mean by Dirreccion?

    @Lincoln: I am sorry, often I forget that I am using European version of excel where the delimiter is ; instead of ,. I have corrected the formula now.

  4. subbu says:

    Thanks for this nice tip, i used to do a find all after filling every new items which was cumbersome.

    Do you know a way to extend this validation search to other tabs/sheets ?

  5. Jair says:

    Thanks for you attention. I'm trying to get of value continue from a found value. Let me show a example:

    Name Years
    John 35
    Maria 28
    Teresa 32

    If I search the max years, the result is 35, but I need that result to be John. Do you know how I can do it?

  6. Chandoo says:

    @Subbu.. you can easily extend the validation to other sheets by pasting the data validations. See the latest article here: http://chandoo.org/wp/2009/10/28/copy-data-validations/

    @Jair.. you can use the large() or small() formulas to do this. for eg. =index(A1:A3,large(B1:B3,1)) will get you the name of the person with highest "years". More help here: http://chandoo.org/excel-formulas/large.html

  7. Jair says:

    Hi, I don't know if I'm using bad the formula or its performance is diferent for my Office version. Large() formula return the value in the cell, in my example 35. The index() formula use a range, row and column. I'm using the large() as number of row, and it is bad because into the range don't have row 35. This is my perception. What do you think?

  8. Chad says:

    Hello,
    I am trying to attempt data validation in Excel Mobile, but the DV tool isnt available. I want to prevent duplicates is all, any advice on acheiving this in Excel Mobile? Thanks..

  9. Chandoo says:

    @Jair... my french aint that good. it starts at "merci" and ends at "beau coup".

    Anyhow, you need to merge the large with vlookup to do this. I am not sure if you have solved the problem. Otherwise let me know with details and I can write the formula in comments.

    @Chad... I have never used excel mobile, so I have no idea. May be they have not implemented data validations in excel mobile.

    Any excel mobile users out there?

  10. Jair says:

    Hi Chandoo, the proposed solution by JlD is interesting. He created a macro to get values when the matrix is not one dimensional, how on my problem. This fuction for me.
    I would like to share you my work, how can I upload?

  11. Chandoo says:

    @Jair.. sorry for such a delayed reply.. you can upload the files to skydrive and link them here. Or you can email them to me at chandoo.d @ gmail.com and I will upload them somewhere. But it could take forever if you email files to me as I am a bit lazy.

  12. [...] Day 31: Advanced Data Validation Tricks in Excel – Part 2 [...]

  13. Muhammad Moin says:

    Hi,

    Can you help me in Microstrategy?

    Br,
    Moin

  14. Ramprasad says:

    really wonderful article. I feel it is implementing Primary Key concept into spreadsheets.

  15. sriram says:

    Hi article on data validation. Excel is a very versatile platform to work with and we use it for all kinds of data tabulation. In fact this must have been the most rudimentary data management tools I must have worked with and knowing such tips only adds functuionality to our user experience. Great article. looking forawrd to read more.

  16. Vasanth says:

    Hi Chandoo,

    Thanks for such a nice idea.

    I tried copy paste the data into the validated area, but the pop-up msg (warning msg) doesn't came. Is it something that we need to update the data manually each time,.

    Do we have any option where we can bulk upload the number and it throws a warning message that the data already exits and do we want to continue with this ?

    Please do reply me.

    Thank you.

    Regards,
    Vasanth.

  17. kochu says:

    It was really useful chandoo...thanks a lot...

  18. Leo says:

    Tried this in excel 2010 and it did not work?
    Could the newer excel have changed that much?

    • Hui... says:

      @Leo

      It works fine in Excel 2010

      The formula used above =COUNTIF($B$4:$B$11,B4)<=1

      only applies to the range B4:B11

      Did you adjust the range to your data?

  19. Tariq Khan says:

    This page helped me accurately to find solution of my question. thanx

  20. Murli says:

    we want to prevent duplicate entries in three columns combined, using data validation, i.e. say, column A has first name and Column B has middle name, Column C has last name. the first name can be duplicate, middle name can be duplicate, last name can be duplicate, but not all three at the same time.

  21. Murli says:

    I want to prevent duplicate entries in three columns combine, using data validation, i.e. say, column A has first name and Column B has middle name, Column C has last name. the first name can be duplicate, middle name can be duplicate, last name can be duplicate, but not all three at the same time.

  22. KokTiong says:

    Hi, I've tried above validation method to prevent duplicate value from entering into the cells. It's work, when user key in the data into the selected range. However, it's not working when user copy-&-paste the info into the same range.

    Please advice. Thanks. 

  23. ZAMEER SHAIKH says:

    Hi Chandoo,
     
    Does it work in Excel 2007?
     
    Please Reply

  24. mahavir says:

    thanks chandoo........

  25. SUSHOBH says:

    it does not work when data is copy pasted...any solution for this??

  26. shaloo says:

    hi i m shaloo and i want to know in excel if i write duplicate no.then it says or show about we are write duplicate no.

  27. Kris says:

    Hi Chandoo

    I've tried using this with a Named Range, which is actually a column in a Table as DV wont accept a table reference, and it wont work.
    Also tried using Offset to specify the Named Range, but that wont work either.

    Is it possible to use Named Ranges with DV?

    Thanks
    Kris

  28. Paula says:

    I have tried the above formula on a table column. The Error box does not pop up, there is only the small ! next to the cell with the duplicate. The column I am working with is formulas that produce a date. Is the reason it doesn't work that the cells contain formulas rather than data?

  29. Ken says:

    The formula works but only if I enter data in cell above it. So for example, if I have "123" in B11 it does not allow me to enter "123" in B10, B9, B8, etc. But I can still enter "123" in B12. Please help! 🙂

  30. Karan says:

    Great tip.. thanks a lot

  31. I have 21 years of experience working as data entry assistant. I constantly read several blogs to keep myself up-to-date with the advances in data entry profession. I really enjoyed this blog post. From my several years of experience, I agree with you 100% when you say, “ We all know that data validation is a very useful feature in Excel. You can use data validation to create a drop-down list in a cell and limit the values user can enter. ”

    Keep blogging. I will come here again.

    --data entry assistant

  32. HaroonRashid says:

    Hi,
    This is really very helpful.
    Thank you

  33. Junaid says:

    how can i assign two validation on a single cell
    one is for list validation (means the data should be from that range)
    second i want to prevent them from repetition

    how can i do this ?
    P7 to P506 have GR# which are for list
    i want to prevent C column to not to repeat and should be from the P column

  34. Gaurav says:

    friend can any one tell me the formula
    exname location qty
    gaurav 1 1
    rofan 2 5
    sandeep 3 6
    gaurav 4 3
    rofan 5 4
    sandeep 6 8
    gaurav 7 9

    If this is a data.
    if i want a formula by which if i type gaurav then all the location and qty should be shown in a new page.
    i had 5,00,000 sku so if i punch one name i can get the entire details

  35. Gaurav says:

    IF(ISERROR(INDEX($B$3:$C$9,SMALL(IF($B$3:$B$9=$B$12,ROW($B$3:$B$9)-ROW($C$2)),ROW(A1:C1)),2)),"",INDEX($B$3:$C$9,SMALL(IF($B$3:$B$9=$B$12,ROW($B$3:$B$9)-ROW($C$2)),ROW(A1:C1)),2))
    please explain

  36. MD. RASEL SARDER says:

    YOUR COUNTIF FORMULA IS REALLY HELPFUL AND WORKS. I TRIED SEVERAL SITES BUT THEIR FORMULA DOES NOT WORK. ONLY YOU HAVE GIVEN A RIGHT FORMULA!
    THANK YOU!!!!!

Leave a Reply