How to Distribute Players Between Teams – Evenly

In April 2017, Shenricus, posed a question in the Chandoo.org Forums:

“I have 24 people who each have their own score. I’ve been trying to figure out how I can divide these names into 3 even teams – or as close as possible.”

I answered with a Solver Based solution, and Bosco Yip also added to my solution with a slightly different approach.
This caused me to reconsider my first attempt and finally I posted a Final Solution, which was also a Solver based solution, but was a much more robust solution than my original solution or Bosco Yip’s solution.

This post will examine the thought process used to derive the solution and then implement that using solver.

As always a Sample file is provided so you can follow along: Download Sample File here.

 

Approach

Shenricus gave us a list of 24 players and a score for each player.

The players are Ranked from Best to Worst.

We have no other information as to the Sport or Score.

The question posed by Shenricus is to distribute the players into teams so that each team is “As even as possible”.

Considering that we have 24 players and need to put them into 3 teams, we will assume each team has the same number of players and hence requires 8 players.

My initial though was to setup a Delta or Difference between each Players Score and the Mean (Average of all scores).

First calculate the Average of All the Scores

Then calculate the Differences between the each players Score and the Average

Next we need to distribute each player into one of 3 teams.

Solver will put a value of 1 when a Player is in a Team, and a 0 when the player is not in a Team.

 

Next add a Formula to Calculate the Sum of the Variations from Mean for each Team

and Finally Sum these up

We should be able to get Solver to Minimise this value.

So lets look at how Solver is setup.

 

How Do Use Solver?

Solver is found in the Data, Analyze Tab.

Your screen may look different to mine depending on which version of Excel you are using and if you have your Excel window at a maximum size or not.

If you cannot see it, you may not have Solver Loaded.

 

How Do We Install Solver?

Right Click on any part of the Ribbon

Select Customize the Ribbon

Select Add-ins on the Left menu and

Manage Excel Add-ins in the Manage Dialog and press Go

 

Finally Select Solver and Ok

Solver will now be visible in the Data, Analyze Tab

 

How Do We Setup Solver?

Click anywhere in the model
Goto the Data, Analyze Tab
Select Solver

The Solver Dialog is show as:

Lets look at each of the highlighted sections first and I will discuss this first as a plain English and then I will discuss how it is implemented in Solver

Solver is asking us to Set our Objective, to a Minimum, Maximum or Value, by changing some cells, Subject to some constraints.

Set Objective

Solver is asking what our objective is?
In our Even Teams example we want to minimise the variance in the average Team Scores

By Changing variable Cells

We want to achieve our objective by setting Each Player to be a Member of 1 team
That is Each player must have a 1 in a Column of Team 1, Team 2 or Team 3

Subject to the Constraints

We have a Number of Constraints that our model will be subject to

Each player must have a 1 in a Column of Team 1, Team 2 or Team 3
Each Team must have 8 players
All 24 Players must be used only once each
Each player can only be in a Team, he can’t be shared between teams

Solver operates using a number of techniques to Solve the above problem.
Simplistically it iterates values into the Variable Cells, subject to meeting the constraints.
It measures the output and re-iterates until a better solution is reached.

In Solver Speak

Lets look at how our model is setup in Solver

Objective

The Objective is to Minimise the Sum of the Team Scores
That is to Minimize Cell E27

Variable Cells

We will be changing the allocation of players into each team.

This is the Variable Cells $E$2:$G$25

Subject to the Constraints

The variable cells will be changed by Solver subject to meeting our 4 criteria defined above
a. That each team has 8 players, each cell in $E$29:$G$29 is 8
b. That each player only plays in 1 team, that is cells $E$2:$G$25 can only be 0 or 1 (binary)
c. That all 24 players are used, ie: $H$26 = 24
d. That all 24 players are used only once, each cell in $H$2:$H$25 = 1

We haven’t yet setup Conditions C or D above in our model yet

So add a Column H

H2: =Sum(E2:G2) and copy that down to Row 25

This will add the Total of each Team per Player and should be 1

 

And add up the total of these in H26, This is the Total of all allocated Players and should be 24

H26: =Sum(H2:H25)

In solver setup each of these sections then click Solve

 

After a Minute or so, Solver will return to tell you that it has found a Solution

Select Keep Solver Solution

Lets check things

Firstly we can see that

1. The sum of the Team Scores, E27, is a very small number, as we requested
2. Each player was only used once Column H, True
3. All 24 Players were used H26, True
4. Each Team has 8 players, E29:G29, True
5. Each player is not split between teams, E2:G25, True

So all our Criteria are met, however if we start to look at the solution in more detail we can see that Team 3 has been assigned the Best 8 players, where as Team 1 has mostly the worst players, Team 2 is in the middle.

Solver has solved our problem, but our problem obviously hasn’t been correctly specified.
Solver has setup 2 teams with Low Negative Scores to Offset Team3 with a High Positive score, with the overall result being a low average Team Score

If we look at the Total Scores for each Team, E31:G31


We can see that the Total Team Scores vary between 7.705 and 7.891
A spread of 0.186

What we actually need to specify is that the Variation in these Total Team Scores is Minimised. That is the spread between the 3 scores is minimised.

There are Statistical Measurements called Variance and Standard Deviation
Without going into too much detail, each is a measure of how far a set of numbers are spread out from their average value.
Refer Wikipedia Wikipedia VarianceWikipedia Standard Deviation

Luckily we can easily calculate these using Excel

In cell E33 =STDEV.P(E31:G31)
Excel displays 0.078969

So the Standard Deviation of these 3 Team Scores is 0.0789

However we need to re-run the Solver Model with a new Objective

Firstly, reset all the players to 0, ie Players are not assigned to any Team
Select E2:G25 and type 0 Ctrl Enter

Click anywhere in the model,
Goto the Data, Analyze Tab
Select Solver

Set the Objective to $E$33

The Variable Cells and Constraints remain unchanged

Now Click Solve

After a minute or so, Solver will announce it has a New Solution
Accept that as before

Lets check things

Firstly we can see that

1. The sum of the Team Scores is a very small number, as we requested, Ok
2. Each player was only used once Column H, Ok
3. All 24 Players were used H26, Ok
4. Each Team has 8 players, E29:G29, Ok
5. Each player is not split between teams, E2:G25, Ok

If we look at the solution in more detail we can see that
The three Teams now have a spread of both good and not so good players

But the important thing to notice is that the Standard Deviation of the 3 Team Scores is now 0.001699, or 2.1% of the previous Standard Deviation.

This shows the teams are much more “Evenly” matched

Solver has solved our problem.

Bosco’s Solution

During the thread Bosco proposed an alternative, algebraic solution.

It involved distributing players according to simple rules

The team who got the Best player also took the worst player,

The next team who got the Second best player also took the second worst player

The next team who got the Third best player also took the third worst player, etc

This is shown:

We can see that it also meets all of the constraints of the model, but has a Standard Deviation 0.00368, that isn’t as low as the Solver solution 0.001699.

 

What are these Other Solving Methods?

When you were setting up Solver you may have noticed a dialog asking, Select a Solving Method:

The best discussion I have found on these alternative Solver Techniques is shown on the link below

http://www.engineerexcel.com/excel-solver-solving-method-choose/

 

Closing

We can see how Solver has been used to distribute players according to player ratings and even out teams.

Unfortunately, Shenicus never came back to the forums and so we don’t know how his teams went ?

How have you distributed players or anything else ensuring things are even ?

Let us know in the comments below:

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

Facebook
Twitter
LinkedIn

Share this tip with your colleagues

Excel and Power BI tips - Chandoo.org Newsletter

Get FREE Excel + Power BI Tips

Simple, fun and useful emails, once per week.

Learn & be awesome.

Welcome to Chandoo.org

Thank you so much for visiting. My aim is to make you awesome in Excel & Power BI. I do this by sharing videos, tips, examples and downloads on this website. There are more than 1,000 pages with all things Excel, Power BI, Dashboards & VBA here. Go ahead and spend few minutes to be AWESOME.

Read my storyFREE Excel tips book

Overall I learned a lot and I thought you did a great job of explaining how to do things. This will definitely elevate my reporting in the future.
Rebekah S
Reporting Analyst
Excel formula list - 100+ examples and howto guide for you

From simple to complex, there is a formula for every occasion. Check out the list now.

Calendars, invoices, trackers and much more. All free, fun and fantastic.

Advanced Pivot Table tricks

Power Query, Data model, DAX, Filters, Slicers, Conditional formats and beautiful charts. It's all here.

Still on fence about Power BI? In this getting started guide, learn what is Power BI, how to get it and how to create your first report from scratch.

67 Responses

  1. Sure it’s a nice new command. It would be useful if everyone had access to it. But if there is any chance you will be sharing the file with someone who has a onetime payment Office license, or an older version of Office you can’t use it.

    1. That is my biggest gripe with many new features MS is launching. With such vast userbase and existing spreadsheet “systems”, all of these formulas are going to create more trouble than imagined. That said, we should learn new things, especially if you move to a new job chances are you will be using a different version of Excel there.

      1. I love to learn new things, like this new command. But I can’t afford, literally don’t have the money, to keep paying for 365.

        This is the thing that especially offends me about the Office 365 pricing scam/scheme. Sure, if they want to milk more money from users using the rental scam, fine I know I don’t have to fall for it. But restricting new “features”, like new commands to 365 is offensive. It makes one-time payment users “second class” customers, especially anyone who has paid for Office 2019. At least in the past new features/commands came only came out every few years, with new versions so there was some logic to the separation. But now the new features are coming every few months and there is no real separation between 2019 and 365, but still they limit the new features to 365. Even 2016 is close enough. MS “accidentally” pushes a few new features to 2016, when they feel like it or when they are too lazy to do the extra work to prevent them from going to 2016.

        1. I agree with Ron I have MS Office 2019 which I used for Charity work but a pensioner I find the cost of the MS365 unaffordable. Perhaps there is some way for a Ms Guru to perhaps create 3rd party update for the stand alone versions.
          I will however continues with Ms 365 this year as I have just renewed the subscription
          thanks very much for keeping us abreast of latest developments and also the excel community for their useful feed back
          regards Brian 18/03/2024

          1. Good point. I suggest using the free MS Office online (you just need onedrive account) to maintain old files and work on them. The only limitation is that it is browser based, so you won’t be able to do many advanced things. But it is better than the alternative of shelling out $100+ every year.

  2. Yes, of course this is the latest and excellent update from Microsoft but this feature will take years to come in the market because most of the people or offices are still using Office 2007 or 2013.

    1. Hmm, the base scenario is similar to index+match, but XLOOKUP makes life simple with single formula and default “exact match” setup. Plus I find the “lookup from last” and “less than” “greater than” options very useful and less cryptic than MATCH options.

  3. Thanks for sharing, it added some excitement to my Friday morning! I don’t have 365 but am still excited to be aware of the existence of these features! I know that vlookup on larger sets of data can really take up some resources–it makes sense, it’s performing a lot of operations for us while we sit and sip on coffee. 😉 However, I’m wondering if you’ve you noticed a difference in performance with xlookup? Is it slower, faster, or pretty much the same in terms of calculation speed?

    1. I haven’t tested it against VLOOKUP or INDEX+MATCH. If anything, I would guess that the performance should be similar as they could all use same logic internally. I will try this and share some outcomes later.

      1. I would love to know the results. We’re crunching a ton of data and I love the simplicity of XLOOKUP, but we can’t handle the sluggishness of VLOOKUP. I hope XL is faster!!!

        1. I believe XLOOKUP has been written to deliver exact matches at the same speed as a binary (vlookup’s approximate) search.

  4. A nice addition to the function list. Very usefull and easier to use then INDEX + MATCH.
    Since XLOOKUP is in beta testing, it would be great if Microsoft development team added a 5th. argument: if_na. That is: if XLOOKUP returns #N/A, an alternate value could be returned instead. Therefore, it wouldn’t be necessary to do =IFNA(XLOOKUP(…), value_if_na).

    1. Good idea. But I feel this can be a dangerous precedent as no other formula in Excel has fail-safe option (other than IFERROR and IFNA ofcourse). So may be leave it to return error.

  5. Although I don’t have and expecting to be around soon in EXCEL 2019, my question is there a way to work around the new function “xlookup” but not the old ones.
    However it is appreciated tip,thanks

      1. Here is an idea.
        If you wish to analyse data for a given month, the relevant portion of the Sales table (sorted by date) is given by
        = XLOOKUP( EOMONTH(month,0), EOMONTH(+sales[Date],0), sales,0,1 ) :
        XLOOKUP( EOMONTH(month,0), EOMONTH(+sales[Date],0), sales,0,-1 )
        which can be referred to as a named formula ‘selected’. Being a reference to the original table, range intersection with columns works. Hence
        = XLOOKUP( MAX(selected sales[Net Sales]),
        selected sales[Net Sales], selected sales[Sales Person] )
        provides an answer to
        Who had most sales for February?

        Caution: The formula requires 7 separate searches of the data but they are very fast.

    1. @Hamish… you should be able to use any reference styles that work with other formulas in XLOOKUP. So yes for names, structural, cell and references to other sheets / workbooks.

    2. Hamish, Yes it all works perfectly. That includes cases in which the data table does not comprise raw data but rather is made up of dynamic arrays. Naming the anchor cell of each dynamic array allows expressions such as
      = XLOOKUP( MAX(selectedNetSales#), selectedNetSales#, selectedSalesPerson# )

      Conversely, if the returned field is comprised of anchor cells for separate dynamic lists (e.g. employment data for the specified salesman) then the list can be returned by adding ‘#’
      =XLOOKUP(0,sales[Net Sales],EmployeeInfo,1)#

  6. Since the documentation says it returns a reference array, could you write formulas that could answer questions that need to perform a function upon a result set that contains multiple rows such as:

    1. What is the total Profit/Loss for SalesPersons named [Jamie]?
    2. What is the MAX/MIN Net Sales for SalesPersons named [Jamie]?
    3. What was the Average Net Sales for everyone that had exactly [8] Customers?

    1. I think the answer to your question is ‘no’ unless you are willing to sort the table so that the records you wish to aggregate form a continuous range. That is, the formula
      = SUM(
      XLOOKUP(salesPerson,sales[Sales Person],sales[Profit / Loss],,,1):
      XLOOKUP(salesPerson,sales[Sales Person],sales[Profit / Loss],,,-1))
      only works if the data is sorted by Sales Person.

      Otherwise it looks like SUMIFS (and similar) offers the best solutions with FILTER a close second.
      = SUMIFS( sales[Profit / Loss], sales[Sales Person], salesPerson )
      = SUM( FILTER(sales[Profit / Loss], sales[Sales Person]=salesPerson ) )

  7. XLOOKUP allows us to look for a variable in a column and return a value from a row: combining VLOOKUP ad HLOOKUP in essence.

    I watched a video last night in which the presenter showed an example that returned an error. The solution that the presented was using is this: =XLOOKUP(A4,B7:B9,C6:E6)

    To see the problem in action, put a b c in the range B7:B9 and 1 2 3 in the range C6:E6 and in A4 enter a or b or c

    I solved this problem in this way:
    =XLOOKUP(A12,B15:B17,TRANSPOSE(C14:E14))

    I have also set up a financial analysis example in which I wanted to find, for every line item in an income statement, which month was exactly equal to the mean of that row or which was immediately below the mean or immediately above it. Or Median, or Standard Deviation …

    I used XLOOKUP() and IFS() together with Data Validation (although that is optional) and while the formula is a little unwieldy, again I am effectively combining vertical and horizontal lookups.

  8. Hi,

    Can you please tell me if there is any way to return multiple values with a single match.
    Thanks in Advance

    1. Never.

      “New features” like the XLookUp() command are only added to Office 365. They will never be added to Office 2019. They may show up in Office V-Next, when ever it comes out, in the near future. MS has not yet announced a new version. If they follow the pattern in the last few versions that would be fall 2021. But that is only a guess.

  9. I downloaded your sample spreadsheet and three of your first seven examples are incorrect. Then I stopped.

      1. Hi, Chandoo.
        Great tips, thanks!

        In example #11, “What is the ‘net sales’ for Johnson? = 1540” the formula only takes into account the first match for Johnson (D10)?
        In row 21 Johnson appears again so the correct answer should be 4192 (D10 + D21).
        Imagine a DB with hundreds of records!
        How can we deal with duplicates using XLOOKUP?
        Thanks.

  10. Is there an easy way to handle if the cell is blank in the data table to prove the result of a blank? With VLOOKUP, previously to get this result, I had to do:

    =IF(VLOOKUP($B2,data,6,FALSE)=””,””,VLOOKUP($B2,data,6,FALSE))

    I am hoping that I don’t have to resort to the same lengthy format. I did try the “Value Not Found” example you provided (love it). However that is when the search value is not listed, not when the search value is found and the result value is a blank cell.

    Thanks for everything you do!!!!

    1. Hi Sherry,

      Are you using the IF formula to show “” instead of 0 ?

      If so, you can use this structure

      =XLOOKUP($B$2, data[col1], data[col6]) & “”

      This will force 0 to convert to empty space. It won’t impact other results though, (assuming column 6 is text)

  11. A bit longer, but to force the ‘value not found’ you could remove the entry from the lookup array
    = XLOOKUP(lookupValue,
    IF(data[col6]””, data[col1]),
    data[col6], “Missing data”)

  12. Hi Chandoo,

    I’ve been waiting for this function for months so that I could replace all my INDEX / MATCH / MATCH statements. However, I have hit a snag with using nested XLOOKUPs as replacements. If the inner XLOOKUP can’t find a value, then whatever value I specify as the [if not found] value causes the outer XLOOKUP to fail and return #VALUE. So the [if not found] functionality works if a single XLOOKUP can’t find the search value, but it causes nested XLOOKUPs to fail. Can you see any way around that?

    Thanks

    1. Hey Stuart… Can you share an example of what result you are expecting in nested case? One option is to use a single IFERROR outside all the nested functions.

  13. @Stuart

    Do not limit yourself to thinking of [if_not_found] as being a text string, e.g. “Oops”; it can be a formula in its own right, returning a default row from the original table or even a lookup from an alternative table.

    What it must return is an array in order to form a valid parameter for the outer XLOOKUP.

    1. Hi Peter,

      You’ve got it! As you suggest, by setting the inner XLOOKUP to return an array full of zeroes (or whatever) solves the problem. The outer XLOOKUP can of course just have 0, or whatever, stated its if_not_found value.

      I am surprised that I haven’t come across this issue or solution anywhere else. There are lots of blogs / videos which mention using nested XLOOKUPs as a replacement for INDEX / MATCH / MATCH. I can’t say I’ve read or watched them all, but the ones I have don’t mention this issue. I suspect there are / will be a lot of people getting #N/As or, worse, #VALUES depending on what they specify as the inner function’s if_not_found.

      Thanks for your help!

  14. I am trying to lookup a date and name and return the number of hours from another worksheet? If I’m mixing text and dates, will this still work?

  15. Great article. But,…two questions:
    1) I do have Office 365. Yet, the XLookup is not recognized by Excel. Your sample file displays a #NAME? Why?
    2) In your samplefile you have a leading ‘_xlfn.’ in front of the formula. Why is that?

    1. Hi Michael…

      Can you confirm what is your current version of Excel is? Also see if you can update to newer version. You can do both from File > Account.

  16. My values that I want to join are not exact, i.e.

    000025868 and 0000258 68 Total

    Is there a way to join the data?

    1. Interesting. Assuming the space is in the lookup column, try this:

      =xlookup(“000025868″, substitute(lookup_col, ” “,””), result_col)

  17. Getting a #N/A as the results.

    Is there a way to convert “0000258 68 Total” to 000025868 (or visa versa) before I run the =XLOOKUP?

    1. If you just want to remove the word “total” at the end, use SUBSTITUTE for that. If there can be other words, you are better off first running the data thru Power Query so you can clean it.

    2. One thing that is possible is to take a numeric lookup value and convert it to text before searching a text lookup array. For example
      = XLOOKUP(TEXT( value, “0000000\?00\*” ), array, return, , 2 )
      will perform a search with wildcards that allow “Total” to be appended or any character to be inserted two digits before the end of the number.

      That would pick up
      “0000258 68 Total”
      but you would need an alternative test to match the number 25868, itself.

  18. Check the reference, while selecting data the xlookup function automatically starts from new line. Try changing it to the first row and it would work.

  19. Hi Chandoo,
    I have 2 sheets with 5 columns. data in columns A:C is similar except that changes are made in columns A and C. I want to lookup in column C in Sheet2 and update Sheet1 columns A:C.
    for example
    Sheet1
    ColA ColB ColC
    123 AB12 One
    234 BC23
    323 CB22 Six

    Sheet2
    ColA ColB ColC
    123 AB12 One
    234 BB22 Two
    323 CB22 Six

  20. I don’t think we can claim that XLOOKUP “replaces” INDEX+MATCH. Yes, it provides a suitably powerful alternative, and is absolutely a full replacement for VLOOKUP and HLOOKUP, but it can’t easily play some of the “math” games that are possible with INDEX+MATCH and sometimes even necessary when the data isn’t in a convenient layout.

    What if you needed the row above or below the match or if the data was laid out in repeating sections where you first needed to know the location of the section header and then the location of a given item within each section? Both of those problems can be solved with plus/minus shifting of the number returned from the MATCH.

    So I would argue that INDEX+XMATCH are the true replacement for INDEX+MATCH, thus taking full advantage of the X — defaulting to exact matches, virtual sorting, and so on — while preserving the ability to “shift” the match as needed.

  21. I’m looking for a price in a multiple column price list. With Vlookup, I specified the entire table and for the column, looked at the user selected model/column. In Xlookup, how to specify the column number and the range up and down or can I just specify the column number only?

    1. One advantage that VLOOKUP retains over XLOOKUP is the ability to supply a lookup column number dynamically, as a purely numerical result of a calculation. To replicate this functionality using XLOOKUP, you would need seperate logic to calculate the column reference (i.e. the column’s number, range name or range address) and pass it to the XLOOKUP formula. You could do this inside the XLOOKUP function by setting up the 3rd param of XLOOKUP to be based on your “user selected model/column”.

  22. Using Xlookup with “match mode” = -1 and “if not found” = “ABC”

    Now if the lookup value is not found in the lookup_array excel gives the the highest value from the return_array.
    This is not what I expect from xlookup.
    It should return “ABC”

    Can you explain why?

  23. Chandoo,
    I am having trouble with XLookUp. How do I get it to return multiple values such as employees with salary greater than $45,000 or to sum all the sales in the East region? Are these more pivot table inquires?

    Is XLOOKUP more useful for finding one record than multiple records?

    Thank you,
    Jennifer Jeffords

  24. Hi Chandoo,

    Is it possible to use XLOOKUP to return a status such as “Checked” and “NoCheck”(something similar to IF stmt)

    Thank you.

  25. I used the index and match to look up the hourly rate for a job classification as a part of a drop down. Now, I want to calculate the hourly rate multiplied by hours worked and the cell will not calculate. What might be the problem? The results cell of the look-up is formatted to be currency?

  26. You show return array can be more than 1 column but what about Look up array? What if I want to find a value than can be in 1 of 3 columns and then return one value from another column.

    1. You can use XLOOKUP for such things too.

      For example, if you have three columns: home phone, cell phone and email address
      and a column with customer name
      and you want to lookup the name of the customer when you specify any value from one of those 3 columns,
      you can use the below XLOOKUP.

      =XLOOKUP(TRUE,BYROW(C3:E22=I2,LAMBDA(a, OR(a))), B3:B22, “No record found!”)

      Here I2 contains the search criteria (either home phone, cell phone or email)
      B3:B22 have names
      C3:E22 have the home / cell / email values

  27. Hi my name is Musawir Rasool i am from India in a state of jammu and Kashmir I love watching your videos and lot from your videos
    Thanks
    And one more can u teach me full power bi?

  28. Hi Chandoo,
    I was referring to your xlookup-examples file, and in that I saw your formula for Sl. 8 – Who has least sales? You wrote formula =XLOOKUP(0,sales[Net Sales],sales[Sales Person],,1) but I think a more better way would be to write =XLOOKUP(MIN(sales[Net Sales]),sales[Net Sales],sales[Sales Person],,1). This is because your formula would not reliable unless you’re specifically looking for a salesperson who has exactly 0 in sales, which is not the same as the least sales — unless 0 happens to be the lowest. Also, the 1 as the last argument means “approximate match in ascending order,” which could return wrong results if 0 isn’t found.

Leave a Reply

Your email address will not be published. Required fields are marked *

This site uses Akismet to reduce spam. Learn how your comment data is processed.