Are You Trendy? (Part 2)


Forecasting using Excel Functions

“Today’s forecast will be Hot and Humid with a Chance of Snow?”

(Even the experts with big computers get it wrong)

In the previous post we looked at Manual Forecasting techniques and how Excel can be used to assist. In this post we will look at how we can use Excel’s built-in functions to aid us in forecasting.

This post starts slowly and then delves deeper into some of Excel’s Statistical Functions. You are encouraged to follow along at your own pace and use the examples in the Examples Workbook attached.

All charts, tables and diagrams in this post with the associated Excel formulas are included in the Example workbook.

In this post I will be using the following nomenclature

^ means raise to the power eg: 10^2 = Power(10,2) = 100

.  means multiply eg: 10.2.m.x = 10.2 * m * x

Why do we need to use Excel Functions?

In the first post we looked at some simple data with only a few points and a trend that was fairly obvious. Or was it?

A number of other linear trends could have equally been used and all look about right.

However, in real life data is rarely this simple.

Fortunately Excel has a number of Functions and Tools that allow us to look for trends and use the data directly for forecasting purposes.

There are a number of standard types of trends which can be classified as:

Linear – Approximating a straight line

Polynomial – Approximating a Polynomial function to a power

Power – Approximating a power function

Logarithmic – Approximating a Logarithmic line

Exponential – Approximating an Exponential line

Excel supports the use of these trend types in a number of ways.

Excel Functions and Tools

Excel has a number of Worksheet functions specifically designed to assist us with analysing various trends.

They are categorised by type below

Excel Functions for Linear Trends

  • Slope
  • Intercept
  • Linest
  • Trend
  • Forecast

Excel Functions for Exponential Trends

  • Logest
  • Growth

Other Excel Tools

  • Excel Chart + Trendline


USING EXCEL’S WORKSHEET FUNCTIONS


Linear Estimates

In the first Post we looked at using a linear equation in the form Y = m.X + c to express our line of best fit, which we estimated manually and assumed was linear.

Excel has 2 functions which we can use to calculate the actual slope (m) and intercept (c) for the above equation.

Slope

The Slope function returns the slope or gradient of the linear regression line through data points in Known_Y’s and Known_X’s.

eg: =SLOPE(Known Y values, Known X values)

Intercept

The Intercept function calculates the point at which a linear regression line will intersect the Y-axis by using existing X-values and Y-values.

eg: = INTERCEPT (Known Y values, Known X values)

Use

To use the above 2 functions we simply enter 2 formulas in cells

m = SLOPE(C47:C51, B47:B51)                     = 1.298

c = INTERCEPT(C47:C51, B47:B51)            = 0.140

We can now use our revised linear equation to plot a line of best fit

Y = m.X + c

Y = 1.298.X + 0.140

So for

X = 5, Y= 6.63 &

X = 30, Y = 39.07

Which we can plot as a new series on our chart
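If you are curious what Slope and Intercept are working out, the short Python sketch below reproduces the standard least squares calculation for a straight line. The data points are made up for illustration (they are not the workbook's values) and the function name slope_intercept is my own, so treat it as a sketch rather than a copy of Excel's internals.

```python
def slope_intercept(xs, ys):
    """Least squares slope (m) and intercept (c) for the line Y = m.X + c."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of cross-deviations and sum of squared X deviations
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    m = sxy / sxx
    c = mean_y - m * mean_x
    return m, c


# Made-up sample points, NOT the data from the example workbook
xs = [1, 2, 3, 4, 5]
ys = [2.1, 3.9, 6.2, 7.8, 10.1]

m, c = slope_intercept(xs, ys)
y_at_6 = m * 6 + c  # use the fitted line to estimate Y for a new X
print(round(m, 3), round(c, 3), round(y_at_6, 3))
```

The same m and c are what you would then plug into Y = m.X + c to plot the line of best fit.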


Linest

The Linest function can be used to calculate the Slope and Intercept parameters for a linear function

Linest returns an array of values, so it must be entered as an array formula to return all the values that it can return.

Eg:   = LINEST(Known Y Values, Known X Values, Const, Stats)

Const    = True; The b (intercept) parameter is calculated

= False; b is set to 0 (zero)

Stats    = True; Return additional regression statistics

= False; Return only the m co-efficient and the constant b

Entered in a single cell, =LINEST(C47:C51,B47:B51,TRUE,FALSE) will return only the Slope (m) component of the equation

To return both components, select 2 adjacent cells in the same row, type the formula once and enter it as an array formula

Eg: = LINEST(C47:C51, B47:B51, TRUE, FALSE) Ctrl Shift Enter

            Slope (m)    Intercept (c)
Linest      1.298        0.140

Alternatively the values can be retrieved from the Linest array function using the Index function

Gradient m =INDEX(LINEST(C47:C51, B47:B51, TRUE, FALSE),1)

Intercept c =INDEX(LINEST(C47:C51, B47:B51, TRUE, FALSE),2)

The use of the Index function negates the requirement to use an Array Entered formula.

Stats

Linest can also return a number of statistics when Stats parameter is set to True

Eg: =LINEST(C47:C51, B47:B51, TRUE,TRUE) Ctrl Shift Enter

This must be entered as an array formula of 2 columns by 5 rows

The formula can also be entered as a normal formula by using the Index function to extract the array values

Eg:          = INDEX( LINEST($C$47:$C$51, $B$47:$B$51, TRUE, TRUE), Row ,Column)

If you want to know the r2 value (discussed later) it is in the 3rd row, 1st column.

Eg:          = INDEX( LINEST($C$47:$C$51, $B$47:$B$51, TRUE, TRUE), 3 , 1)

The above table shows the statistic and the value for our example above using both array entered and Index formulas

The r2 parameter highlighted will be discussed later.

Trend

The Trend function is used to calculate a straight line of best fit based on a number of known X & Y values.

Values of Y can be calculated for values of X inside or outside the known range of X values and so Trend can be used to interpolate or extrapolate data.

eg:          = TREND (known Y values, known X values, New X Value, Const)

Const    = True; Calculate the Intercept value

= False; Set the Intercept value c = 0

For example, say you are using this to model your power cost.

If you have a fixed monthly cost plus a cost per kW, you would set Const to True

If you have no fixed monthly cost and are only charged per kW, set Const to False

eg:          =TREND($C$101:$C$105,$B$101:$B$105,B106,TRUE)
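To see what the Const switch changes, here is a small Python sketch of the same idea. The trend function, the power bill figures and the variable names are all my own inventions for illustration; it assumes Const = False means a least squares line forced through the origin, which is how Excel's Help describes it.

```python
def trend(known_ys, known_xs, new_x, const=True):
    """Estimate Y at new_x from a least squares line, with or without an intercept."""
    if const:
        n = len(known_xs)
        mean_x = sum(known_xs) / n
        mean_y = sum(known_ys) / n
        sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(known_xs, known_ys))
        sxx = sum((x - mean_x) ** 2 for x in known_xs)
        m = sxy / sxx
        c = mean_y - m * mean_x
    else:
        # Line forced through the origin (c = 0): m = sum(x.y) / sum(x^2)
        m = sum(x * y for x, y in zip(known_xs, known_ys)) / sum(x * x for x in known_xs)
        c = 0.0
    return m * new_x + c


# Hypothetical power bills: a fixed charge of 10 plus 0.25 per kW
kw = [100, 200, 300, 400]
cost = [35, 60, 85, 110]

with_fixed_charge = trend(cost, kw, 500, const=True)
no_fixed_charge = trend(cost, kw, 500, const=False)
print(with_fixed_charge, round(no_fixed_charge, 2))
```

Because this made-up data really does contain a fixed charge, forcing the line through zero gives a noticeably different estimate, which is why choosing the right Const setting matters.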

Forecast

The Forecast function is used to calculate a straight line of best fit based on a number of known X & Y values.

Values of Y can be calculated for values of X inside or outside the known range of X values and so Forecast can be used to interpolate or extrapolate data.

eg:      = FORECAST (New X Value, Known Y values, Known X values)

= FORECAST(B129, $C$124:$C$128, $B$124:$B$128)


Non-Linear Estimates

So far our examination of trends has revolved around the use of linear estimates and the Excel functions that support that.

But as we saw above there are lots of cases where non-linear estimates are required.

This section will deal with the following estimate types.

  • Polynomial – Approximating a Polynomial function, y = a.x^n + b.x^(n-1) + c.x^(n-2) + … + m
  • Power – Approximating a Power function, y = a.x^b
  • Logarithmic – Approximating a Logarithmic line, y = b.ln(x) + a
  • Exponential – Approximating an Exponential line, y = b.m^x

Luckily Excel has a number of functions and some tools to assist us here as well.

Exponential Functions

Exponential functions are based around the formula y = b.m^x

Excel has one function specific to growth estimates and that is the Logest function.

As with Linest, Logest is an array function.

eg:     =LOGEST(Known Y’s, Known X’s, Const, Stats)

=LOGEST(C6:C13, B6:B13, true, false)  Ctrl Shift Enter

Const    = True or omitted; The b parameter is calculated

= False; b is set to 1

Stats    = True; Return additional regression statistics in an array

= False; Return only the m co-efficient and the constant b

Alternatively the values can be retrieved from the Logest array function using the Index function

m = INDEX( LOGEST( C6:C13, B6:B13, True, False), 1)

b = INDEX( LOGEST(C6:C13, B6:B13, True, False), 2)

The use of the Index function negates the requirement to use an Array Entered formula.

However, Logest is a tricky function as it actually just passes values to the Linest function!

So we can actually use the Linest function for doing nearly all of our Exponential, Logarithmic and Power function trends.

But you ask “Doesn’t Linest give us the parameters for a straight line?”

Absolutely.

To use Linest to analyse an Exponential function we need to unwrap it, so to speak, and that is done by taking the natural Log (LN) of the Y values prior to putting them into the Linest equation, like this:

Form:    = LINEST( LN(Known Y Values), Known X Values)

eg:          = LINEST( LN(C32:C39), B32:B39) Which is an array formula

or            = INDEX( LINEST( LN(C32:C39), B32:B39), 1) as a normal formula

Now the tricky part is that both values come back as natural logs: array parameter 1 is LN(m) and array parameter 2 is LN(b). Each must be converted back using the Exp function, so m =EXP( INDEX( LINEST( LN(C32:C39), B32:B39),1)) and b =EXP( INDEX( LINEST( LN(C32:C39), B32:B39),2))

This is difficult to explain but is shown in a worked example on the Exponential Functions section of the Non-linear Functions page of the example workbook attached.
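The unwrapping trick may be easier to follow in a few lines of Python. This is only a sketch: the helper names fit_line and fit_exponential are mine, and the data is made up so that it sits exactly on y = 3 . 2^x.

```python
import math


def fit_line(xs, ys):
    """Least squares slope and intercept of a straight line."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    m = sxy / sxx
    return m, mean_y - m * mean_x


def fit_exponential(xs, ys):
    """Fit y = b.m^x by fitting a straight line to LN(y) against x."""
    ln_m, ln_b = fit_line(xs, [math.log(y) for y in ys])
    # Both results are logs, so convert each back with exp
    return math.exp(ln_m), math.exp(ln_b)


# Made-up data lying exactly on y = 3 . 2^x
xs = [0, 1, 2, 3, 4]
ys = [3, 6, 12, 24, 48]

m, b = fit_exponential(xs, ys)
forecast = b * m ** 5  # estimate y for a new x of 5
print(round(m, 6), round(b, 6), round(forecast, 6))
```

Taking LN of y = b.m^x gives LN(y) = x.LN(m) + LN(b), which is a straight line in x, and that is why a linear fit can do the job.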

Growth

The Growth function can be used to calculate an exponential curve that best fits your data based on a number of known X & Y values.

Form:    = GROWTH(Known Y Values, Known X Values, New X Values)

eg:          = GROWTH($C$32:$C$39, $B$32:$B$39, B40) as a normal formula

This is also shown in a worked example on the Exponential Functions section of the Non-linear Functions page of the example workbook attached.

Logarithmic Functions

Logarithmic functions are based around the formula y = b.LN(x)+a

Excel doesn’t have a specific function dealing with Logarithmic functions, however we can use the Linest function as previously described by first converting the data from a Logarithmic to a Straight line, and this is done by taking the LN of the X values.

Form:    = LINEST( Known Y Values, LN(Known X Values))

eg:          = LINEST( C32:C39, LN(B32:B39)) as an array formula

or            = INDEX( LINEST( C32:C39, LN(B32:B39)), 1) as a normal formula

This is shown in a worked example on the Logarithmic Functions section of the Non-linear Functions page of the example workbook attached.
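As a rough illustration of the same transformation outside Excel, the Python sketch below fits y = b.LN(x) + a by running a straight line fit against LN(x). The function names and the data are made up for the example (the points are generated from b = 2 and a = 5).

```python
import math


def fit_line(xs, ys):
    """Least squares slope and intercept of a straight line."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    m = sxy / sxx
    return m, mean_y - m * mean_x


def fit_logarithmic(xs, ys):
    """Fit y = b.LN(x) + a; only the X values are transformed."""
    b, a = fit_line([math.log(x) for x in xs], ys)
    return b, a  # no conversion back is needed for this model


# Made-up data generated from y = 2.LN(x) + 5
xs = [1, 2, 4, 8, 16]
ys = [2 * math.log(x) + 5 for x in xs]

b, a = fit_logarithmic(xs, ys)
print(round(b, 6), round(a, 6))
```

Here the slope of the fitted line is b and the intercept is a, and unlike the exponential case neither needs to be passed through Exp.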

Power Functions

Power functions are based around the formula y = a.x^b

Excel doesn’t have a specific function dealing with Power functions, however we can again use the Linest function as previously described by first converting the data from a Power function to a Straight line, and this is done by taking the LN of both the X and Y values.

Form:    =LINEST( LN(Known Y Values), LN(Known X Values))

eg:          =LINEST( LN(C58:C65), LN(B58:B65)) as an array formula

or            =INDEX( LINEST( LN(C58:C65), LN(B58:B65)), 1) as a normal formula

The above equations return Parameter 1 as b and Parameter 2 as LN(a)

LN(a) must be converted back to Parameter a by taking the Exp of it, ie: a = EXP(LN(a))

This is shown in a worked example on the Power Functions section of the Non-linear Functions page of the example workbook attached.
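For anyone who wants to check the logic away from the workbook, here is a hedged Python sketch of the power fit. The names fit_line and fit_power are my own and the data is invented so that it lies exactly on y = 2.x^2.

```python
import math


def fit_line(xs, ys):
    """Least squares slope and intercept of a straight line."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    m = sxy / sxx
    return m, mean_y - m * mean_x


def fit_power(xs, ys):
    """Fit y = a.x^b by fitting a straight line to LN(y) against LN(x)."""
    b, ln_a = fit_line([math.log(x) for x in xs], [math.log(y) for y in ys])
    return math.exp(ln_a), b  # only the intercept needs converting back


# Made-up data lying exactly on y = 2.x^2
xs = [1, 2, 3, 4, 5]
ys = [2, 8, 18, 32, 50]

a, b = fit_power(xs, ys)
print(round(a, 6), round(b, 6))
```

Taking LN of both sides of y = a.x^b gives LN(y) = b.LN(x) + LN(a), so the slope is b directly and the intercept is LN(a).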

Polynomial Functions

Polynomial functions are based around the formula y = a.x^n + b.x^(n-1) + c.x^(n-2) + … + m

Which typically looks like  y = a.x^5 + b.x^4 + c.x^3 + d.x^2 + e.x +m

And if any of the parameters a to m are zero, that term of the function will be zero and is not shown.

Excel does have a specific function dealing with Polynomial functions, and you guessed it, it is the Linest function. The Linest function must be told that it is dealing with a polynomial function and this is done by adding another parameter to its input. The extra parameter is added by raising the known X values to the power of an array of numbers 1..n, where n is the power of the polynomial you want to use.

Form:  = LINEST( Known Y Values, Known X Values^{1,2,3,..n})

eg:     for a polynomial of power 3

= LINEST(C84:C94, B84:B94^{1,2,3}) as an array formula

or      =INDEX( LINEST(C84:C94, B84:B94^{1,2,3}), 1) as a normal formula

The above equations return Parameter 1 as a, Parameter 2 as b, Parameter 3 as c and Parameter 4 as the constant if a power 3 polynomial is used.

This is shown in a worked example on the Polynomial Functions section of the Non-linear Functions page of the example workbook attached.
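The X^{1,2,3} trick simply hands Linest one column per power of X. The Python sketch below does the same thing by hand using the textbook normal equations and a small Gauss-Jordan solver. It is an illustration only: the function names and the data are mine, and Excel's own numerical method may well differ.

```python
def solve(a, b):
    """Solve the square system a.x = b by Gauss-Jordan elimination with pivoting."""
    n = len(b)
    aug = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(n):
            if r != col:
                factor = aug[r][col] / aug[col][col]
                aug[r] = [v - factor * p for v, p in zip(aug[r], aug[col])]
    return [aug[i][n] / aug[i][i] for i in range(n)]


def fit_polynomial(xs, ys, degree):
    """Least squares polynomial coefficients, highest power first."""
    # One column per power of x: x^degree, ..., x^1, x^0
    rows = [[x ** p for p in range(degree, -1, -1)] for x in xs]
    k = degree + 1
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(k)] for i in range(k)]
    xty = [sum(r[i] * y for r, y in zip(rows, ys)) for i in range(k)]
    return solve(xtx, xty)


# Made-up data lying exactly on y = x^3 - 2.x^2 + 3.x + 4
xs = [-2, -1, 0, 1, 2, 3]
ys = [-18, -2, 4, 6, 10, 22]

coefficients = fit_polynomial(xs, ys, 3)
print([round(v, 6) for v in coefficients])
```

The coefficients come back highest power first with the constant last, which matches the order in which Linest returns them.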


Multiple Variable Linear Regressions

The Linest function is able to be used to determine the regressions of multiple input variables (X1, X2, … Xn)  that may contribute to a single output variable (Y).

This is best demonstrated with a simple example:

Hui’s Fruit Shop

Say we have a Fruit Shop that only sells Apples & Oranges, and for each year of the past decade we know our sales of each, how many Staff we had, what our Overhead Costs were and how much Profit we made.

This could be tabulated as below:

We can use Linest to work out a regression for this model. That is, what is the relationship between the output and all the inputs?

The format of this will be

Form:    = LINEST(Known Y values, Known X Values, TRUE, TRUE) as an Array Formula

eg:          = LINEST(E122:E132, A122:D132, TRUE, TRUE)

Note that the Known X Values of this example is a 4 column wide area representing the 4 variables.

This must be array entered in an area n + 1 columns wide (where n is the number of X variables) and 5 rows deep, in our case a 5 column x 5 row area.

Note that the equation for the profit is made up of the array values from the first row of the answer array in reverse order

Y = 18.84.X1 + 27.98.X2 + 3851.79.X3 -0.26.X4 -15406.84

And that the parameters in the array are in highest (X4) to lowest (X1) order, followed by b at the end

You can also see the other parameters of the array, of which the most important is the r2 factor. In this example it is 0.90, indicating that there is a good fit between the Inputs and the Profit. Hence we could be relatively comfortable using our profit equation to estimate future profits.
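A multiple variable regression is the same least squares idea with one column per input plus a column of 1s for the constant. The Python sketch below shows this with two made-up inputs; the function names and numbers are hypothetical and are not the Fruit Shop data, and the solver is the textbook normal equations rather than whatever Excel uses internally.

```python
def solve(a, b):
    """Solve the square system a.x = b by Gauss-Jordan elimination with pivoting."""
    n = len(b)
    aug = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(n):
            if r != col:
                factor = aug[r][col] / aug[col][col]
                aug[r] = [v - factor * p for v, p in zip(aug[r], aug[col])]
    return [aug[i][n] / aug[i][i] for i in range(n)]


def fit_multiple(x_rows, ys):
    """Least squares fit of y = m1.x1 + m2.x2 + ... + b; returns [m1, m2, ..., b]."""
    rows = [list(r) + [1.0] for r in x_rows]  # column of 1s gives the constant b
    k = len(rows[0])
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(k)] for i in range(k)]
    xty = [sum(r[i] * y for r, y in zip(rows, ys)) for i in range(k)]
    return solve(xtx, xty)


# Made-up inputs (x1, x2) and output generated from y = 2.x1 + 3.x2 + 5
x_rows = [(1, 2), (2, 1), (3, 4), (4, 3), (5, 6)]
ys = [13, 12, 23, 22, 33]

m1, m2, b = fit_multiple(x_rows, ys)
print(round(m1, 6), round(m2, 6), round(b, 6))
```

Note that this sketch returns the coefficients in X1, X2, b order, whereas Linest lists them from the last X variable back to the first.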

Measuring the accuracy of the Estimation.

In the linear Linest section at the start and in the previous example we briefly mentioned a measure called the r2 parameter, and said that because it had a value of 0.90 we would be comfortable using our estimation parameters to estimate future profits.

r2 is a measure of how closely the estimated values match the data points.

Its values vary between 0 = no relationship and 1 = a perfect relationship.

For example here are 3 charts based on the equation of Y = 3 X + 5

The equations of the lines of best fit and the r2 values are shown on each chart.

You can see that the data of Chart Y1 has a very close fit to the equation, both visually and through a very high r2 value of 0.9962, whereas at Y3 there is a very loose relationship between the data and the estimate, which shows visually as well as in a low r2 value of 0.2552.

The derivation and use of r2 is beyond this post and I would refer you to the Excel Help for the Linest function, where it is discussed, or to Wikipedia.
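Without going into the derivation, r2 for a straight line fit can be calculated as 1 minus (sum of squared residuals / total sum of squares). The Python sketch below does exactly that; the function name is mine and the two made-up data sets are scattered around Y = 3 X + 5, one exactly on the line and one with some noise.

```python
def r_squared(xs, ys):
    """r2 of the least squares straight line through the points."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    m = sxy / sxx
    c = mean_y - m * mean_x
    # Squared error left over after the fit, versus the spread of Y about its mean
    ss_residual = sum((y - (m * x + c)) ** 2 for x, y in zip(xs, ys))
    ss_total = sum((y - mean_y) ** 2 for y in ys)
    return 1 - ss_residual / ss_total


xs = [1, 2, 3, 4, 5]
exact = [8, 11, 14, 17, 20]  # exactly Y = 3 X + 5
noisy = [9, 10, 15, 16, 21]  # made-up values scattered around the same line

r2_exact = r_squared(xs, exact)
r2_noisy = r_squared(xs, noisy)
print(round(r2_exact, 4), round(r2_noisy, 4))
```

The more the points wander away from the fitted line, the bigger the residual sum becomes and the further r2 drops below 1.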

How Does All This Work ?

The Excel Linest, Logest and Growth Functions all use a technique called “Least Squares Approximation”.

This technique finds the parameters which minimise the sum of the squares of the distances from the estimated line to the actual data for all known data points. For the linear models used here that minimum can be calculated directly, and the parameters which define the estimated line are then returned to the user.

How Least Squares works is beyond the scope of this post, but if you are interested have a read at Wikipedia.

There are a number of other estimation techniques available which Excel doesn’t support.

One should never assume that “just because Excel gave me the answer – it is correct” and this applies to the use of statistics more than any other area in maths or Excel usage.

Limitations:

The above techniques need to be used with a degree of caution.

Often a trend will fit the data exactly in a mathematical sense, but in reality you wouldn’t use the equation.

In the picture below (courtesy of Wikipedia) 10 data points are exactly matched by a Polynomial function, whereas the linear estimate misses every point.

Which estimate would you choose to use?  The linear function I hope.
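To put some numbers on that warning, here is a small Python sketch. It builds the polynomial that passes exactly through six made-up points (using Lagrange interpolation, which is my choice of technique for the illustration and not something Excel offers as a worksheet function) and compares its extrapolation with that of a plain straight line fit. The data roughly follows y = x.

```python
def lagrange(xs, ys, x):
    """Evaluate at x the unique polynomial passing through every (xs, ys) point."""
    total = 0.0
    for i, (xi, yi) in enumerate(zip(xs, ys)):
        term = yi
        for j, xj in enumerate(xs):
            if j != i:
                term *= (x - xj) / (xi - xj)
        total += term
    return total


def fit_line(xs, ys):
    """Least squares slope and intercept of a straight line."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    m = sxy / sxx
    return m, mean_y - m * mean_x


# Made-up noisy data that roughly follows y = x
xs = [0, 1, 2, 3, 4, 5]
ys = [0.2, 0.9, 2.3, 2.8, 4.1, 5.2]

m, c = fit_line(xs, ys)
line_at_7 = m * 7 + c
poly_at_7 = lagrange(xs, ys, 7)  # exact on the data, but what about beyond it?
print(round(line_at_7, 2), round(poly_at_7, 2))
```

The polynomial hits every known point perfectly yet heads off a long way from the trend just two steps past the data, while the straight line stays close to it.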

This is discussed in more detail at Wikipedia.

Disclaimer

It should be noted that just because Excel returns an estimated line of best fit to your data, it doesn’t mean that your data actually follows that trend. The fit may just be coincidental, so user discretion is advised in all cases; refer to Limitations above.

There are a number of other estimation techniques available, and interested users should discuss these if required with someone who is an expert in their type of data.

Excel Functions Referred to in this Post

Exp – Return the exponential value of the input

Forecast – Forecast intermediate or future values based on known X and Y values

Growth – Derive an exponential estimate for a known set of X & Y values

Index – Lookup a value at row/column intercept from a table or array of data

Intercept – Return the intercept of a linear estimate

Linest – Derive a linear estimate for a known set of X & Y values

LN – Return the Natural Log value of the input

Logest – Derive an exponential estimate for a known set of X & Y values

Power – Returns the value of a number raised to a power

Slope – Return the slope of a linear estimate

Trend – Forecast intermediate or future values based on known X and Y values

Further Readings

Excel has a number of extra Statistical functions and tools, some of them hidden in the Data Analysis add-in.

I have not discussed or used these tools here as not all users will have access to them and the post is getting longish already.

Functions you may want to have a look at include:

Correl & Pearson: Both functions allow the calculation of correlation coefficients between variables.

Exponential Smoothing: The Exponential Smoothing analysis tool predicts a value that is based on the forecast for the prior period, adjusted for the error in that prior forecast

Fourier Analysis: The Fourier Analysis tool solves problems in linear systems and analyzes periodic data by using the Fast Fourier Transform (FFT) method to transform data, great for analysing periodic and frequency based data.

I would direct readers who are interested in using these techniques to look at the following sources

Microsoft Excel Help – Statistical Functions

Wikipedia

Physics Labs Tutorials

Newton Excel Bach, not (just) an Excel Blog

 

Further Readings

Are You Trendy (Part 1)

Are You Trendy (Part 3)

 

What’s Next ?

In the next post we will look at some Tools that Excel has to assist us in quickly determining which estimate method we can use.

I will also give you a neat little UDF to assist in your interpolations/extrapolations of your data which was used to make the animated GIF at the top of the first post.

ps: Happy Australia Day Everyone 🙂 !

