Bar chart or Line? Scatter plot or box plot? These are the questions we ask ourselves when we set out to make a chart. In this article let me tell you how to pick a chart type so you can make best charts for every situation.
Why is it important to pick right chart?
Because right charts lead to right decisions. We use charts to tell stories, evaluate alternatives, understand trends or find-out if everything is normal. So, an incorrect charting choice can lead to poor judgment of the messages where as a correct chart can lead to right and faster decisions.
How to pick a chart type:
Chart making process can be divided in to 4 steps
- Find-out what you want to say?
- (Re)arrange the data
- Prepare the chart
- Format the chart
1. What is the purpose of this chart?
This is the first and most important step in chart preparation. You must ask yourself, “what is the purpose of this chart?”. Once we know the clear reason why the chart should exist, we will naturally be able to select the correct chart type for that reason.
But I realize that finding the reason itself can be a bit tedious. So I have listed down 6 common reasons that we often have to make a chart:
- to Compare
- to show the Distribution
- to explain Parts of the Whole
- to tell the Trend over time
- to findout the Deviations
- to understand the Relationship
Let us understand these reasons along with the type of charts that go well with these.
1. To Compare:
What it means? You want to compare one set of value(s) with another.
Examples:
- Performance of Product A vs. Product B in 5 regions
- Interview performance of various candidates
Charts that can be used for this reason:
- Bar Charts,
- Column Charts
- Scatter Plots
- Pie Charts
- Line Charts
- Data Tables
2. To Show the Distribution
What it means? You want to show the distribution of a set of values (to understand the outliers, normal ranges etc.)
Examples:
- Distribution of Call waiting times in a call center
- Distribution of bugs found in 10 week software testing phase
Charts that can be used to show distribution:
- Column Charts
- Scatter Plots
- Line charts
- Box Plots
3. Parts of Whole
What it means? You want to show how various parts comprise the whole
Examples:
- Individual product sales as a percentage of whole revenue
- Browser types of customers visiting our website
Charts that can be used to show Parts of Whole:
- Column Charts
- Bar Charts
- Pie Charts
- Data Table
4. Trend over time
What it means? You want to understand the trend over time of some variable(s).
Examples:
- Customer footfalls on the last 365 days
- Share price of MSFT in the last 100 trading sessions
Charts that can be used to show Trend Over Time:
- Column Charts
- Line Charts
- Data Table
5. Deviations
What it means? You want to see which values deviate from the norm.
Examples:
- Failures (or bugs) in the context of Quality Control
- Sales in Various Stores
Charts that can be used to show Deviations:
- Column Charts
- Bar Charts
- Line Charts
- Data Table
6. Relationship
What it means? You want to establish (or show) relationship between 2 (or more) variables
Examples:
- Relationship between Search Phrases and Product Purchases in your website
- Relationship between in-store sales and holidays
Charts that can be used to show Relationship:
- Scatter Plot
- Line Chart
- Data Table
How to pick a chart type when you have more than one reason for the chart?
Simple, use common sense. If I were you, I would either cut down the messages to one or make 2 charts (each conveying one message). If that is not possible, I would consider using dynamic charts or combination charts.
2. (Re)arranging the Data
Even when we know the message and corresponding chart, sometimes, our data may not support us. We then have to rearrange the data. Using excel formulas, pivot tables, tables and data cleaning tools we can easily massage the data.
Once we have the data in required format, we proceed to step 3.
3. Prepare the chart
Since you have already picked the chart type in Step 1, this is very straight forward. Most of the regular charts are available in MS Excel as default charts. You can insert them with few clicks.
But for some special chart types, you may have to prepare the chart by helper series, formatting etc.
4. Format the chart
While most formatting is done as per individual taste, there are some ground rules that apply on almost all charts. Here they are,
- No non-zero axis scale on bar charts [reasons and discussions]
- Make subtle grid-lines (or remove them) [how to remove grid lines]
- Add labels to important points [labeling techniques]
- Add descriptive, bold titles
- Position axis, scales at the right places (for eg. y-axis to the right on a large time series chart)
- Use simple, easy colors
A final word:
The ideas in this post are meant to be guide lines, not final words in the world of visualization. While these rules can help you make a good chart, a great chart take so much more. Knowledge of your data, Passion for what you do and Genuine focus on your audience’ needs can make your chart truly outstanding. All the best.
References:
Communicating Numbers – White Paper by Stephen Few [PDF]
Resources & Further Help:
- Chart chooser – Juice Analytics
- Data vis 101 – How to choose a chart – Hubspot
- Selecting right char type – KD Nuggets
- Charting Principles
- Charting Tutorials from Peltier Tech
- More articles from Stephen Few
- Charting Wisdom from Jorge Camoes
What is the process you use for Chart Selection?
I would love to know the process you use when selecting a chart type. Please share using comments.
41 Responses to “SQL Queries from Excel”
I use this method very often.
I always use =SUBSTITUTE (ColumnWithText,"'","''")
to be sure that potential apostrophe in text columns are doubled as required in SQL.
Awesome ! I don't use excel very often so the substitute thing is gold to me 🙂 thanks !
@Leonid.. that is a good technique to use substitute to clean up text apostrophes. thanks
Goal:
Generate update statement in excel where the columns that can be updated are dynamic
You want the columns which are not updated to keep the same value
(or not be overwritten with NULL values with the new generated statement)
the statement can be applied to multiple rows in excel for the same column headers
(This is why the '$' exist for the column headers that are being set)
A1 = First_Name
B1 = Last_Name
C1 = Middle_Name
="
UPDATE PERSONS "&CHAR(10)&
" SET 1 = 1 "&CHAR(10)&
IF(LEN(TRIM($A2))=0,"",", "&$A$1&" = '"&$A2&"'"&CHAR(10))&
IF(LEN(TRIM($B2))=0,"",", "&$B$1&" = '"&$B2&"'"&CHAR(10))&
IF(LEN(TRIM($C2))=0,"",", "&$C$1&" = '"&$C2&"'"&CHAR(10))&
" WHERE name = 'staticordynamicvalue' AND gender = 'staticordynamicvalue'
"
Output (if all columns are set):
UPDATE PERSONS SET 1 = 1,
First_Name = 'Joe',
Last_Name = 'ORien',
Middle_Name = 'Richard'
WHERE age = 28 AND gender = 'm'
Output (if only First _Name (A1) is set):
UPDATE PERSONS SET 1 = 1,
First_Name = 'Joe'
WHERE age = 28 AND gender = 'm'
Possibly my post above is confusing without the actual table to look at. I will do the same example with the table used here. Instead of an insert statement I will generate an update statement for the columns, Cust_Name, Phone & E-mail
where we can generate an update statement for any column individually or together. 🙂 I hope this can help.
=”
UPDATE table “&CHAR(10)&
” SET 1 = 1 “&CHAR(10)&
IF(LEN(TRIM($A2))=0,”",”,Cust_Name = ‘”&$B3&”‘”&CHAR(10))&
IF(LEN(TRIM($B2))=0,”",”, Phone = ‘”&$C3&”‘”&CHAR(10))&
IF(LEN(TRIM($C2))=0,”",”, E-mail = ‘”&$D3&”‘”&CHAR(10))&
” WHERE Cust_Name = ’Bill Gates'
”
Thanks, it has been very useful !
It saved me at least 30 minutes, and time is the most expensive thing in our world...
Hey Paul,
What if any of A2, B2, or C2 is a date field?
The formula above is taking date as string. Any solution?
Even I faced the same problem. If any of the above columns are date, it is taken as string. Any work around for this?
I've found the string concatenation method works well.
At the risk of sounding spammy I would mention that
if it's something your are doing regularly it might be worth investigating a tools
that make it easier, such as QueryCell, an excel add-in I've developed.
It gives you a right click menu option that will produce and then customize insert statements for the selected region of Excel data.
Cheers
Sam
Hi,
For inserting the excel data to your SQL table, you can create insert statements in excel file according to your columns.
then just execute the statements all at once, it will insert the required data to sql server table.
thanks,
How...?
I tried to generate t-sql insert queries from the above example
="insert into values('" &A2 &"','" & B2& "');"
but it generates on one record instead of all records from excel sheet.
I'm using Excel 2003 and the excel sheet contains 922 records.
Most data bases can generate DDL for any object but not a lot of them allow generation of INSERT statements for the table data.
The workaround is to make use of ETL Tools for transferring data across servers. However, there exists a need to generate INSERT statements from the tables for porting data.
Simplest example is when small or large amount of data needs to be taken out on a removable storage media and copied to a remote location, INSERT..VALUES statements come handy.
There is a number of scripts available to perform this data transformation task. The problem with those scripts that all of them database specific and they do not work with textiles
Advanced ETL processor can generate Insert scripts from any data source including text files
http://www.dbsoftlab.com/generating-insert-statements.html
Super Aiticle. Thanks for this post.
I used to deal with the same problem, until found this awsome and free tool.
http://www.xtrategics.com/shapp/String%20Handler.application
regards,
Hi ,
i need a sql query to update a DB in excel 2010..
i have the query(SQL) for insert in excel as ,
="insert into customers values('" &B3 &"','" & C3 & "','"&D3&"');"
similarly i need q sql query for update in excel
i want clear formulas only for insert,delete,update,select
Hi !
I would like to thank you so much ! This trick saves me a lot of time. Thank you so much. Really appreciate it !
-Ankit
You may like to take advantage of this unique tool 'Excel to Database'.
(free for 60 days)http://leansoftware.net The Excel-to-Database utility enables you to validate and transfer data from Microsoft Excel or text file to a database table or stored procedure process. Any text data can be pasted into the application, this may be from another Excel sheet or from text files such as CSV format. SQL Server, Access, MySQL, FoxPro .. Application features Some unique features of Excel to Database include: ?Easy to use color coded/traffic light data validation ?Data is validated as soon it is typed or pasted into Excel ?Upload Excel data to a table or stored procedure process ?Allow default values ?Mandatory/must have fields can be specified ?Allow user friendly column names ?Allow excel formula / calculated fields ?Multiple database type support: Microsoft SQL Server, Access, MySQL and others (to be tested) ?Supports Custom SQL scripts, with SQL/Excel merge fields ?Database validation checks ensure you comply with any rules defined within the database ?Multiple Task configuration ?For co-operative use, Tasks can be shared across a network ?Task configuration is password protected http://leansoftware.net
Its works fine for single record.
I want to update 1000 records in DB. Can you help me.
[...] [...]
Excel database tasks 2.3 (EDT)
you can now load directly from any source into Excel, validate and upload to most SQL database platforms including SQL Server with automatic transaction wrapping.
You can also use EDT as a multi-user application by easily designing your own Edit data tasks and deploying EDT on your users workstations.
Automatically creates UPDATE/INSERT statements based on the primary key. Default SQL can be modified as you require.
Makes the best use if Excel power - formatting, formula, validation, conditional formatting.. without creating any problematic spreadsheets!
Release details on the blog:
http://leansoftware.net/forum/en-us/blog.aspx
Thanks for the interest
Richard
Thanks for the valueable information, it really help me alot.
Thanks again.
As I do with a field of type date?
= "UPDATE SET business datetime =" & "'" & A2 & "' WHERE ID =" & B2 & ""
the date is not 03/10/2012 is 41246. Even putting quotes ...
Please show how to do it properly with dates as well as when those dates are empty. Thanks!
In a separate column make the date to Text using below formula
=TEXT(C2,"mm/dd/yyyy") Then Refer this text column in your update statement
Great post saved me a a load of time on a task i had to complete
thanks for sharing article... helpful!
Thanks 🙂
Hello,
Nice article.
I have also created one tool for create table script using excel http://devssolution.com/create-table-in-sql-using-excel/
Please check it.
Thanks & Regards,
Sandeep Bhadauriya
[…] Excel formula used – http://chandoo.org/wp/2008/09/22/sql-insert-update-statements-from-csv-files/ […]
If any one can help me out with following.
I want to know a SQL query of below excel formula:
=LOOKUP(0,-SEARCH(LEFT(F2,LEN($B$2:$B$100))+0,$B$2:$B$100),$A$2:$A$100)
Excel data is as below;
Name Codes
names1 992
names2 57
names3 856
names4 297
names5 63
if there is a number (29756789) then it should search in sql by taking the prefix of number (297) from (29756789) and return the name field (name4).
Codes can be of two digit or three.
Thanks
Here is a link to an Online automator to convert CSV files to SQL Insert Into statements:
CSV-to-SQL: http://csv-to-sql.herokuapp.com
http://stackoverflow.com/questions/1570387/how-to-insert-data-from-an-excel-sheet-into-a-database-table/37409790#37409790
="INSERT INTO table VALUES (" &A3 &",'" & B3 & "','"&C3&"','" & D3 & "','" & E3 & "'," & F3 & "," & G3 & "," & H3 & ",'" & I3 & "'," & J3 & ");"
B3 has date data that looks like 9/22/17 but with the formula above b3 is coming out as 43000?
how do i fix that?
I just want to insert the Excel records in Sql table without Visiting SQL.
basically i m just want to run a command in Excel Only.
Help Me..plz..?
Hi I have a question maybe you guys have an answer for me
="insert into customers values('" &B3 &"','" & C3 & "','"&D3&"');" where B3, C3, D3 refer to above table data.
the above technique works but is there a way to write it so it takes a range instead of individual columns. because I have an extremely wide table
="insert into customers values(B3:D3);" where B3, C3, D3 refer to above table data.
Awsome
Its Great Effort to help everyone who working with excel.
Thanks for the mini-tutorial on SQL from Excel. Didi it several years ago, but couldn't remember the syntax! All the dialogue was really helpful as well!
The formula above is taking date as string. Any solution?