Descriptive Statistics: A set of methods used to summarize and describe the main features of a dataset. It does not make inferences or predictions, it simply *describes* what the data shows.
It is typically broken down into two categories:
Excel has built-in functions for all major measures of central tendency. Assume your data is in the range A2:A101.
The sum of all values divided by the count of values.
Interpretation: The "balancing point" of the data. It is sensitive to outliers.
The middle value when the data is sorted. 50% of data is above it, 50% is below.
Interpretation: The "true middle" of the data. It is not sensitive to outliers, making it a "robust" measure.
The value that occurs most frequently in the dataset.
Interpretation: The most "typical" or "popular" value. A dataset can have no mode or multiple modes (=MODE.MULT).
These functions measure how spread out your data is. Assume data in A2:A101.
The difference between the maximum and minimum values.
Interpretation: The total spread of the data. Very sensitive to outliers.
The average of the squared differences from the Mean. It measures the average spread.
Interpretation: Hard to interpret directly because its units are squared (e.g., "squared dollars"). Its main use is to calculate the standard deviation.
The square root of the variance. This is the most common measure of spread, expressed in the same units as the data.
Interpretation: A "typical" or "average" distance of a data point from the mean.
The Analysis ToolPak is a powerful Excel add-in that can perform complex statistical analysis with a few clicks, generating a full report.
A1:A101).When the ToolPak generates its report, you must be able to interpret it.
| Statistic | Interpretation | 
|---|---|
| Mean | The "average" value. | 
| Standard Error | A measure of how accurate the sample mean is as an estimate of the population mean. (Smaller is better). | 
| Median | The 50th percentile, or "middle" value. Not sensitive to outliers. | 
| Mode | The most frequent value. (#N/A means no mode). | 
| Standard Deviation | The average spread around the mean. (Use this to understand variability). | 
| Sample Variance | The square of the standard deviation. | 
| Kurtosis | Measures "peakedness". (This is Excess Kurtosis, so 0 is normal, >0 is peaked, <0 is flat). | 
| Skewness | Measures asymmetry. (0 is symmetric, >0 is positive skew, <0 is negative skew). | 
| Range | Total spread (Max - Min). | 
| Minimum | The smallest value in the dataset. | 
| Maximum | The largest value in the dataset. | 
| Sum | The total of all values. | 
| Count | The number of observations (n). |