

How to Calculate the Arithmetic Mean with Examples
The arithmetic mean provides a precise quantitative measure for the central tendency of a collection of real numbers. It is foundational in statistics, data analysis, and sequence-based mathematics, serving as the standard approach for summarising a collection through a single representative value.
Formal Definition and Expression of the Arithmetic Mean
Given a finite set of real numbers $x_1, x_2, x_3, \ldots, x_n$, where $n$ is a positive integer, the arithmetic mean (denoted by $\overline{x}$) is defined by the formula \[ \overline{x} = \frac{1}{n}\sum_{i=1}^{n} x_i \] This formula states that the arithmetic mean is obtained by calculating the sum of all elements in the data set and then dividing this total by the number of elements.
Computation of the Arithmetic Mean for Ungrouped Data
If the data set is ungrouped, that is, each observation is counted singly, with no assigned frequency (apart from $1$), the computation proceeds directly. For example, for data points $a_1, a_2, \ldots, a_n$, \[ \overline{x} = \frac{a_1 + a_2 + \cdots + a_n}{n} \] No construction of a frequency table is necessary in this case.
Arithmetic Mean with Frequency Distribution (Grouped Discrete Data)
Suppose the data set consists of distinct values $x_1, x_2, \ldots, x_k$, each associated with a corresponding positive integer frequency $f_1, f_2, \ldots, f_k$. The total number of data points is $N = \sum_{i=1}^{k} f_i$. The arithmetic mean is then \[ \overline{x} = \frac{\sum_{i=1}^{k} f_i x_i}{\sum_{i=1}^{k} f_i} \] This formula is required when the data consists of repeated values, each with an associated empirical frequency.
Arithmetic Mean for Grouped Continuous Data: Direct, Assumed Mean, and Step-Deviation Methods
In continuous or interval data, values are grouped into non-overlapping classes, each typically represented by a class interval $[a_j, b_j)$ with frequency $f_j$ for $j$ from $1$ to $k$. Each class is associated with a class mark or mid-value $x_j = \dfrac{a_j + b_j}{2}$, representing all data points in the interval for the purpose of mean calculation. The mean is evaluated as \[ \overline{x} = \frac{\sum_{j=1}^{k} f_j x_j}{\sum_{j=1}^{k} f_j} \] For computational efficiency, especially when $x_j$ are large or awkwardly distributed, transformations are employed:
If an assumed mean $A$ is selected (typically near the center of $x_j$ values), set $d_j = x_j - A$. The mean can then be written as \[ \overline{x} = A + \frac{\sum_{j=1}^{k} f_j d_j}{\sum_{j=1}^{k} f_j} \] This method reduces computation when deviations $d_j$ remain small.
If all class intervals are equal with width $h$, define $u_j = \dfrac{x_j - A}{h}$. The step-deviation method yields \[ \overline{x} = A + h \cdot \frac{\sum_{j=1}^{k} f_j u_j}{\sum_{j=1}^{k} f_j} \] This form further simplifies calculation, as $u_j$ are integral for regular intervals.
Illustrative Example: Arithmetic Mean for Ungrouped Data
Given: Data points: $3$, $5$, $7$, $10$, $15$.
Substitution: $\overline{x} = \dfrac{3 + 5 + 7 + 10 + 15}{5}$.
Simplification: $3 + 5 = 8$, $8 + 7 = 15$, $15 + 10 = 25$, $25 + 15 = 40$.
Final result: $\overline{x} = \dfrac{40}{5} = 8$.
Illustrative Example: Arithmetic Mean for Discrete Frequency Distribution
Given: $x_i = 2,\,4,\,6,\,8$ with corresponding $f_i = 4,\,3,\,2,\,1$.
Substitution: $\overline{x} = \dfrac{(4\times 2) + (3 \times 4) + (2 \times 6) + (1 \times 8)}{4+3+2+1}$.
Simplification: $4 \times 2 = 8$, $3 \times 4 = 12$, $2 \times 6 = 12$, $1 \times 8 = 8$; sum of numerators: $8+12+12+8=40$; denominator $4+3+2+1=10$.
Final result: $\overline{x} = \dfrac{40}{10} = 4$.
Illustrative Example: Arithmetic Mean for Grouped Continuous Data using Step-Deviation
Given: Classes: $10$–$20$, $20$–$30$, $30$–$40$, $40$–$50$, $50$–$60$; frequencies: $3,6,9,7,5$.
Step 1 (Class marks): $x_1 = 15$, $x_2 = 25$, $x_3 = 35$, $x_4 = 45$, $x_5 = 55$.
Step 2 (Assumed mean): Let $A = 35$.
Step 3 (Step deviations $u_j$): $h=10$, $u_j = \dfrac{x_j - 35}{10}$. So, $u_1 = \dfrac{15 - 35}{10} = -2$, $u_2 = -1$, $u_3 = 0$, $u_4 = 1$, $u_5 = 2$.
Step 4 (Compute $f_ju_j$): $3 \times (-2) = -6$, $6 \times (-1) = -6$, $9 \times 0 = 0$, $7 \times 1 = 7$, $5 \times 2 = 10$. Sum: $-6 + (-6) + 0 + 7 + 10 = 5$.
Step 5: $N = 3+6+9+7+5 = 30$.
Substitution: $\overline{x} = 35 + 10 \cdot \frac{5}{30}$.
Simplification: $\frac{5}{30} = \frac{1}{6}$, $10 \cdot \frac{1}{6} = \frac{10}{6} = 1.\overline{6}$.
Final result: $\overline{x} = 35 + 1.\overline{6} = 36.\overline{6}$.
Fundamental Properties of the Arithmetic Mean
The sum of deviations of a finite data set from its arithmetic mean is zero. For $x_1, x_2, \ldots, x_n$ with mean $\overline{x}$, \[ \sum_{i=1}^n (x_i - \overline{x}) = 0 \] This holds for both ungrouped and grouped data (adjusting for frequency if applicable).
The arithmetic mean is affected linearly by addition or subtraction of a constant: If $y_i = x_i + c$ for all $i$, then the mean is translated by $c$, i.e., $\overline{y} = \overline{x} + c$.
Under scaling of all data by a constant $k$, i.e., $y_i = kx_i$, the mean scales accordingly: $\overline{y} = k\overline{x}$.
The sum of the squared deviations from the arithmetic mean is minimized compared to any other real number, that is, \[ \sum_{i=1}^n (x_i - \overline{x})^2 \leq \sum_{i=1}^n (x_i - a)^2 \quad \forall a \in \mathbb{R} \] This is a fundamental property in variance minimization and least-squares estimation.
Advantages and Limitations of the Arithmetic Mean
The arithmetic mean takes into account every observation in the data set, is uniquely defined, and enables further algebraic manipulation (such as combined mean of multiple groups). Its calculation is straightforward for ungrouped, discrete, and continuous data formats. It is commonly used in various fields including statistics, economics, physical sciences, and sequence analysis. For an extended discussion of related concepts, see Understanding Arithmetic Mean.
However, the arithmetic mean is not resistant to extreme (outlie) values, and may not be representative in highly skewed distributions. It is not defined for qualitative data and may not be calculable if one or more observations are missing or intervals are open-ended. For a detailed distinction between mean and other central tendencies, refer to Difference Between Mean, Median, and Mode.
Application of Arithmetic Mean in Sequences and Series
Given two real numbers $a$ and $b$, the arithmetic mean between $a$ and $b$ is their average: \[ A = \frac{a+b}{2} \] This quantity is precisely the central term in an arithmetic progression (AP) constructed with first term $a$, common difference $d = b-a$ (with three terms: $a, A, b$). For arithmetic means in general sequences, consult Understanding Arithmetic Mean.
Practice Problem
Given: The arithmetic mean of $x$, $x+4$, $x+8$, $x+12$ is $20$. Find $x$.
Substitution: Arithmetic mean $= \dfrac{x + (x+4) + (x+8) + (x+12)}{4} = 20$.
Simplification: $x + x + 4 + x + 8 + x + 12 = 4x + 24$.
$\dfrac{4x+24}{4} = 20$.
$4x+24 = 80$.
$4x = 56$.
Final result: $x = 14$.
For additional foundational formulas, refer to Basic Math Formulas Overview.
FAQs on Understanding the Arithmetic Mean Formula
1. What is the formula for arithmetic mean?
The formula for arithmetic mean is the sum of all values divided by the number of values.
Arithmetic Mean (AM) = (Sum of Values) / (Number of Values)
For example:
- AM = (x₁ + x₂ + x₃ + ... + xn) / n
- Where x₁, x₂, x₃, …, xn are the values and n is the total number of values.
2. How do you calculate the arithmetic mean for grouped data?
To calculate the arithmetic mean for grouped data, use the formula involving class marks and frequencies.
Arithmetic Mean (Grouped Data) = Σ(f × x) / Σf
- f: frequency of each class
- x: class mark (mid-value of each class interval)
3. What are the steps to find the arithmetic mean for ungrouped data?
To find the arithmetic mean for ungrouped data, follow these basic steps:
- Add all the given values together.
- Count the total number of values.
- Divide the total sum by the number of values.
4. Why is arithmetic mean important in statistics?
Arithmetic mean is important in statistics because it serves as a measure of central tendency.
- Represents the average value in a data set
- Helps compare different data sets easily
- Used for further statistical calculations like variance and standard deviation
- Commonly used in everyday data analysis and mathematics exams
5. What are the properties of arithmetic mean?
The main properties of arithmetic mean are:
- Every data set has only one arithmetic mean
- It is affected by extreme values or outliers
- Sum of deviations of values from the mean is always zero
- Simple to calculate and widely used in statistics
6. Can the arithmetic mean be used for all types of data?
Arithmetic mean can only be used for quantitative data that is measured on an interval or ratio scale.
- Not suitable for qualitative or categorical data
- Not reliable if the data contains extreme outliers
- Best for data sets with evenly distributed values
7. What is the difference between arithmetic mean and median?
Arithmetic mean and median are both measures of central tendency, but with differences:
- Arithmetic mean: Average of all values
- Median: Middle value when data is arranged in order
- Mean is affected by outliers; median is not
8. How do you find the arithmetic mean if frequencies are given?
If frequencies are given, use the formula:
Arithmetic Mean = Σ(f × x) / Σf
- Multiply each value (x) by its frequency (f)
- Add all such products
- Divide the total by sum of frequencies
9. Is arithmetic mean always a value from the data set?
No, the arithmetic mean is not always from the data set.
- The mean can be a value that is not present among the original numbers
- This happens especially when the sum is not a multiple of the number of items
- Mean simply shows the central value, not an actual observed value
10. What are the limitations of arithmetic mean?
Arithmetic mean has some limitations, such as:
- Heavily influenced by extreme values (outliers)
- Not suitable for qualitative data
- May misrepresent data if distribution is skewed
- Not useful when data contain open-ended intervals





















