Question
The mean and standard deviation of a set of n1 observations are `barx_1` and s1, respectively while the mean and standard deviation of another set of n2 observations are `barx_2` and s2, respectively. Show that the standard deviation of the combined set of (n1 + n2) observations is given by
S.D. = `sqrt((n_1(s_1)^2 + n_2(s_2)^2)/(n_1 + n_2) + (n_1n_2 (barx_1 - barx_2)^2)/(n_1 + n_2)^2)`
Solution
Let `x_i`, i = 1, 2, 3, ..., n1 be the observations of the first set
And `y_j`, j = 1, 2, 3, ..., n2 be the observations of the second set
∴ `barx_1 = 1/n_1 sum_(i = 1)^(n_1) x_i`
And `barx_2 = 1/n_2 sum_(j = 1)^(n_2) y_j`
⇒ `s_1^2 = 1/n_1 sum_(i = 1)^(n_1) (x_i - barx_1)^2`
And `s_2^2 = 1/n_2 sum_(j = 1)^(n_2) (y_j - barx_2)^2`
Now mean of the combined series is given by
`barx = 1/(n_1 + n_2) [sum_(i = 1)^(n_1) x_i + sum_(j = 1)^(n_2) y_j]`
= `(n_1 barx_1 + n_2 barx_2)/(n_1 + n_2)`
Therefore, `sigma^2` of the combined series is
`sigma^2 = 1/(n_1 + n_2) [sum_(i = 1)^(n_1) (x_i - barx)^2 + sum_(j = 1)^(n_2) (y_j - barx)^2]`
Now, `sum_(i = 1)^(n_1) (x_i - barx)^2 = sum_(i = 1)^(n_1) (x_i - barx_1 + barx_1 - barx)^2`
= `sum_(i = 1)^(n_1) (x_i - barx_1)^2 + n_1 (barx_1 - barx)^2 + 2(barx_1 - barx) sum_(i = 1)^(n_1) (x_i - barx_1)`
But `sum_(i = 1)^(n_1) (x_i - barx_1)` = 0
∵ The algebraic sum of the deviations of the values of the first series from their mean is zero
Therefore, `sum_(i = 1)^(n_1) (x_i - barx)^2 = n_1s_1^2 + n_1(barx_1 - barx)^2`
= `n_1s_1^2 + n_1d_1^2`
Where `d_1 = (barx_1 - barx)`
Similarly, we have
`sum_(j = 1)^(n_2) (y_j - barx)^2 = sum_(j = 1)^(n_2) (y_j - barx_2 + barx_2 - barx)^2`
= `n_2s_2^2 + n_2d_2^2`
Where `d_2 = (barx_2 - barx)`
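The decomposition above can be checked numerically. The following is a minimal sketch, assuming two small made-up data sets (`xs` and `ys` are hypothetical, not from the question); it uses exact fractions so that both sides of `sum (x_i - barx)^2 = n_1s_1^2 + n_1d_1^2` can be compared without rounding error.

```python
from fractions import Fraction


def mean(values):
    """Arithmetic mean as an exact fraction."""
    return Fraction(sum(values), len(values))


def sum_sq_dev(values, centre):
    """Sum of squared deviations of the values from a given centre."""
    return sum((Fraction(v) - centre) ** 2 for v in values)


# Hypothetical data sets, chosen only for illustration.
xs = [2, 4, 4, 6]
ys = [1, 5, 9]

n1 = len(xs)
x1_bar = mean(xs)                    # mean of the first set
combined_mean = mean(xs + ys)        # mean of the combined set
s1_sq = sum_sq_dev(xs, x1_bar) / n1  # variance of the first set (divisor n1)
d1 = x1_bar - combined_mean          # d_1 as defined in the solution

lhs = sum_sq_dev(xs, combined_mean)  # sum of (x_i - combined mean)^2
rhs = n1 * s1_sq + n1 * d1 ** 2      # n_1 s_1^2 + n_1 d_1^2

assert lhs == rhs
```

The same check applies to the second set with `ys`, `s_2` and `d_2` in place of `xs`, `s_1` and `d_1`.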
Now combined Standard Deviation (S.D.)
`sigma = sqrt((n_1(s_1^2 + d_1^2) + n_2(s_2^2 + d_2^2))/(n_1 + n_2))`
Where `d_1 = barx_1 - barx`
= `barx_1 - ((n_1barx_1 + n_2 barx_2)/(n_1 + n_2))`
= `(n_2(barx_1 - barx_2))/(n_1 + n_2)`
And `d_2 = barx_2 - barx`
= `barx_2 - ((n_1barx_1 + n_2barx_2)/(n_1 + n_2))`
= `(n_1(barx_2 - barx_1))/(n_1 + n_2)`
∴ `sigma^2 = 1/(n_1 + n_2)[n_1s_1^2 + n_2s_2^2 + (n_1n_2^2(barx_1 - barx_2)^2)/(n_1 + n_2)^2 + (n_2n_1^2(barx_2 - barx_1)^2)/(n_1 + n_2)^2]`
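Since `n_1n_2^2 + n_2n_1^2 = n_1n_2(n_1 + n_2)` and `(barx_2 - barx_1)^2 = (barx_1 - barx_2)^2`, the last two terms combine to `(n_1n_2(barx_1 - barx_2)^2)/(n_1 + n_2)`.
So, `sigma = sqrt((n_1s_1^2 + n_2s_2^2)/(n_1 + n_2) + (n_1n_2(barx_1 - barx_2)^2)/(n_1 + n_2)^2)`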
Hence proved.
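As a sanity check on the result, the formula can be compared with the standard deviation computed directly from the pooled data. This is only a sketch under stated assumptions: the helper name `combined_sd` and the two data sets are hypothetical, and the population standard deviation (divisor n, as in the solution) is used throughout via `statistics.pstdev`.

```python
import math
import statistics


def combined_sd(n1, mean1, s1, n2, mean2, s2):
    """Combined S.D. of two sets from their sizes, means and S.D.s."""
    total = n1 + n2
    within = (n1 * s1 ** 2 + n2 * s2 ** 2) / total          # first term under the root
    between = n1 * n2 * (mean1 - mean2) ** 2 / total ** 2   # second term under the root
    return math.sqrt(within + between)


# Hypothetical data sets, chosen only for illustration.
xs = [2, 4, 4, 6]
ys = [1, 5, 9]

formula_sd = combined_sd(
    len(xs), statistics.fmean(xs), statistics.pstdev(xs),
    len(ys), statistics.fmean(ys), statistics.pstdev(ys),
)
direct_sd = statistics.pstdev(xs + ys)  # S.D. of the pooled observations

assert math.isclose(formula_sd, direct_sd)
```

A numerical agreement of this kind does not replace the proof; it only guards against slips in the algebra.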
Related questions
Find the mean and variance for the first n natural numbers.
The following is the record of goals scored by team A in a football session:
| No. of goals scored | 0 | 1 | 2 | 3 | 4 |
| No. of matches | 1 | 9 | 7 | 5 | 3 |
For the team B, mean number of goals scored per match was 2 with a standard deviation 1.25 goals. Find which team may be considered more consistent?
The sum and sum of squares corresponding to length x (in cm) and weight y (in gm) of 50 plant products are given below:
`sum_(i=1)^50 x_i = 212, sum_(i=1)^50 x_i^2 = 902.8, sum_(i=1)^50 y_i = 261, sum_(i=1)^50 y_i^2 = 1457.6`
Which is more varying, the length or weight?
The mean and standard deviation of 20 observations are found to be 10 and 2, respectively. On rechecking, it was found that an observation 8 was incorrect. Calculate the correct mean and standard deviation in each of the following cases:
- If wrong item is omitted.
- If it is replaced by 12.
Find the mean, variance and standard deviation for the data:
2, 4, 5, 6, 8, 17.
Find the mean, variance and standard deviation for the data:
6, 7, 10, 12, 13, 4, 8, 12.
The variance of 15 observations is 4. If each observation is increased by 9, find the variance of the resulting observations.
The mean and standard deviation of 6 observations are 8 and 4 respectively. If each observation is multiplied by 3, find the new mean and new standard deviation of the resulting observations.
The mean and standard deviation of 100 observations were calculated as 40 and 5.1 respectively by a student who took by mistake 50 instead of 40 for one observation. What are the correct mean and standard deviation?
Find the standard deviation for the following data:
| x : | 3 | 8 | 13 | 18 | 23 |
| f : | 7 | 10 | 15 | 10 | 6 |
Calculate the mean and S.D. for the following data:
| Expenditure in Rs: | 0-10 | 10-20 | 20-30 | 30-40 | 40-50 |
| Frequency: | 14 | 13 | 27 | 21 | 15 |
Calculate the standard deviation for the following data:
| Class: | 0-30 | 30-60 | 60-90 | 90-120 | 120-150 | 150-180 | 180-210 |
| Frequency: | 9 | 17 | 43 | 82 | 81 | 44 | 24 |
Calculate the A.M. and S.D. for the following distribution:
| Class: | 0-10 | 10-20 | 20-30 | 30-40 | 40-50 | 50-60 | 60-70 | 70-80 |
| Frequency: | 18 | 16 | 15 | 12 | 10 | 5 | 2 | 1 |
Find the mean and variance of frequency distribution given below:
| xi: | 1 ≤ x < 3 | 3 ≤ x < 5 | 5 ≤ x < 7 | 7 ≤ x < 10 |
| fi: | 6 | 4 | 5 | 1 |
Mean and standard deviation of 100 observations were found to be 40 and 10 respectively. If at the time of calculation two observations were wrongly taken as 30 and 70 in place of 3 and 27 respectively, find the correct standard deviation.
Two plants A and B of a factory show the following results about the number of workers and the wages paid to them:
| | Plant A | Plant B |
| No. of workers | 5000 | 6000 |
| Average monthly wages | Rs 2500 | Rs 2500 |
| Variance of distribution of wages | 81 | 100 |
In which plant A or B is there greater variability in individual wages?
From the data given below state which group is more variable, G1 or G2?
| Marks | 10-20 | 20-30 | 30-40 | 40-50 | 50-60 | 60-70 | 70-80 |
| Group G1 | 9 | 17 | 32 | 33 | 40 | 10 | 9 |
| Group G2 | 10 | 20 | 30 | 25 | 43 | 15 | 7 |
Find the coefficient of variation for the following data:
| Size (in cms): | 10-15 | 15-20 | 20-25 | 25-30 | 30-35 | 35-40 |
| No. of items: | 2 | 8 | 20 | 35 | 20 | 15 |
If the sum of the squares of deviations for 10 observations taken from their mean is 2.5, then write the value of standard deviation.
If each observation of a raw data whose standard deviation is σ is multiplied by a, then write the S.D. of the new set of observations.
If v is the variance and σ is the standard deviation, then
The standard deviation of the data:
| x: | 1 | a | `a^2` | .... | `a^n` |
| f: | `""^nC_0` | `""^nC_1` | `""^nC_2` | .... | `""^nC_n` |
is
Let a, b, c, d, e be the observations with mean m and standard deviation s. The standard deviation of the observations a + k, b + k, c + k, d + k, e + k is
The mean of 100 observations is 50 and their standard deviation is 5. The sum of all squares of all the observations is
The standard deviation of the observations 6, 5, 9, 13, 12, 8, 10 is
Show that the two formulae for the standard deviation of ungrouped data,
`sigma = sqrt((sum(x_i - barx)^2)/n)` and `sigma' = sqrt((sum x_i^2)/n - barx^2)`, are equivalent.
The mean life of a sample of 60 bulbs was 650 hours and the standard deviation was 8 hours. A second sample of 80 bulbs has a mean life of 660 hours and standard deviation 7 hours. Find the overall standard deviation.
The standard deviation of the data 6, 5, 9, 13, 12, 8, 10 is ______.
Let a, b, c, d, e be the observations with mean m and standard deviation s. The standard deviation of the observations a + k, b + k, c + k, d + k, e + k is ______.
Let x1, x2, ... xn be n observations. Let wi = lxi + k for i = 1, 2, ...n, where l and k are constants. If the mean of xi’s is 48 and their standard deviation is 12, the mean of wi’s is 55 and standard deviation of wi’s is 15, the values of l and k should be ______.
Coefficient of variation of two distributions are 50 and 60, and their arithmetic means are 30 and 25 respectively. Difference of their standard deviation is ______.
The standard deviation of a data set is ______ of any change in origin, but is ______ on the change of scale.
