| Variable | Discrete | Continuous | Possibly Both |
| Daily low temperature in New York | ![]() |
| |
| Brand of cereal in supermarket |
| ||
| Telephone number |
| ||
| License plates of cars |
| ||
| Weight lost in a weight loss program |
| ||
| Time spent on studying for ESMA 3101 during last week |
|
Instead draw this barchart:
Things that you need to change from the default graph:
Numbers are based on percentages
Percentages appear as labels above bars
Bars have a black right slant at 22.5°
Title - with your name
Do this work by hand. You can check your answers using MINITAB. Remember, though, the answers might be slightly different.
a) Find the 20th and the 80th percentiles of the AIDS rates. What are the corresponding countries?
b) Find the 5 number summary for the aids rates.
c) According to the boxplot as drawn by MINITAB, which countries are outliers in this data set?
d) Let's say the WHO wants to use the "average" rate of AIDS infection (together with the number of people living in the Americas) to estimate the number of AIDS infected people in the Americas. Should they use the mean or the median to find the "average"?
a): What is the type of data of the variables?
b): Find the mean and standard deviation of Time
c): Find the 5-number summary and draw the boxplot of Time
d): Draw the boxplot of Time, by hand
=15.344 and s=0.041. Should they except this shipment?
| Variable | Discrete | Continuous | Possibly Both |
| Daily low temperature in New York | ![]() |
| |
| Brand of cereal in supermarket |
| ||
| Telephone number |
| ||
| License plates of cars |
| ||
| Weight lost in a weight loss program |
| ||
| Time spent on studying for ESMA 3101 during last week |
|
Problem 2 1) Ask the question "Do you agree that Coca-Cola is much better than any of these other soft drinks?"
2) Wait outside a supermarket and only ask those customers who have a coke bottle in their shopping cart.
3) Do a survey with 10 customers and keep repeating it until six say they prefer coke over any other soft drink.
= ∑x/n = 65/6 = 10.83,
Sxx = (∑x2-(∑x)2/n) = 769-652/6 = 64.8
so s = √Sxx/(n-1) = √64.8/5 = 3.60
b)
| Data Set | Mean | Stand. Dev. | Mean | Stand. Dev. |
| Traffic | 128385 | 7259 | 126550 | 7664 |
| Shoppers | 4971 | 1166 | 5017 | 1173 |
| Accidents | 7.50 | 3.33 | 10.83 | 3.6 |
b) We have Min=0.0 and Max=131.4
Median: The number in the middle is 5.6 (Argentina), so Median=5.6
Q1: Q1=P25, so np/100=45*25/100=11.25 ~ 12, the 12th observation is 1.5 (Chile), so Q1=14
Q3: Q3=P75, so np/100=45*75/100=33.75 ~ 34, the 34th observation is 10.9 (Saint Kitts & Nevis), so Q3=10.9
and we have the 5-number summary:
c) The countries with the five highest AIDS rates are outliers (Guadaloupe, Barbados, French Guiana, Bermuda, Bahamas)
d) Mean, because the countries with the highest AIDS rates have to influence our "average", and they don't if we use the median.
b)
a) n=24, ∑x = 632, ∑x2 = 21524
so
= ∑x/n = 632/24 = 26.33,
Sxx = (∑x2-(∑x)2/n) = 21524-6322/24 = 4881.3
so s = √Sxx/(n-1) = √4881.3/23 = 14.57
c) The numbers in the 5 number summary are: Min, Q1, Median, Q3 and Max. For all those numbers we need the data to be ordered:
3 5 8 11 13 14 19 20 22 22 23 26 26 27 27 28 29 29 35 43 43 47 55 57.
So we have Min=3 and Max=57
Median: The two numbers in the middle are 26 and 26, so Median =(26+26)/2 = 26
Q1: Q1=P25, so np/100=24*25/100=6, the 6th observation is 14, so Q1=14
Q3: Q3=P75, so np/100=24*75/100=18, the 18th observation is 29, so Q3=29
and we have the 5-number summary:

For the boxplot we need IQR=Q3-Q1=29-14=15
LF=Q1-1.5IQR = 14-1.5×15=-8.5<3, so the left line goes to 3
RF=Q3+1.5IQR = 29+1.5×15=51.5<57, so the right line goes to 51.5
±s = 15.344±0.041 = 15.303, 15.385
±2s = 15.344±2·0.041 = 15.262, 15.426
±3s = 15.344±3·0.041 = 15.221, 15.467
The last interval is supposed to include "almost all" the observations (or lengths of the rods), so we can conclude that "almost all" the rods have a length at least 15.221cm and at most 15.467cm, which is in accordance with the contract. So XYZ should accept the shipment.
Select rogaine1.mtw (raw data)
Graph > Bar Chart > Counts of unique value, Cluster, Categorical Variables: Growth Group, Bar Chart Options > Show Y as Percent, Within categories of level 1, hit enter
but it has one major problem. The variable Growth has values that have an ordering (No Growth - Dens Growth) and the graph should reflect that ordering. We can do this as follows: in the worksheet select the variable by clicking on C1-T, Editor > Column > Value Order, User-specified order. The ordering that appears in the box is already ok, so hit enter. Now redraw the graph as described above: