A6 P2#
A generic brand of Ibuprofen is compared with the brand name Advil. 20 healthy male subjects are recruited to take the generic Ibuprofen and, once the drug has disappeared from the blood, take Advil. The blood absorption levels for each drug are recorded and summarized in the drugs.csv. We want to investigate if the blood absorption of the generic brand is, on average, is significantly different from that of Advil at a significance level of \(\alpha = 0.05\).
Summary Statistics:
Sample Size |
Sample Mean |
Sample Standard Deviation |
---|---|---|
20.0000 |
2006.1348 |
632.3612 |
20.0000 |
1888.6328 |
863.6503 |
Part 1#
You can read the data into R using the following code:
dat <- read.csv("drugs.csv")
x <- dat$generic
y <- dat$advil
boxplot(x,y)
Based solely on the side-by-side box plot, would you guess there will be a statistically significant result?
Answer Section#
No, there won’t be a statistically significant result.
Yes, there will be a statistically significant result.
Part 2#
What is the appropriate test of these data?
Answer Section#
Paired t-test
One Sample t-test
Pooled t-test
Welch Procedure
Part 3#
What is the null hypothesis for a two-sided hypothesis test in this context (in words)?
Answer Section#
The average blood absorption levels of the generic drug is the same as Advil’s.
The average blood absorption levels of the generic drug is greater than Advil’s.
The average blood absorption levels of the generic drug is less than Advil’s.
The average blood absorption levels of the generic drug and Advil is equal to 0.
Part 4#
What is the alternative hypothesis for a two-sided hypothesis test in this context? Let \(\mu\_{diff} = \mu_G - \mu_A\) where \(\mu_G\) is the average blood absorption of the generic brand of Ibuprofen and \(\mu_A\) is the average blood absorption of Advil.
Answer Section#
No, there won’t be a statistically significant result.
Yes, there will be a statistically significant result.
Part 5#
What is the value of pooled variance?
Answer Section#
Enter in a numeric value.
Part 6#
What is the value of the test statistic?
Answer Section#
Enter in a numeric value.
Part 7#
What is the critical value? That is, provide the (positive) value of \(t\_{crit}\) such that \(P(-t\_{crit}\) < \(T\) < \(t\_{crit}) = 1 - \alpha\) (be sure you can find this both in R and using the \(t\)-tables).
Answer Section#
Enter in a numeric value.
Part 8#
What is the appropriate test of these data?
Answer Section#
qt(0.025, df = 19, lower.tail = FALSE)
qt(0.05, df = 19)
qt(0.05, df = 19, lower.tail = FALSE)
qt(0.025, df = 39, lower.tail = FALSE)
pt(0.025, df = 19, lower.tail = FALSE)
Part 9#
What is the p-value?
Answer Section#
Enter in a numeric value.
Part 10#
How would this p-value be expressed based solely on the t-tables?
Answer Section#
p-value < 0.005
0.005 < p-value < 0.01
0.01 < p-value < 0.025
0.025 < p-value < 0.05
0.05 < p-value
Part 11#
What is the generic conclusion of the tests (select the most correct)? If we fail to reject the null hypothesis:
Answer Section#
We fail to reject the null at a significance level of 0.05.
We prove the null
We reject the alternative in favour of the null
We prove the null is true
Part 12#
What is the generic conclusion of the tests (select the most correct)? If we fail to reject the null hypothesis:
Answer Section#
We reject the null in favour of the alternative.
We reject the null
We reject the null in favour of the alternative at a significance level of 0.05.
We prove the alternative is true
Part 13#
Which of the following R code would conduct the appropriate test as found above? (All the R variables have been defined above).
Answer Section#
t.test(x, y)
t.test(x, y, var.equal = TRUE)
t.test(x, y, paired = TRUE)
t.test(mean(x), mean(y), paired = TRUE)
Part 14#
Using the output from the previous question, state the lower bound of the confidence interval for the mean difference.
Answer Section#
Enter in a numeric value.
Part 15#
State the upper bound of the confidence interval for the mean difference from the previous question.
Answer Section#
Enter in a numeric value.
Part 16#
Now suppose that the measurements in our csv file are not repeated measurements on the same individual. That is the 20 men in the generic Ibuprofen group are independent of the 20 men in the Advil group. What would be the most appropriate test in this case? Hint: your box plot will be a useful reference to help make this decision.
Answer Section#
t.test(x, y)
t.test(x, y, var.equal = TRUE)
t.test(x, y, paired = TRUE)
t.test(mean(x), mean(y), paired = TRUE)
Attribution#
Problem is licensed under the CC-BY-NC-SA 4.0 license.