


Understanding p-values using an example










5












$begingroup$


Definition of p-values: A p-value is a probability that provides a measure of the evidence against the null hypothesis provided by the sample. Smaller p-values indicate more evidence against the null hypothesis. Can someone please explain this in simpler terms?

I know there are already many questions about interpreting p-values, but I would like to ask mine in a very limited form, using a specific example:




A manufacturing company fills cans with a mean weight of 3 pounds; the level of significance is taken to be 0.01.

H(0) : u >= 3 -- Null hypothesis
H(a) : u < 3 -- Alternative hypothesis

We are trying to perform a one-tailed test for the case where the population standard deviation is known. For a sample mean of 2.92 and a standard error of 0.03, the z-score is -2.67, which gives a probability (p-value) of 0.0038, or 0.38%, that the sample mean would be equal to or lower than 2.92.

Since the probability of getting a sample mean equal to or less than 2.92 is 0.38%, which is very small, doesn't that mean we should accept the null hypothesis? After all, the chance of getting a mean of 2.92 from a sample is only 0.38%.



Or am I completely missing something here?




Edit - It has been three days since I started trying to understand hypothesis testing and I think I am almost there. I will try to articulate what I have understood so far; please let me know if there are still any gaps in my understanding.

p-values measure the probability of obtaining a sample mean at least as extreme as the one we obtained, given that the null hypothesis is true. So for the example I mentioned, the probability of obtaining a sample mean of 2.92 or lower is 0.0038 if the population mean is 3 (as assumed by the null hypothesis).

Now there could be two reasons for obtaining a mean of 2.92:

  1. The assumed population mean (i.e., the null hypothesis) is not correct, or

  2. the population mean is 3, but due to sampling error / an unlikely sample we got a mean of 2.92.

Now if we select statement 1, we run the risk of making a type I error, and this is where the level of significance comes into play. Using the level of significance, we decide whether or not to reject the null hypothesis.





















$endgroup$











  • $begingroup$
    You say the population standard deviation $\sigma$ is known. Can you provide the known value? The terminology "correction factor" is not familiar to me; can you give a formula for finding that? // The sample mean $\bar X = 2.92$ is below the hypothetical population mean $\mu_0 = 3.$ The issue is whether it is enough smaller to warrant rejecting the null hypothesis.
    $endgroup$
    – BruceET
    Apr 20 at 13:20











  • $begingroup$
    The population standard deviation is 0.18 and the sample size is 36, hence the correction factor is 0.18/sqrt(36), which equals 0.03.
    $endgroup$
    – Rohit Saluja
    Apr 20 at 13:24






  • 2




    $begingroup$
    Thanks for the additional information. The usual terminology is to call $\sigma/\sqrt{n}$ the 'standard error'.
    $endgroup$
    – BruceET
    Apr 20 at 13:27










  • $begingroup$
    @BruceET - The issue is whether it is enough smaller to reject the null hypothesis. However, the probability of the sample mean being less than or equal to 2.92 is only 0.0038, so can't we say that the probability of a sample mean less than 3 is very small, and hence that we support the null hypothesis?
    $endgroup$
    – Rohit Saluja
    Apr 20 at 13:39

















hypothesis-testing statistical-significance p-value














edited Apr 22 at 14:34









gung











asked Apr 20 at 12:34









Rohit Saluja





















4 Answers


















1












$begingroup$

Imagine you could measure the weight of all cans that the manufacturing company has ever made, and the mean turned out to be $2.87$ pounds. Then imagine you take 10 cans at random and see how much they weigh. You are unlikely to get the exact mean of all cans ($2.87$ pounds); you might end up with a mean of $2.65$, for example. If you did that again and again - taking 10 cans and measuring their weight - you would get a distribution of means. The best guess about the true mean is the mean of the distribution you obtained. Extreme values like $1.9$ or $3.5$ pounds will be unlikely, and even more extreme values will be even more unlikely.

Doing a significance test usually means that you look at how likely the mean you observed is if you assume that your sample was drawn from a population with mean zero. If the mean that you observed is very unlikely, you would decide to discard the null hypothesis. The only difference between what I have said so far and your example is that your null hypothesis assumes a mean of $\ge 3$. So the $0.38\%$ you report says that getting a mean of $2.92$ or lower from a population with a mean of $\ge 3$ is so unlikely that you would discard the null hypothesis and accept the alternative hypothesis, which is $<3$. Your evidence indicates that the cans weigh less than $3$ pounds.

So it is the opposite of what you suggest: a $p$ of $0.38\%$ doesn't mean you have to keep the null hypothesis because your result is so unlikely. It means that you can discard the null hypothesis, because your data are very unlikely to be a randomly drawn sample from a population with a mean of $3$ (i.e., your data would be very unlikely if the null hypothesis were true).
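The "again and again" idea can be tried out in a small simulation. This is only a sketch: it assumes the can weights are normally distributed, and it uses the question's numbers (population standard deviation 0.18, samples of 36 cans) with the null value 3 as the true mean. The seed and the number of repetitions are arbitrary choices of mine.

```python
import random
from statistics import fmean

random.seed(0)  # fixed seed so the run is repeatable

mu0, sigma, n = 3.0, 0.18, 36   # mean under H0, known sd, cans per sample
observed_mean = 2.92            # sample mean reported in the question
n_repeats = 20_000              # number of simulated samples (arbitrary)

# Draw many samples from a population in which H0 is exactly true (mean 3)
# and record the mean of each sample.
sample_means = [
    fmean(random.gauss(mu0, sigma) for _ in range(n))
    for _ in range(n_repeats)
]

# Share of simulated sample means that are at least as low as the observed one.
fraction_as_low = sum(m <= observed_mean for m in sample_means) / n_repeats
print(f"share of sample means <= {observed_mean}: {fraction_as_low:.4f}")
```

The printed share should come out close to the 0.38% from the question: when the true mean really is 3, a sample mean as low as 2.92 turns up only a few times per thousand samples.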

















$endgroup$












  • $begingroup$
    Comments are not for extended discussion; this conversation has been moved to chat.
    $endgroup$
    – gung
    Apr 23 at 14:51


















3












$begingroup$

Here is a figure that shows your problem on two scales: at left is the original scale in terms of pounds; at right is the standard or z-scale often used in testing.



[Figure: the test shown on the original pound scale (left panel) and on the standard z-scale (right panel).]



To begin, let's look at your problem in terms of the fixed significance level $\alpha = 0.01 = 1\%.$ In the right-hand panel, your $Z$-score is shown by the heavy vertical bar at $-2.67.$ The "critical value" for a test at the 1% level is shown by the vertical dotted line at $-2.326,$ which cuts 1% of the probability from the lower tail of the standard normal distribution.

Because the $Z$-score is to the left of the critical value, one rejects the null hypothesis at level $\alpha = 1\%.$ The P-value is the probability under the standard normal curve to the left of the heavy blue line. That area is smaller than $1\%,$ so in terms of P-values, we reject $H_0$ because the P-value is smaller than $1\%.$
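For anyone who wants to check these numbers without a printed table, here is a minimal sketch using `NormalDist` from Python's standard library (Python 3.8 or later). The variable names are mine; the inputs are the values given in the question and its comments.

```python
from statistics import NormalDist

mu0 = 3.0      # hypothesized mean under H0 (pounds)
sigma = 0.18   # known population standard deviation
n = 36         # sample size
xbar = 2.92    # observed sample mean
alpha = 0.01   # significance level

se = sigma / n ** 0.5                # standard error, 0.18 / 6 = 0.03
z = (xbar - mu0) / se                # Z-score of the observed mean
std_normal = NormalDist()            # standard normal: mean 0, sd 1
p_value = std_normal.cdf(z)          # area to the left of z (lower tail)
z_crit = std_normal.inv_cdf(alpha)   # value that cuts off the lowest 1%

reject = p_value < alpha
print(f"z = {z:.2f}, p = {p_value:.4f}, critical z = {z_crit:.3f}, reject H0: {reject}")
```

Comparing `z` with `z_crit` and comparing `p_value` with `alpha` always lead to the same decision; they are two views of the same lower-tail area.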



You can see that the left-hand plot is the same as the right-hand plot, except for scale. It is not possible to make a printed normal table for all possible normal distributions.
By converting to $Z$-scores we can always use a single printed table for the 'standard' normal distribution, which has mean 0 and standard deviation 1.



If we were going to do this production-monitoring procedure repeatedly with $n = 36$ observations each time, then we could find the critical value on the 'pound' scale; it is at about 2.930 pounds. (That's because $3 - 2.326 \times 0.03 \approx 2.930,$ where $0.03 = 0.18/\sqrt{36}$ is the standard error.) Then we could turn the testing job over to a non-statistician, with instructions: "If the average weight for 36 cans is less than 2.930 pounds, let me know because we aren't putting enough stuff in our cans." (Or if we can't even trust the non-statistician with averages, the criterion might be a total weight less than about 105.5 pounds.)
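A sketch of how the pound-scale cutoff could be computed, taking the standard error to be 0.18/√36 = 0.03 as stated in the comments under the question. The names `critical_mean` and `critical_total` are just illustrative.

```python
from statistics import NormalDist

mu0, sigma, n, alpha = 3.0, 0.18, 36, 0.01

se = sigma / n ** 0.5                  # standard error: 0.18 / 6 = 0.03 pounds
z_crit = NormalDist().inv_cdf(alpha)   # lower 1% point of the standard normal
critical_mean = mu0 + z_crit * se      # cutoff for the average of 36 cans
critical_total = n * critical_mean     # the same cutoff as a total weight

print(f"flag the batch if the mean is below {critical_mean:.3f} lb")
print(f"or if the total is below {critical_total:.2f} lb")
```

With these inputs the cutoff for the mean is roughly 2.930 pounds, so the observed mean of 2.92 falls just below it, which agrees with rejecting on the z-scale.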

















$endgroup$




















    2












    $begingroup$

    Since your question is actually quite precise, I would like to keep it rather concise.



    Definition of p-value: the p-value is the probability of the data (or even more extreme data) given that the null hypothesis is actually true.

    If this probability is high, then there is no reason to reject the null hypothesis: the data are perfectly in line with the null hypothesis. If the p-value is small, then the data seem implausible given the null hypothesis. The more implausible the data, the stronger our evidence against the null.

    A level of significance of 0.01 means: to reject the null hypothesis, the probability of the data must be less than 1%. If the null hypothesis is actually true, we therefore have a 1% chance of seeing data so implausible that we would wrongly reject the null hypothesis.

    Regarding your example: there is only a 0.38% chance of seeing data like this (or more extreme) if the null hypothesis is true, which is below our threshold of significance. Hence, the data seem very unlikely under the null, and we conclude that we don't believe in the null hypothesis anymore.
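The "1% chance of wrongly rejecting" can be illustrated with a rough simulation. This is a sketch under assumptions of my own: normally distributed can weights, a true mean of exactly 3 (so the null hypothesis holds in every simulated experiment), and an arbitrary seed and number of experiments. The helper `lower_tail_p_value` is a name I made up.

```python
import random
from statistics import NormalDist, fmean

random.seed(1)  # fixed seed so the run is repeatable

mu0, sigma, n, alpha = 3.0, 0.18, 36, 0.01
se = sigma / n ** 0.5
std_normal = NormalDist()

def lower_tail_p_value(sample_mean):
    """P(sample mean <= observed value) if the true mean is mu0."""
    return std_normal.cdf((sample_mean - mu0) / se)

n_experiments = 20_000
false_rejections = 0
for _ in range(n_experiments):
    # The null hypothesis is true here: every can comes from a mean-3 population.
    sample_mean = fmean(random.gauss(mu0, sigma) for _ in range(n))
    if lower_tail_p_value(sample_mean) < alpha:
        false_rejections += 1

type_i_rate = false_rejections / n_experiments
print(f"rejected a true null in {type_i_rate:.2%} of experiments")
```

The rejection rate should land near 1%, which is what choosing a significance level of 0.01 means in practice.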















    $endgroup$




















      0












      $begingroup$

      Assume the significance level is $\alpha$; when testing a null hypothesis, we usually use 5%, 1%, and so on.

      In simple terms: the p-value is the smallest $\alpha$ at which we reject the null hypothesis.

      So, when your p-value is 0.15, we do not reject the null hypothesis when $\alpha$ is 5% (that is, at a confidence level of 95%). But lower the confidence level to 60% ($\alpha$ = 40%) and you reject the null hypothesis. Similarly, a p-value of 0.0038 means you fail to reject the null hypothesis for any $\alpha$ smaller than 0.38%. That's why you compare the p-value with $\alpha$: if the p-value < $\alpha$, you reject the null hypothesis.
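To make the "smallest $\alpha$" reading concrete, here is a small sketch that applies the rule "reject when the p-value < $\alpha$" to the p-value from the question. The list of significance levels is an arbitrary choice for illustration.

```python
p_value = 0.0038  # p-value from the can-filling example

# A few significance levels to try (chosen only for illustration)
alphas = [0.10, 0.05, 0.01, 0.005, 0.003, 0.001]

# Same decision rule for every level: reject H0 when p_value < alpha
decisions = {alpha: p_value < alpha for alpha in alphas}

for alpha, reject in decisions.items():
    verdict = "reject H0" if reject else "fail to reject H0"
    print(f"alpha = {alpha:<6} -> {verdict}")
```

Every level above 0.0038 leads to rejection and every level below it does not, so the p-value marks the boundary between the two decisions.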















      $endgroup$














        1












        $begingroup$

        Imagine you could measure the weight of all cans that the manufacturing company has ever made and the mean would be $2.87$ pounds. Then imagine you would take 10 cans randomly and see how much they weight. It is unlikely to get the exact mean of all cans ($2.87$ pounds), hence you end up with a mean of $2.65$, for example. If you would do that again and again - taking 10 cans and measuring the weight - you would get a distribution of means. The best guess about the true mean is the mean of the distribution you obtained. Extreme values like $1.9$ or $3.5$ pounds will be unlikely and even more extreme values will be even more unlikely.



        Doing significance tests usually means that you look how likely the mean you observed is if you assume that your sample was drawn from a population with mean zero. If the mean that you observed is very unlikely you would decide to discard the null hypothesis. The only difference between what I have said so far and your example is that you assume the null hypothesis a mean of $ge 3$. So the $0.38%$ you report say that the probability of getting your mean of $2.92$ from a population with a mean of $ge 3$ is so unlikely that you would discard the null hypothesis and accept the alternative hypothesis which is $<3$. Your evidence indicate that the cans weight less than $3$ pounds.



        This means it is the opposite: having a $p$ of $0.38%$ as you report doesn't mean you have to keep the null hypothesis because your result is so unlikely but it means that you can discard the null hypothesis because your data was very unlikely a randomly drawn sample from a population with a mean of $3$ (i.e., your data would be very unlikely given that the null hypothesis is true).






        share|cite|improve this answer











        $endgroup$












        • $begingroup$
          Comments are not for extended discussion; this conversation has been moved to chat.
          $endgroup$
          – gung
          Apr 23 at 14:51















        1












        $begingroup$

        Imagine you could measure the weight of all cans that the manufacturing company has ever made and the mean would be $2.87$ pounds. Then imagine you would take 10 cans randomly and see how much they weight. It is unlikely to get the exact mean of all cans ($2.87$ pounds), hence you end up with a mean of $2.65$, for example. If you would do that again and again - taking 10 cans and measuring the weight - you would get a distribution of means. The best guess about the true mean is the mean of the distribution you obtained. Extreme values like $1.9$ or $3.5$ pounds will be unlikely and even more extreme values will be even more unlikely.



        Doing significance tests usually means that you look how likely the mean you observed is if you assume that your sample was drawn from a population with mean zero. If the mean that you observed is very unlikely you would decide to discard the null hypothesis. The only difference between what I have said so far and your example is that you assume the null hypothesis a mean of $ge 3$. So the $0.38%$ you report say that the probability of getting your mean of $2.92$ from a population with a mean of $ge 3$ is so unlikely that you would discard the null hypothesis and accept the alternative hypothesis which is $<3$. Your evidence indicate that the cans weight less than $3$ pounds.



        This means it is the opposite: having a $p$ of $0.38%$ as you report doesn't mean you have to keep the null hypothesis because your result is so unlikely but it means that you can discard the null hypothesis because your data was very unlikely a randomly drawn sample from a population with a mean of $3$ (i.e., your data would be very unlikely given that the null hypothesis is true).

















        $endgroup$












        edited Apr 23 at 14:51 by gung
        answered Apr 20 at 13:44 by stats.and.r
























        3












        $begingroup$

        Here is a figure that shows your problem on two scales:
        at left is the original scale in terms of pounds; at right
        is the standard or z-scale often used in testing.



        [Figure: the test shown on the pound scale (left) and on the z-scale (right).]



        To begin, let's look at your problem in terms of the fixed significance level $\alpha = 0.01 = 1\%.$ In the right-hand
        panel, your $Z$-score is shown by the heavy vertical bar at
        $-2.67.$ The "critical value" for a test at the 1% level is
        shown by the vertical dotted line at $-2.326,$ which cuts
        1% of the probability from the lower tail of the standard normal
        distribution.



        Because the $Z$-score is to the left of the
        critical value, one rejects the null hypothesis at level $\alpha = 1\%.$ The P-value is the area under the standard normal curve to the left of the heavy blue line. That area is smaller than $1\%,$ so, in terms of P-values, we reject $H_0$ because the P-value is smaller than $1\%.$
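The two equivalent decision rules (compare the $Z$-score with the critical value, or compare the P-value with $\alpha$) can be checked numerically. The sketch below uses Python's standard-library `statistics.NormalDist`; the variable names are chosen here for illustration, and the $Z$-score of $-2.67$ and $\alpha = 0.01$ are taken from the text above.

```python
from statistics import NormalDist

std_normal = NormalDist()  # standard normal: mean 0, standard deviation 1

alpha = 0.01        # fixed significance level
z_observed = -2.67  # Z-score reported in the question

# Critical value: the point that cuts 1% from the lower tail.
z_critical = std_normal.inv_cdf(alpha)

# P-value: area under the curve to the left of the observed Z-score.
p_value = std_normal.cdf(z_observed)

reject_by_critical_value = z_observed < z_critical
reject_by_p_value = p_value < alpha

print(f"critical value: {z_critical:.3f}")
print(f"P-value:        {p_value:.4f}")
print(f"reject H0 (critical value rule): {reject_by_critical_value}")
print(f"reject H0 (P-value rule):        {reject_by_p_value}")
```

Both rules give the same answer because the normal CDF is increasing: the observed $Z$ lies left of the critical value exactly when the area to its left is smaller than $\alpha$.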



        You can see that the left-hand plot is the same as the right-hand plot, except for scale. It is not possible to make a printed normal table for all possible normal distributions.
        By converting to $Z$-scores we can always use a single printed table for the 'standard' normal distribution, which has mean 0 and standard deviation 1.



        If we were going to do this production-monitoring procedure repeatedly with $n = 36$ observations each time, then we could find the critical value on the 'pound' scale; it is
        at 2.581 pounds. (That's because $(2.581 - 3)/0.18 = -2.326,$ up to rounding, where the
        $0.18$ is the standard error.) Then we could turn the testing
        job over to a non-statistician, with instructions: "If the average weight for 36 cans is less than 2.581 pounds, let me know because we aren't putting enough stuff in our cans." (Or if we can't even trust the non-statistician with averages, the criterion might be a total weight less than 92.92 pounds.)
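The conversion of the critical value from the z-scale to the pound scale can be sketched as follows. The numbers ($\mu_0 = 3$, standard error $0.18$ as stated in this answer, $n = 36$, $\alpha = 0.01$) are taken from the text; the variable names are illustrative.

```python
from statistics import NormalDist

mu0 = 3.0     # mean under the null hypothesis, in pounds
se = 0.18     # standard error as stated in the answer above
n = 36        # cans per inspection
alpha = 0.01  # significance level

# Route 1: find the cutoff on the z-scale, then rescale to pounds.
z_critical = NormalDist().inv_cdf(alpha)
mean_cutoff = mu0 + z_critical * se

# Route 2: ask the normal distribution on the pound scale directly.
mean_cutoff_direct = NormalDist(mu=mu0, sigma=se).inv_cdf(alpha)

# Equivalent rule stated in terms of the total weight of the 36 cans.
total_cutoff = n * mean_cutoff

print(f"cutoff for the average weight: {mean_cutoff:.3f} pounds")
print(f"cutoff for the total weight:   {total_cutoff:.2f} pounds")
```

Both routes give the same cutoff, which is the sense in which the two panels of the figure differ only in scale.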

















        $endgroup$

















            edited Apr 20 at 15:23

























            answered Apr 20 at 14:31 by BruceET





















                2












                $begingroup$

                Since your question is actually quite precise, I would like to keep it rather concise.



                Definition of p-value: the p-value is the probability of the data (or even more extreme data) given that the null hypothesis is actually true.



                If this probability is high, then there is no reason why we should reject the null hypothesis: the data is perfectly in line with the null hypothesis. If the p-value is small, then the data seems implausible given the null hypothesis. The more implausible the data, the stronger our evidence against the null.



                A significance level of 0.01 means: to reject the null hypothesis, the p-value must be less than 1%. If the null hypothesis is actually true, we therefore have a 1% chance of seeing data so implausible that we would wrongly reject the null hypothesis.
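The claim that a true null hypothesis is wrongly rejected about 1% of the time can be checked with a small simulation. This is a sketch under stated assumptions: a lower-tailed z-test with known standard deviation, data drawn from a normal population whose mean sits exactly at the null value of 3, and an illustrative standard deviation of 0.18 with 36 cans per sample. The function name is hypothetical.

```python
import random
from statistics import NormalDist, fmean


def false_rejection_rate(mu0, sigma, n, alpha, repeats, seed=1):
    """Share of samples, drawn with H0 true, whose p-value falls below alpha."""
    rng = random.Random(seed)
    std_normal = NormalDist()
    se = sigma / n ** 0.5  # standard error of the sample mean
    rejections = 0
    for _ in range(repeats):
        sample_mean = fmean(rng.gauss(mu0, sigma) for _ in range(n))
        # Lower-tailed test: small sample means count as evidence against H0.
        p_value = std_normal.cdf((sample_mean - mu0) / se)
        if p_value < alpha:
            rejections += 1
    return rejections / repeats


# sigma=0.18 and n=36 are illustrative settings, not a claim about the real data.
rate = false_rejection_rate(mu0=3.0, sigma=0.18, n=36, alpha=0.01, repeats=20_000)
print(f"share of wrong rejections with H0 true: {rate:.4f}")
```

The printed share should land near 0.01, which is exactly what the significance level promises.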



                Regarding your example: there is only a 0.38% chance of seeing data like this (or more extreme) if the null hypothesis is true, which is below our significance threshold. Hence the data seem very unlikely under the null hypothesis, and we conclude that we no longer believe it.















                $endgroup$

















                    answered Apr 20 at 16:32 by LuckyPal





















                        0












                        $begingroup$

                        Assume the significance level is $\alpha$; when testing a null hypothesis, we usually choose 5% or 1% and so on.



                        In simple terms: the p-value is the smallest $\alpha$ at which we reject the null hypothesis.



                        So, when your p-value is 0.15, we do not reject the null hypothesis when $\alpha$ is 5% (or, for a one-sided test, our two-sided confidence interval is 90%). But lower that to a confidence interval of only 60% (that is, $\alpha$ = 20%) and you reject your null hypothesis. Similarly, when your p-value = 0.0038, you fail to reject the null hypothesis only for values of $\alpha$ smaller than 0.38%. That's why you compare the p-value with $\alpha$: if p-value < $\alpha$, you say that you cannot accept the null hypothesis.
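The comparison rule can be written out as a tiny helper. The function name `decide` and the particular list of $\alpha$ values are chosen here for illustration; the two p-values are the ones discussed above.

```python
def decide(p_value, alpha):
    """Apply the usual rule: reject H0 exactly when the p-value is below alpha."""
    return "reject H0" if p_value < alpha else "do not reject H0"


# Walk both p-values from the text through several significance levels.
for p_value in (0.15, 0.0038):
    for alpha in (0.20, 0.05, 0.01, 0.001):
        print(f"p = {p_value:<6} alpha = {alpha:<5} -> {decide(p_value, alpha)}")
```

For each p-value the decision flips at the point where $\alpha$ crosses the p-value, which is what "the smallest $\alpha$ at which we reject" means.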















                        $endgroup$

















                            answered Apr 20 at 21:54 by EhsanK


























