Go Back   Rhinocerus > Newsgroup > Newsgroup comp.soft-sys.sas

Reply
 
Thread Tools Display Modes
  #1 (permalink)  
Old 09-10-2008, 05:49 PM
Bross, Dean S
Guest
 
Posts: n/a
Default Re: Fisher exact test in PROC FREQ

I guess I just used the term marginals the way I'm used to using it.

Marginals is the term I use, and many people use, to mean the row
and column totals. These totals are usually printed in the margins
of the table.

The commonly used chi square test for a 2x2 table is conditional on
the observed marginals totals.

When I use the term: conditional on the total numbers of successes
another way of expressing it might be: given that the number of
successes is restricted to the number that was actually observed.
Both of these expressions indicate that I am referring to conditional
probability.

So a conditional test uses a conditional probability distribution to
evaluate the degree to which the data deviates from the null hypothesis.
But it is necessary to have some background in probability theory
to go into a further discussion that relies on using conditional
probability.

Dean Bross


-----Original Message-----
From: owner-sas-l@listserv.uga.edu [mailtowner-sas-l@listserv.uga.edu]
On Behalf Of RolandRB
Sent: Wednesday, September 10, 2008 1:24 PM
To: sas-l@uga.edu
Subject: Re: Fisher exact test in PROC FREQ

On Sep 10, 4:05 pm, dean.br...@VA.GOV ("Bross, Dean S") wrote:
> I'll try to add a little more insight into this question about
> fixed marginals.


I very much appreciate your efforts.

> The real answer to your question that the number of responders in a
> clinical trial is not a fixed number is to bring in the concept
> of a conditional test.
>
> In other words, when we test the null hypothesis in a 2x2 table,
> we might not choose to answer the question using the interpretation
> of a p value as: What is the probability that you would observe
> data the extreme or more extremely different from the null hypothesis,
> allowing for any possible set of outcomes of the clinical trial.
>
> It generally makes more sense to answer the question as:
> Given that we had this number of responders in the complete
> trial, what is the probability that we would have gotten this
> much or more divergence from the null hypothesis.


So is this a conditional test or an unconditional test if we calculate
based on the number of responders we got?

> In other words, the probabilities are calculated conditional
> on the total number of successes. And this is the same
> calculation that you would do if you assumed the marginals
> were fixed.


Here I am getting confused. When you say "the probabilities are
calculated conditional on the total number of successes" is this for a
conditional test or an unconditional test (or doesn't that question
make sense)?

Does that mean that the population in each treatment was a certain
number and the number of treatment successes was a certain number and
these are the numbers we got and are going to use so these are the
fixed marginals? I can understand "fixed numbers" but why "marginals"?
What does "marginals" mean in this regard?

> So it is actually a conscious decision to use a conditional test
> or an unconditional test.


Are the Chi-square and Fisher exact test conditional or unconditional?
The above has left me a bit confused.

> Now we have to descend deeper and deeper into statistical theory
> to discuss this choice of tests in full, but in this circumstance,
> the number of successes in the entire trial is something called
> an ancillary statistic.
>
> NOW QUOTING:
>
> From Wikipedia, the free encyclopedia
>
> In statistics, an ancillary statistic is a statistic whose probability
> distribution does not depend on which of the probability distributions
> among those being considered is the distribution of the statistical
> population from which the data were taken. This concept was introduced
> by the great statistical geneticist Sir Ronald Fisher.
>
> Unfortunately, it is a hard concept to fully understand.


I'm not sure I have grasped it. The mean of a set of values would be
independent of the distribution of the statistical population (I
assume) but I'm not sure about the distribution of the mean so I guess
this is not an ancillary statistic.

> But it does appear that the best way to make inferences (such as
> making estimates or testing hypotheses) is to evaluate the data
> conditional
> on the ancillary statistic.
>
> Now the 2x2 table has been studied for how many years by

statisticians,
> and the question of whether to use a conditional or unconditional test
> has been asked, and re-asked and re-re-asked. My assessment is that
> we always come back to the conditional tests used by Pearson
> (chi-square)
> and Fisher (Fisher's exact test)


I see you state that these are conditional tests. Conditional on what?

> To me, a very compelling argument that would lead to the Fisher exact
> test is based on the randomization test. This is actually the way
> Fisher
> looked at this type of data.
>
> I'll try to phrase the argument in very few words, but it might not be
> exactly correct.
>
> I'll return to the example data I used before, where there were two
> samples
> which might represent two different treatments. Each sample had 20
> cases.
> In sample 1 there were 6 successes and in sample 2 there were 2
> successes.
>
> The argument relies heavily on the notion that we randomly allocated

the
> 40 subjects to treatments 1 and 2. Now suppose the null hypothesis

that
> both treatments are exactly the same is absolutely true. Whether we
> gave
> a subject treatment one or treatment two would make no difference.
>
> Then in a sense, the results that each subject obtains is simply fixed
> in advance, even if we did not know what their results were going to

be.
> A subject that succeeded would have gotten this success no matter

which
> treatment they were assigned to, and those who failed would have

failed
> no matter which treatment they had been assigned to.
>
> So here is the key idea. The only reason there were more successes in
> sample one is purely due to the randomization placing more of the
> preordained successes in this sample. So to evaluate whether or not
> this randomization is a reasonable explanation for the observed

results
> we look at the probability that you would get this much difference in
> the success rates if the subjects were assigned purely at random. And
> this leads you to the calculation of the Fisher exact test.
>
> Does this explanation help clarify things


I'll have to read it over a few times. I am confused about what are
conditional tests and why. Many thanks for your explanation.
Reply With Quote
Alt Today
Advertising
 
and become member of Rhinocerus
Standard Sponsored Links

  #2 (permalink)  
Old 09-10-2008, 06:34 PM
RolandRB
Guest
 
Posts: n/a
Default Re: Fisher exact test in PROC FREQ

On Sep 10, 7:49*pm, dean.br...@VA.GOV ("Bross, Dean S") wrote:
> I guess I just used the term marginals the way I'm used to using it.
>
> Marginals is the term I use, and many people use, to mean the row
> and column totals. *These totals are usually printed in the margins
> of the table.
>
> The commonly used chi square test for a 2x2 table is conditional on
> the observed marginals totals.
>
> When I use the term: conditional on the total numbers of successes
> another way of expressing it might be: given that the number of
> successes is restricted to the number that was actually observed.
> Both of these expressions indicate that I am referring to conditional
> probability.
>
> So a conditional test uses a conditional probability distribution to
> evaluate the degree to which the data deviates from the null hypothesis.
> But it is necessary to have some background in probability theory
> to go into a further discussion that relies on using conditional
> probability.
>
> Dean Bross


Thanks. I understand that part of it now.
Reply With Quote
 
Reply

Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are Off
Pingbacks are Off
Refbacks are Off


Similar Threads
Thread Thread Starter Forum Replies Last Post
SUMMARY Difference between PROC SORT NODUPKEY and PROC SQL? Sally Muller Newsgroup comp.soft-sys.sas 0 03-03-2009 03:09 PM
Re: Processing Dozens of Vars in Proc Freq Paul Dorfman Newsgroup comp.soft-sys.sas 0 07-13-2007 05:41 PM
Re: PROC FREQ--DATA STEP--MODELING QUESTION nospam@HOWLES.COM (Howard Schreier Newsgroup comp.soft-sys.sas 0 06-07-2007 02:04 AM
Monte Carlo Estimation Fisher Exact test in PROC FREQ David Calbet Newsgroup comp.soft-sys.sas 1 02-12-2007 11:16 AM
Re: Proc Means with statistical Test David L Cassell Newsgroup comp.soft-sys.sas 0 11-22-2005 08:59 PM



All times are GMT. The time now is 05:20 PM.


Copyright ©2009

LinkBacks Enabled by vBSEO 3.3.0 RC2 © 2009, Crawlability, Inc.