Am 05.11.18 um 23:34 schrieb Allin Cottrell:
Summary, from my point of view: We're now clear that gretl is not
ignoring the sign of the test statistic, and is doing what R, Stata
and Eviews also do, namely reporting an estimate of the CDF of the
relevant distribution evaluated at the observed test statistic.
Although I think this convention is too strong to be bucked, I do
think that Sven has a point. It's not specific to ADF tests, but
pertains more generally to p-values for one-sided tests. It's not
clear to me that p-values are well defined for values of the test
statistic that are not at all adverse to the null --
Thank you for the summary, Allin, that's exactly what I meant.
(@ all others: thank you for your comments, but it seemed to me that you
were stating or proving things that I had never doubted in the first place)
For those who are still interested in the philosophy of statistics,
after some more reading (browsing, really) I think some of that
differing interpretation of p-values actually dates back to discussions
between Fisher vs. Neyman-Pearson. Fisher apparently advocated the
p-value as a continuous (or monotonous) summary statistic of how well
the data are compatible with the hypothesis. In that sense my doubts
about the one-sided test situation were Fisherian when I said, how can
data that exactly coincides with the H0 value be classified by the
software as *less* compatible (lower p-value, p<1) than data further
away from it.
However, I now actually think there is another layer of complexity added
by the special ADF situation.
In a standard one-sided test setup (non-ADF) the null hypothesis is what
is called "composite", it covers a continuum of values like in H0: b
\geq b_0. So the case bhat > b is no deviation from H0 at all. My old
argument was that P-values in that area are dubious, given that they are
supposed to refer to something "at least as extreme as the observation",
when the observation is not extreme at all.
In the ADF situation instead it really is just the unit-root point
hypothesis. Not composite. So rhohat > 1 is not part of H0, but at the
same time we would never reject H0 because of it. Very strange beast.
P-values in that region also dubious, but perhaps for different reasons.
My conjecture is that many people in R, Stata, Eviews have asked
themselves the same questions, but there is no satisfactory solution to
be communicated in a single number. Fortunately the numerical difference
is small (as someone already mentioned) and so for standard levels of
significance and decision rules it's just not relevant.
The current urcpval results in the upper tail may still be wrong and/or
meaningless when you take the meaning of p-values seriously, but I
concede it's hard to come up with something else which is also concise.
thanks,
sven