Here's some background:
I'm working on a model predicting periods of financial crises; the dependent variable
is usually coded 0/1 (no crisis/crisis) so probit/logit is fine. However Romer & Romer
(2014) have a new integer-valued classification scheme that ranges from 0 (no crisis) to
15 (extreme crisis). They present a chronology based on semi-annual observations, but my
data are annual. The half-integer values in my dependent variable arose because I averaged
R&R's classification over the year. So a country that goes from 0 in the first
half to 1 in the second gets a 0.5 from me.
I've never used interval regression, but a brief read of the user's guide re
'intreg' makes me think that my situation really is closer to ordered probit.
Also, note that the guide (p. 288) seems to specifically allow for the change Allin just
implemented: "In order to apply these models in gretl, the dependent variable must
either take on only nonnegative integer values, or be explicitly marked as discrete. (In
case the variable has non-integer values, it will be recoded internally.)"
PS
-----Original Message-----
From: gretl-users-bounces(a)lists.wfu.edu [mailto:gretl-users-bounces@lists.wfu.edu] On
Behalf Of Riccardo (Jack) Lucchetti
Sent: Wednesday, January 07, 2015 1:40 PM
To: Gretl list
Subject: Re: [Gretl-users] ordered probit
On Wed, 7 Jan 2015, Allin Cottrell wrote:
It should now be safe to pass a series with non-integral values as
the
dependent variable in ordered probit, provided it has been sucessfully
marked as discrete.
Excuse me, I may be missing something, but I fail to see the logic in this. In an ordered
probit model, the support of the dependent variable is supposed to be a sequence of
increasing numbers, which indicate increasing "degress of intensity" of a
certain unobserved variable, whose conditional mean is what we're trying to estimate.
Of course they could be any sequence, as long as it's increasing, but I would guess
that common sense dictates they should be increasing _integers_, since it's a purely
conventional way of saying labelling different degrees of intensity across observations.
IMHO allowing the dependent variable to be non-integer could easily lead to failure to
spot an incorrect application of ordered probit (eg, when you're using the wrong
variable from your dataset) and gives you nothing in return.
As I said before, if the dependent variable is truly quantitative (as opposed as being a
conventional coding for an unobserved continuous latent
variable) and for example 12.5 is just a way of saying "somewhere between
12 and 13" or whatever, the right tool for the job is interval regression, not
ordered probit.
-------------------------------------------------------
Riccardo (Jack) Lucchetti
Dipartimento di Scienze Economiche e Sociali (DiSES)
Università Politecnica delle Marche
(formerly known as Università di Ancona)
r.lucchetti(a)univpm.it
http://www2.econ.univpm.it/servizi/hpp/lucchetti
-------------------------------------------------------