Hi Allin, Jack, and Peter,
I just sorted the mroz dataset using the GUI; the screenshot follows.
BTW, as you might be able to see, the email messages with your responses get garbled when I receive them, particularly the scripts that you’ve provided. Any reason why?
Fred


Date: Fri, 31 May 2019 09:21:08 +0200 (CEST)
From: "Riccardo (Jack) Lucchetti" <r.lucchetti@univpm.it>
Subject: [Gretl-users] Re: More minor bugs or errors (Riccardo (Jack)
Lucchetti) & (Allin Cottrell)
To: Gretl list <gretl-users@gretlml.univpm.it>
Message-ID: <alpine.DEB.2.21.1905310911480.16858@ec-4.econ.univpm.it>
Content-Type: multipart/mixed;
boundary="-842988409-1736161747-1559287268=:16858"

 This message is in MIME format.  The first part should be readable text,
 while the remaining parts are likely unreadable without MIME-aware tools.

---842988409-1736161747-1559287268=:16858
Content-Type: text/plain; charset=utf-8; format=flowed
Content-Transfer-Encoding: quoted-printable

On Fri, 31 May 2019, Fred Engst wrote:

#5: The sorting issue occurs only in the famous mroz dataset that is
used by both Wooldridge’s “Introductory Econometrics: A Modern Approach”
textbook and Hill-Griffiths-Lim’s “Principles of Econometrics” textbook.

However, I’ve just confirmed that only the “mroz.gdt” downloaded from
Wooldridge’s datalink
https://sourceforge.net/projects/gretl/files/datafiles/wooldridge.tar.gz/download
seems to have this problem: sorting by wage or lwage gives the same
results. The one downloaded from POE’s doesn’t.

Uhm, funny. I can't reproduce this. This is what I just tried (after
downloading the dataset from sourceforge and storing the gdt file into a
temporary dir):

<hansl>
set verbose off
open /tmp/mroz.gdt --quiet

series orig_order = time
scalar check = ( min(diff(wage)) >= 0 ) && ( min(diff(lwage)) >= 0 )
printf "check: %d (unsorted: should be 0)\n", check

dataset sortby wage
scalar check = ( min(diff(wage)) >= 0 ) && ( min(diff(lwage)) >= 0 )
printf "check: %d (sorted by wage: should be 1)\n", check

dataset sortby lwage
scalar check = ( min(diff(wage)) >= 0 ) && ( min(diff(lwage)) >= 0 )
printf "check: %d (sorted by lwage: should be 1)\n", check

dataset sortby orig_order
scalar check = ( min(diff(wage)) >= 0 ) && ( min(diff(lwage)) >= 0 )
printf "check: %d (back to original: should be 0)\n", check
</hansl>

Could you please try this script?
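
A note on the check used in the script: min(diff(x)) >= 0 holds exactly
when no observation is smaller than the one before it, that is, when x
is in nondecreasing order. A minimal sketch on made-up data follows; the
series x and y and the scalars x_ok and y_ok are hypothetical names, not
part of the mroz dataset.

```hansl
# toy data, made up for illustration only
nulldata 5
series x = {1; 2; 2; 5; 9}   # nondecreasing (ties are allowed)
series y = {1; 3; 2; 5; 9}   # one step down, from 3 to 2

# the first difference is missing for observation 1; min() skips it
scalar x_ok = ( min(diff(x)) >= 0 )
scalar y_ok = ( min(diff(y)) >= 0 )
printf "x nondecreasing: %d, y nondecreasing: %d\n", x_ok, y_ok
```

Because the test in the script combines the checks for wage and lwage
with &&, it reports 1 only if both series are ordered at the same time.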

-------------------------------------------------------
  Riccardo (Jack) Lucchetti
  Dipartimento di Scienze Economiche e Sociali (DiSES)

  Università Politecnica delle Marche
  (formerly known as Università di Ancona)

  r.lucchetti@univpm.it
  http://www2.econ.univpm.it/servizi/hpp/lucchetti
-------------------------------------------------------
---842988409-1736161747-1559287268=:16858--

------------------------------

Date: Fri, 31 May 2019 09:17:59 +0100 (BST)
From: Allin Cottrell <cottrell@wfu.edu>
Subject: [Gretl-users] Re: More minor bugs or errors  (Riccardo (Jack)
Lucchetti) & (Allin Cottrell)
To: Gretl list <gretl-users@gretlml.univpm.it>
Message-ID: <alpine.LNX.2.21.1905310912210.955@waverley.localdomain>
Content-Type: multipart/mixed;
boundary="8323329-1932899831-1559290680=:955"

--8323329-1932899831-1559290680=:955
Content-Type: text/plain; charset=utf-8; format=flowed
Content-Transfer-Encoding: quoted-printable

On Fri, 31 May 2019, Fred Engst wrote:

#5: The sorting issue occurs only in the famous mroz dataset that is
used by both Wooldridge’s “Introductory Econometrics: A Modern Approach”
textbook and Hill-Griffiths-Lim’s “Principles of Econometrics” textbook.
However, I’ve just confirmed that only the “mroz.gdt” downloaded from
Wooldridge’s datalink [...] seems to have this problem: sorting by wage
or lwage gives the same results. The one downloaded from POE’s doesn’t.

What do the bad results look like?

We probably also need to know: how exactly are you sorting? There are
three ways of doing this in gretl:

1) Using the sort() function, as in

  series wage_sorted = sort(wage)

2) Using the "dataset" command with the "sortby" parameter.

3) In the GUI, by displaying a series and using the sort icon.
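
The first two methods differ in scope, which may matter for this
report: sort() returns a sorted copy of a single series and leaves the
dataset as it is, while "dataset sortby" reorders all the series in the
dataset together. A minimal sketch on made-up data follows; x, y and
x_sorted are hypothetical names chosen for illustration.

```hansl
# toy data, made up for illustration only
nulldata 6
series x = {3; 1; 2; 6; 5; 4}
series y = 10 * x

# 1) sort(): a sorted copy of x; x and y keep their original order
series x_sorted = sort(x)
print x y x_sorted --byobs

# 2) dataset sortby: every series is reordered, row by row, using x as key
dataset sortby x
print x y x_sorted --byobs
```

Comparing the two printouts shows which series moved and which did not.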

Allin
--8323329-1932899831-1559290680=:955--

------------------------------

Date: Fri, 31 May 2019 09:34:27 +0100 (BST)
From: Allin Cottrell <cottrell@wfu.edu>
Subject: [Gretl-users] Re: More minor bugs or errors (Riccardo (Jack)
Lucchetti) & (Allin Cottrell)
To: r.lucchetti@univpm.it, Gretl list <gretl-users@gretlml.univpm.it>
Message-ID: <alpine.LNX.2.21.1905310930490.955@waverley.localdomain>
Content-Type: multipart/mixed;
boundary="8323329-177749420-1559291668=:955"

--8323329-177749420-1559291668=:955
Content-Type: text/plain; charset=utf-8; format=flowed
Content-Transfer-Encoding: quoted-printable

On Fri, 31 May 2019, Riccardo (Jack) Lucchetti wrote:

On Fri, 31 May 2019, Fred Engst wrote:

#5: The sorting issue occurs only in the famous mroz dataset that is
used by both Wooldridge’s “Introductory Econometrics: A Modern Approach”
textbook and Hill-Griffiths-Lim’s “Principles of Econometrics” textbook.

However, I’ve just confirmed that only the “mroz.gdt” downloaded from
Wooldridge’s datalink
https://sourceforge.net/projects/gretl/files/datafiles/wooldridge.tar.gz/download
seems to have this problem: sorting by wage or lwage gives the same
results. The one downloaded from POE’s doesn’t.

Uhm, funny. I can't reproduce this. This is what I just tried (after
downloading the dataset from sourceforge and storing the gdt file into a
temporary dir):

<hansl>
set verbose off
open /tmp/mroz.gdt --quiet

series orig_order = time
scalar check = ( min(diff(wage)) >= 0 ) && ( min(diff(lwage)) >= 0 )
printf "check: %d (unsorted: should be 0)\n", check

dataset sortby wage
scalar check = ( min(diff(wage)) >= 0 ) && ( min(diff(lwage)) >= 0 )
printf "check: %d (sorted by wage: should be 1)\n", check

dataset sortby lwage
scalar check = ( min(diff(wage)) >= 0 ) && ( min(diff(lwage)) >= 0 )
printf "check: %d (sorted by lwage: should be 1)\n", check

dataset sortby orig_order
scalar check = ( min(diff(wage)) >= 0 ) && ( min(diff(lwage)) >= 0 )
printf "check: %d (back to original: should be 0)\n", check
</hansl>

Could you please try this script?

Good idea. Here's another little test script in the same spirit but
using sort():

<hansl>
open /usr/local/share/gretl/data/wooldridge/mroz.gdt
ols lwage 0 log(wage) --simple
series swage = sort(wage)
series slwage = sort(lwage)
# regression results should be the same as first time
ols slwage 0 log(swage) --simple
# should get two zeros below
eval min(swage - swage(-1))
eval min(slwage - slwage(-1))
</hansl>

Allin
--8323329-177749420-1559291668=:955--

------------------------------

Date: Fri, 31 May 2019 19:40:13 +0000
From: "Summers, Peter" <psummers@highpoint.edu>
Subject: [Gretl-users] Re: More minor bugs or errors (Riccardo (Jack)
Lucchetti) & (Allin Cottrell)
To: 'Gretl list' <gretl-users@gretlml.univpm.it>
Message-ID:
<6A97640E2DDBB94BBE02DE6E658B320C7E140BE4@TOL.highpoint.edu>
Content-Type: text/plain; charset="utf-8"

-----Original Message-----
On Fri, 31 May 2019, Fred Engst wrote:

#5: The sorting issue occurs only in the famous mroz dataset that is
used by both Wooldridge’s “Introductory Econometrics: A Modern Approach”
textbook and Hill-Griffiths-Lim’s “Principles of Econometrics” textbook.

However, I’ve just confirmed that only the “mroz.gdt” downloaded from
Wooldridge’s datalink
https://sourceforge.net/projects/gretl/files/datafiles/wooldridge.tar.gz/download
seems to have this problem: sorting by wage or lwage gives the same
results. The one downloaded from POE’s doesn’t.

Uhm, funny. I can't reproduce this. This is what I just tried (after
downloading the dataset from sourceforge and storing the gdt file into
a temporary dir):

<hansl>
set verbose off
open /tmp/mroz.gdt --quiet

series orig_order = time
scalar check = ( min(diff(wage)) >= 0 ) && ( min(diff(lwage)) >= 0 )
printf "check: %d (unsorted: should be 0)\n", check

dataset sortby wage
scalar check = ( min(diff(wage)) >= 0 ) && ( min(diff(lwage)) >= 0 )
printf "check: %d (sorted by wage: should be 1)\n", check

dataset sortby lwage
scalar check = ( min(diff(wage)) >= 0 ) && ( min(diff(lwage)) >= 0 )
printf "check: %d (sorted by lwage: should be 1)\n", check

dataset sortby orig_order
scalar check = ( min(diff(wage)) >= 0 ) && ( min(diff(lwage)) >= 0 )
printf "check: %d (back to original: should be 0)\n", check
</hansl>

Could you please try this script?

Good idea. Here's another little test script in the same spirit but using sort():

<hansl>
open /usr/local/share/gretl/data/wooldridge/mroz.gdt
ols lwage 0 log(wage) --simple
series swage = sort(wage)
series slwage = sort(lwage)
# regression results should be the same as first time
ols slwage 0 log(swage) --simple
# should get two zeros below
eval min(swage - swage(-1))
eval min(slwage - slwage(-1))
</hansl>


I can confirm that something unexpected's going on:

If I run Jack's script on the Wooldridge version of mroz.gdt, I get "check = 0" in all 4 cases. Allin's script on the same data set gives identical regression results in the 2 cases, and the "eval" statements return 0's as expected.

Running the same 2 scripts on the Hill et al. version of mroz, I get the expected results from Jack's script, but different regression results from Allin's. Both eval statements still return 0's though:

gretl version 2019c-git
Current session: 2019-05-31 15:38

#open mroz.gdt
? ols lwage 0 log(wage) --simple

OLS, using observations 1-428
Dependent variable: lwage

            coefficient   std. error   t-ratio   p-value
 -------------------------------------------------------
 const       0.000000      0.000000      NA        NA   
 l_wage      1.00000       0.000000      NA        NA   

SSR = 0, R-squared = 1.000000

Warning: generated missing values
? series swage = sort(wage)
? series slwage = sort(lwage)
# regression results should be the same as first time
? ols slwage 0 log(swage) --simple # should get two zeros below

OLS, using observations 326-428 (n = 103)
Dependent variable: slwage

            coefficient   std. error   t-ratio    p-value
 ---------------------------------------------------------
 const       1.92669      0.0317078    60.76     2.41e-081 ***
 l_swage     0.411814     0.0496341     8.297    4.90e-013 ***

SSR = 8.63157, R-squared = 0.405323

Warning: generated missing values
? eval min(swage - swage(-1))
0
? eval min(slwage - slwage(-1))
0
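
One thing that might be worth checking, offered as a guess rather than a diagnosis: the "generated missing values" warnings suggest that log(wage) is undefined for some observations, for instance where wage is zero. Since sort() is applied to each series separately, missing values in one series but not in the other could leave the sorted pairs misaligned. The toy sketch below uses made-up series (w, lw, sw and slw are hypothetical names) and prints them side by side, so the placement of the missing values after sorting can be inspected directly.

```hansl
# toy data, made up for illustration only: two zero "wages"
nulldata 5
series w = {0; 0; 2; 1; 4}
series lw = log(w)        # log(0) is undefined, so lw is missing there

# sort each series on its own, as in the sort() test script
series sw = sort(w)
series slw = sort(lw)

# compare row by row: does slw still equal log(sw) everywhere?
print w lw sw slw --byobs
```

If the rows of sw and slw do not line up, a regression of slw on log(sw) would not be expected to reproduce the unsorted results.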

I hope this helps,
Peter

------------------------------


End of Gretl-users Digest, Vol 148, Issue 22
********************************************