Am 02.01.2014 04:35, schrieb Allin Cottrell:
On Wed, 1 Jan 2014, Sven Schreiber wrote:
I'd second Jack's suggestion: if you can send us the original .dta
file maybe we can find out how the duplicate name came into being,
though we might also need a hint as to what sort of commands were
executed after opening the Stata file.
The file is too big and also restricted.
But I just found out that it's enough to do:
1) import
2) save as gretl file (undated)
-- the imported but corrupted variable name is still visible in the
variable list--
3) close the file
4) open the file again, and then I have a double "nordwest"!
What's also interesting, if right after the import (step 1 above) I give
the command 'varlist', I get (excerpt):
...
1149) rente1 1150) rente2 1151) rente3
1152) netto 1153) netto1 1154) netto2
1158) nation 1159) deutsch 1160) deutsch_1
...
Notice that the row with IDs 1155-1157 is missing -- 1156 is "nordwest",
and 1157 is the problematic ex-"südwest".
Further symptoms right after the import/ before saving:
If I open the series to show the values, in that new window gretl shows
the correctly spelled "südwest" header, in contrast to the mess-up in
the variable list in the main window.
If I then copy that header string into the Scite text editor and change
the encoding to UTF-8, I get a mark 'xFC' instead of the 'ü' (u-umlaut)
in the middle of the string. Of course xFC /252 is the u-umlaut in the
Windows codepage, but in UTF-8 it's an invalid byte AFAI understand.
If instead of opening the series/values window I open the attributes
window, the messed-up name appears there. There I can only select the
first two characters (not the trailing "dwest"), and if I try to copy
that (ctrl-c), gretl.exe crashes!
Hope this is enough information,
sven