On 18/10/13 18:40, Allin Cottrell wrote:
> On Fri, 18 Oct 2013, Ignacio Diaz-Emparanza wrote:
>> On 18/10/13 15:12, Allin Cottrell wrote:
>>> Perhaps we should offer an optional second argument to readfile(),
>>> allowing the user to specify the source codeset.
>> I think it is a good idea.
> OK, it's now implemented. Suppose I want to use readfile() on
> a text file encoded in MS codepage 1251 (Russian), and that is
> not my locale codeset. I can then do
>
>   string s = readfile("russky.txt", "cp1251")
>
> (Case doesn't matter in the codeset name).
Thanks!
With respect to the 'open' command (importing CSV), I think we may leave
the responsibility for supplying a file in the UTF-8 codeset to the user,
but the error message that appears when trying to import from an
incorrect codeset could probably be more explicit.
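
To make "leaving it to the user" concrete, here is a minimal sketch (in Python, outside gretl) of re-encoding data from a known codeset to UTF-8 before importing it. The function name to_utf8 and the sample strings are only illustrative assumptions; this is not how gretl does it internally.

```python
def to_utf8(raw: bytes, source_codeset: str) -> bytes:
    """Re-encode bytes from source_codeset as UTF-8.

    to_utf8 is a hypothetical helper; it assumes the caller
    already knows which codeset the data were written in.
    """
    return raw.decode(source_codeset).encode("utf-8")


# Sample data standing in for a CSV cell written on a cp1252 system.
sample = "año".encode("cp1252")
converted = to_utf8(sample, "cp1252")

# Python codec names are case-insensitive, so "CP1251" is accepted too.
russian = to_utf8("привет".encode("cp1251"), "CP1251")
```

For a real file one would read it in binary mode, pass the bytes through such a function, and write the result to a new file that 'open' can then import.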
With the table I sent you, the error I obtain is
<output>
Binary data (225) encountered (line 9:4): this is not a valid text file
</output>
I assume that in these conditions the program cannot distinguish an
accented character or symbol in a non-UTF-8 codeset from any other binary
data, so a solution may be to add a warning (after the error message)
such as
"Warning: Please note that gretl uses the UTF-8 codeset for importing
plain text or CSV data."
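
As a rough illustration of the idea (a Python sketch, not gretl's actual code): byte 225 is 0xE1, which is 'á' in ISO-8859-1 and cp1252, but in UTF-8 it can only start a three-byte sequence, so when followed by plain ASCII it is invalid. The function check_utf8 and the exact wording of the messages are assumptions made for this example.

```python
# Hypothetical hint text, modelled on the warning suggested above.
HINT = ("Warning: Please note that gretl uses the UTF-8 codeset for "
        "importing plain text or CSV data.")


def check_utf8(raw: bytes) -> str:
    """Return "ok" if raw is valid UTF-8, else an error plus the hint."""
    try:
        raw.decode("utf-8")
    except UnicodeDecodeError as err:
        bad = raw[err.start]  # value of the first offending byte
        return (f"Binary data ({bad}) encountered: "
                f"this is not a valid text file\n{HINT}")
    return "ok"


latin1_row = "á,1\n".encode("latin-1")  # 'á' stored as the single byte 0xE1
utf8_row = "á,1\n".encode("utf-8")      # 'á' stored as 0xC3 0xA1
```

Here check_utf8(latin1_row) reports byte 225 followed by the hint, while check_utf8(utf8_row) returns "ok".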
Apart from that, I see that in my table the first accented character is
at line 10, position 5, so I think the location given in the error
message (line 9:4) is incorrect.
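
For reference, this is how I would expect the location to be counted, as a small Python sketch with 1-based line and column numbers. The function first_invalid_utf8 and the test data are made up for the example; my table is not reproduced here. If the reported values were zero-based they would come out as 9:4, but that is only a guess at the cause.

```python
def first_invalid_utf8(raw: bytes):
    """Return (line, column, byte) of the first invalid UTF-8 byte,
    with 1-based line and column, or None if raw is valid UTF-8."""
    try:
        raw.decode("utf-8")
        return None
    except UnicodeDecodeError as err:
        offset = err.start
    # Lines are counted by the newlines that precede the bad byte.
    line = raw.count(b"\n", 0, offset) + 1
    line_start = raw.rfind(b"\n", 0, offset) + 1
    # Everything before the bad byte is valid UTF-8, so count characters.
    column = len(raw[line_start:offset].decode("utf-8")) + 1
    return line, column, raw[offset]


# Nine clean lines, then a tenth with byte 0xE1 as its fifth character.
data = b"abc,1\n" * 9 + b"abcd\xe1,2\n"
location = first_invalid_utf8(data)
```

With this made-up data, location is (10, 5, 225).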
--
*Ignacio Díaz-Emparanza*
Zuzendaria/Director
ignacio.diaz-emparanza(a)ehu.es <mailto:ignacio.diaz.emparanza@ehu.es>
94 6013732
*EKONOMIA APLIKATUA III SAILA (EKONOMETRIA ETA ESTATISTIKA)/
DEPARTAMENTO DE ECONOMÍA APLICADA III (ECONOMETRÍA Y ESTADÍSTICA)
UPV/EHU*
Avda. Lehendakari Aguirre, 83 | 48015 BILBAO
*T.: +34 946013740* | *F.: +34 946013754*
*www.ea3.ehu.es* <http://www.ea3.ehu.es>