October 2017 - Gretl-users - gretlml.univpm.it

by Henrique Andrade

Dear Gretl Community, I really stuck trying to define a function that gives a power set of a set. Suppose I have a set S: S = {"A", "B", "C"} The associated power set, P(S), is: P(S) = {{ }, {"A"}, {"B"}, {"C"}, {"A", "B"}, {"A", "C"}, {"B", "C"}, {"A", "B", "C"}} All that I can think by now (shame on me!) is this: strings S = defarray("A", "B", "C") scalar P_S_len = 2^nelem(S) # the size of the power set strings P_S = array(P_S_len) # an array with 8 spaces. Does anyone have any ideas? Best, Henrique Andrade

6 years, 8 months

5
15
0 / 0

Rolling OLS

by Filipe Rodrigues da Costa

Hi All, I have been trying to solve a few problems regarding the rolling regression. In order to separate them, I am using a simple example here, in which I have two financial assets: Amazon (AMZN) and the market (SP500). I'm trying to estimate "beta", which can be roughly obtained from regressing the returns of AMZN on the returns of SP500. I found a very simple code from a previous issue on this list, which is quite straightforward: > T = $nobs > > scalar window_size = 20 > scalar k = $nobs - window_size + 1 > series b = NA > > smpl 1 window_size > loop i = window_size .. T > ols AMZN const SP500 > if i < T > smpl +1 +1 > endif > endloop > > smpl full So far so good, this works quite well. But let's say the data covers 100 periods for SP500 but only 60 for AMZN (no data for the last 40). Because I'm using a rolling window of 20 data points, a point will come when the routine will use only 19 data points, 18, 17, and so on until reaching 2 (which is the technical minimum). The reason for this is because the routine will still identify datapoints in the full sample, even though there are less for AMZN. My question is as follows: Is there a simple way of imposing the routine to only estimate OLS when we have the full 20 data points for AMZN and 20 for SP500? When I have a very large dataset with 400 or even 500 assets, there are many cases where some just went out of the market and then I should not be estimating betas. I believe the program checks for the $t1 and if it exists it computes OLS. I would like it to check for everything between $t1 and $t2 and if some missing, in particular at the end, just don't compute. Hope I could make myself clear! Thanks all! -- Filipe Rodrigues da Costa Send me an email to: filipe(a)pobox.io Reach me through Telegram at: https://t.me/rodriguesdacosta

7 years, 8 months

3
3
0 / 0

Sequential naming of variables

by Filipe Rodrigues da Costa

Hi All, I'm trying to find a way of generating variable names automatically. I have a list of share prices: share1, share2, ...., share12. I want to compute returns in a loop. Here is what I was doing:> # Compute returns for all shares > loop foreach i list_share > ret_$i = 100 * (log(list_share.$i) - log(list_share.$i(-1))) > endloop The above one would return the following variables: ret_share1, ret_share2,..., ret_share12. My question is simple: is there a way of controlling the name of the output variable in a different way than what is used above? I mean, how can I get output in variables with name: ret1, ret2,..., ret12? Is there any sequential command that can be added to ret*? Thanks. -- Filipe Rodrigues da Costa Send me an email to: filipe(a)pobox.io Reach me through Telegram at: https://t.me/rodriguesdacosta

7 years, 8 months

3
3
0 / 0

Did I tell you that I just love the shell? ;-)

by Artur Tarassow

Hi all, today I've been playing around with the shell trying to run several gretl instances parallel. In a thread at the beginning of the year, Sven already came up with such an idea. This culminated in some nice example script by Allin, gathering information in a bundle if I understood this correctly. This is very nice but was too complicated for my rather simple tasks ;-) Actually, for a project I am just using a dataset comprising information for several countries, and I just need to re-run certain (intense) calculations for each country separately to create some measures which I want to plot later. So, I don't need to merge certain output files or -- even so this could also be handled, I guess. This works nicely on linux using "gnu-parallel" (https://www.gnu.org/software/parallel/) which does all the job for you. Simply save both the attached *.sh and the *.inp file in the same folder. The shell-script calls the SB()'s function package sample script, and runs the example for different numbers of bootstrap iterations. "parallel" does all the stuff, and automatically takes into account the machine's number of cores available (I am sure one could set the max. number of cores to use) and waits before starting the next queuing job to do... <SHELL> sh gretl_parallel_ex.sh </SHELL> I really love this, and it's going to save sooo much time for several applications! Artur

7 years, 8 months

1
1
0 / 0

Scan folder

by Schaff, Frederik

Hello, Is there a way within hansl to "scan" a given folder and give back the subfolder names, e.g. as strings? Non-recursive would be enough. Background: I am still working on the data-input script and changed the format of my input. If I could scan folders, I could get rid of some additional input information. Best Frederik

7 years, 8 months

4
10
0 / 0

Loop recursion: Speed comparison with Matlab

by Artur Tarassow

Dear all, I made a little experiment this morning as I was a bit puzzled that the SB() function for generating stationary bootstraps felt a bit "sluggish" when I played around with it -- at least for large vectors and many bootstrap iterations. So I decided to compare the speed of Gretl (4 days old 2017d-git) with MATLAB (2014), and the difference is astonishing. Loop recursion is about 40 times faster in MATLAB compared to Gretl. I am a bit shocked that this difference is that large. Usually I claim that Gretl is really quick. Let me give you the following example (see attached file) for which I provide both the Gretl and MATLAB codes. In exercise 1 (Ex. 1) I call SB() 10000 times for a vector with 10000 obs. The speed difference is remarkable. In Ex. 2 I draw 10000 a scalar from a uniform distribution. The difference in speed is rather negligible. I am not claiming that the SB() is programmed in the most efficient way, but as both codes are identical I would not expect such massive differences. Or do I miss something? #------------------------------- # Speed comparison on my laptop #------------------------------- # Gretl Matlab # Ex. 1 124.6562 3.1606 sec. # Ex. 2 0.0025 0.0021 sec. Best, Artur

7 years, 8 months

4
8
0 / 0

batch jobs on linux server

by Stefano

re Jack's suggestion of stopwatching my jobs, I try to explain more clearly my question: there _are_ big differences in execution time, that is 100% sure. It may depend, for unclear reasons, on the (i) the total number of jobs or (ii) the number of Gretl jobs or (iii) both. Given that I cannot control RAM and CPU use of other useres I cannot measure the coeteris paribus effect of changing the number of my simultaneous Gretl jobs. So the question is: does anybody know if there are any reasons why running more than one Gretl job at a time may affect speed? bye, Stefano -- ________________________________________________________________________ Stefano Fachin Professore Ordinario di Statistica Economica Dip. di Scienze Statistiche "Sapienza" Università di Roma P.le A. Moro 5 - 00185 Roma - Italia Tel. +39-06-49910834 fax +39-06-49910072 web http://stefanofachin.site.uniroma1.it/

7 years, 8 months

4
5
0 / 0

Re: [Gretl-users] batch jobs

by Stefano

these seem to be very useful info, I'll pass them to the system manager, thanks Stefano -- ________________________________________________________________________ Stefano Fachin Professore Ordinario di Statistica Economica Dip. di Scienze Statistiche "Sapienza" Università di Roma P.le A. Moro 5 - 00185 Roma - Italia Tel. +39-06-49910834 fax +39-06-49910072 web http://stefanofachin.site.uniroma1.it/

7 years, 8 months

1
0
0 / 0

Re: [Gretl-users] updating package SB

by Stefano

Since our package may be probably taken as an example if ineffcient coding any improvement is more than welcome :-). Yes of course, go ahead Artur, thanks Stefano -- ________________________________________________________________________ Stefano Fachin Professore Ordinario di Statistica Economica Dip. di Scienze Statistiche "Sapienza" Università di Roma P.le A. Moro 5 - 00185 Roma - Italia Tel. +39-06-49910834 fax +39-06-49910072 web http://stefanofachin.site.uniroma1.it/

7 years, 8 months

1
0
0 / 0

vectorized solution possible?

by Artur T.

Dear all, I am facing the following simple problem for which I only find a loop solution but loops are (generally?) slow compared to vectorized approaches (if possible). <hansl> matrix Y = seq(1,10)' # original realizations SEL = ceil(muniform(10,1)*10) # selection vector (select i-th row from Y) eval Y ~ SEL matrix X = zeros(10,1) # put re-sampled Y values here # I want to avoid this annoying loop... loop i=1..rows(SEL) -q catch X[i]=A[SEL[i]] # i-th position Y value endloop </hansl> In MATLAB this works simply by: X = Y(SEL) Is there any way to vectorize this task?? Thanks in advance. Artur

7 years, 8 months

2
8
0 / 0

2025

2024

2023

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

2009

2008

2007

2006

2005

2004

Gretl-users October 2017