Using closures to make 16 related functions

The R Open Sci newsletter linked to a mailing list thread about how to stash data inside of an R package.

Duncan Murdoch suggests using a function factory to create closures that can access data in the parent environment.

Your local() call can create several functions and return them in a list; then just those functions have access to the local variables. For example,
createFns <- function() {

   .fooInfo <- NULL

   fn1 <- function (...) { ... }
   fn2 <- function (...) { ... }

   list(fn1 = fn1, fn2 = fn2)
}

fns <- createFns()
fn1 <- fns$fn1
fn2 <- fns$fn2
Now fn1 and fn2 are functions that can see .fooInfo, and nobody else can (without going through contortions).

(I had to fix the code to create a real function factory.)

Then Martin Mächler replies to say yup, this is what we do in base R.

Note that the above approach has been how nls() has been implemented for R … a very long time ago {before R 1.0.0}

e.g. from example(nls):

DNase1 <- subset(DNase, Run == 1)
fm1 <- nls(density ~ SSlogis(log(conc), Asym, xmid, scal), DNase1)
str(fm1$m)
#> List of 16
#>  $ resid     :function ()  
#>  $ fitted    :function ()  
#>  $ formula   :function ()  
#>  $ deviance  :function ()  
#>  $ lhs       :function ()  
#>  $ gradient  :function ()  
#>  $ conv      :function ()  
#>  $ incr      :function ()  
#>  $ setVarying:function (vary = rep_len(TRUE, np))  
#>  $ setPars   :function (newPars)  
#>  $ getPars   :function ()  
#>  $ getAllPars:function ()  
#>  $ getEnv    :function ()  
#>  $ trace     :function ()  
#>  $ Rmat      :function ()  
#>  $ predict   :function (newdata = list(), qr = FALSE)  
#>  - attr(*, "class")= chr "nlsModel"

# so 16 functions, all sharing the *same* environment very
# efficiently and nicely

# this is *the* environment for the fitted model:
fmE <- environment(fm1$m[[1]])
ls.str(fmE)
#> convCrit : function ()  
#> dev :  num 0.00479
#> env : <environment: 0x0000020a69f488b0> 
#> form : Class 'formula'  language density ~ SSlogis(log(conc), Asym, xmid, scal)
#> getPars : function ()  
#> getPars.noVarying : function ()  
#> getPars.varying : function ()  
#> getRHS : function ()  
#> getRHS.noVarying : function ()  
#> getRHS.varying : function ()  
#> gradCall :  language attr(ans, "gradient")[c(TRUE, TRUE, TRUE, TRUE, TRUE, TRUE, TRUE, TRUE,  TRUE, TRUE, TRUE, TRUE, TRUE, TRUE, TRUE| __truncated__ ...
#> ind : List of 3
#>  $ Asym: int 1
#>  $ xmid: int 2
#>  $ scal: int 3
#> internalPars :  Named num [1:3] 2.35 1.48 1.04
#> lhs :  num [1:16] 0.017 0.018 0.121 0.124 0.206 0.215 0.377 0.374 0.614 0.609 ...
#> nDcentral :  logi FALSE
#> npar :  int 3
#> QR : List of 4
#>  $ qr   : num [1:16, 1:3] -1.5221 0.0086 0.0314 0.0314 0.0584 ...
#>  $ rank : int 3
#>  $ qraux: num [1:3] 1.01 1.03 1.28
#>  $ pivot: int [1:3] 1 2 3
#> resid :  num [1:16] -0.01368 -0.01268 0.00895 0.01195 -0.00258 ...
#> rhs :  num [1:16] 0.0307 0.0307 0.1121 0.1121 0.2086 ...
#> scaleOffset :  num 0
#> setPars : function (newPars)  
#> setPars.noVarying : function (newPars)  
#> setPars.varying : function (newPars)  
#> upper :  NULL
#> useParams :  logi [1:3] TRUE TRUE TRUE
#> wts :  num [1:16] 1 1 1 1 1 1 1 1 1 1 ...

so the environment “contains” the functions themselves (but quite a few more things) and for an environment that means it only has pointers to the same function objects which are also in fm1$m.

So, there has been a nice convincing and important example on how to do this - inside R for more than two decennia.

Final emphasis was added by me.

Leave a comment