Species In Space: New test version of ENMTools with model selection

Tuesday, June 8, 2010

There's a new test version of ENMTools up here. In addition to fixing a few minor annoyances from previous versions, there's a new function that allows criterion-based model selection using AICc and BIC. The user interface for the function is almost non-existent - all it really does is ask for a script file. Here's a quick-and-dirty rundown of how to use it. In order to correctly calculate likelihoods, the data must be formatted appropriately. For that reason we suggest that users pay very close attention to the requirements below.

1. Build a set of models to compare. It is absolutely crucial that suitability scores be output in RAW format! You will need both the .asc file and the .lambdas file associated with each model.

2. Make sure that each set of occurrence points to be compared is in its own independent file. You do not want to load an occurrence file that has points for multiple species. You also need to eliminate duplicate occurrence points from the file, particularly if you have Maxent set to ignore duplicate occurrences.

3. Build a script. A script is simply a .csv file with the paths to the files you want to analyze. Each line of the script should list, in order, an occurrence .csv file, a .asc file, and a .lambdas file, separated by commas. Relative paths will not work; you need fully qualified path names. A typical line will look like this:

c:\mydata\points.csv,c:\mydata\species.asc,c:\mydata\species.lambdas

You need one line per analysis. Also note that ENMTools will output results into a file with a name based on your script file. If your script file is named myscript.csv, the output will be named myscript_model_selection.csv. At present it will overwrite that output file (if it already exists) without asking, so BE CAREFUL.

4. In ENMTools, choose the "Model Selection" tool under "ENM Measurements". A file dialog will pop up. At this point you should choose the script file that you just made. ENMTools will chug along for a while, and will tell you when it's finished. The process is fairly simple: ENMTools uses your raw suitability scores (after standardization) and occurrence points to calculate the likelihood of observing your data under that model. It then counts the number of parameters from your lambdas file, counting any parameter with nonzero weight. Finally, it uses these values to calculate AICc and BIC.
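For readers who want to see the arithmetic behind step 4, here is a minimal Python sketch of the calculation as described above. It is not ENMTools' own code (ENMTools is a Perl program). The function name and arguments are hypothetical. The sketch assumes that "standardization" means dividing each raw score by the sum of raw scores over the whole grid, and that the log likelihood is the sum of the logs of the standardized scores at the occurrence points. The AIC, AICc, and BIC formulas are the standard ones.

```python
import math


def model_selection_scores(grid_scores, point_scores, n_params):
    """Return (log_likelihood, AIC, AICc, BIC) for one model.

    grid_scores  -- raw suitability of every non-nodata cell in the .asc grid
    point_scores -- raw suitability at each occurrence point
    n_params     -- number of lambdas with nonzero weight
    (All three names are illustrative, not ENMTools identifiers.)
    """
    total = sum(grid_scores)
    # Assumed standardization: scores over the whole grid sum to 1.
    probs = [score / total for score in point_scores]
    log_lik = sum(math.log(p) for p in probs)

    n = len(probs)
    k = n_params
    aic = 2 * k - 2 * log_lik
    bic = k * math.log(n) - 2 * log_lik
    # The small-sample correction is undefined when n - k - 1 <= 0,
    # so this sketch reports None in that case.
    if n - k - 1 > 0:
        aicc = aic + (2 * k * (k + 1)) / (n - k - 1)
    else:
        aicc = None
    return log_lik, aic, aicc, bic
```

Lower AICc or BIC indicates the preferred model among those fitted to the same occurrence points. Note that `math.log` raises an error for zero or negative scores, so nodata values at occurrence points would need to be filtered out first.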

Preliminary studies (Warren and Seifert, in review) indicate that AICc outperforms BIC in selecting models on simulated data. I'll be talking about this study at Evolution this year, for those who are interested (shameless plug).

Keep in mind that this is a test build and may be buggy. Feedback is appreciated.

59 comments:

Yuma, October 6, 2010 at 4:40 PM
Dear Dan:
I am trying to use ENMTools to test niche identity for some cryptic species (phylogroups). I installed the new version of ENMTools (1.1) and the latest version of Perl, and I am using the latest version of MaxEnt, 3.3.1. Everything appears to run well when I calculate I and D, but when I try to run the identity or background test I get the same error each time:

Can't open C:/Users/Jonathan/Documents/Trabajo/Docto/Analyses/ENMTools/Rglaucostigma_rep0.asc!!

Can't open C:/Users/Jonathan/Documents/Trabajo/Docto/Analyses/ENMTools/Rglaucostigma_rep0.asc!!

while executing
"::perl::CODE(0x23baecc)"
invoked from within
".b5 invoke "
invoked from within
".b5 instate {pressed !disabled} { .b5 state !pressed; .b5 invoke } "
(command bound to event)

Any idea what I am doing wrong?

Dan Warren, October 6, 2010 at 4:45 PM
Could you email me your .csv file? Send it to dan.l.warren@gmail.com, thanks!

Yuma, October 6, 2010 at 6:27 PM
I have sent you the files. I also tried with the example files, and I got the same error.

Yuma, October 20, 2010 at 6:16 PM
Dear Dan:
I was just wondering whether you found any problem in my .csv file, or whether you have any idea what is producing the error?

Anonymous, October 20, 2010 at 6:28 PM
Oh goodness, I'm sorry! Somebody else emailed me a .csv file at the same time and I got the two confused - I thought that I had fixed your problem! I'll look at it straight away.

Anonymous, October 20, 2010 at 6:32 PM
Oh wait, I just dug through my email and found that I did write back to you with a question - perhaps you didn't see it?

Naturaleza_Cehegín, January 25, 2011 at 6:15 AM
Dear Dan,

I'm using ENMTools to compare different Maxent models. I've followed your instructions and it seems to work correctly; I get the AICc and BIC for most of the models. However, for some of the models the output file gives just "x"s instead. For example, the output says this:

C:\models\species.csv,P:\models\species.asc,-151.356067919256,20,17,x,x,x

Any idea what is going wrong?

Thanks in advance,

Pedro

Dan Warren, January 25, 2011 at 7:01 AM
That occurs when your model has more parameters than you have occurrence points, which violates the assumptions of AIC.

Naturaleza_Cehegín, January 25, 2011 at 7:21 AM
Thanks Dan for your reply. Now, I understand. So, I should discard those models with more parameters than occurrence points?

Cheers,

Pedro

Dan Warren, January 25, 2011 at 7:23 AM
I would, yes.

Naturaleza_Cehegín, February 9, 2011 at 3:57 AM
Hi again Dan,

One new (but quick) question: when obtaining the suitability scores (in RAW format) with Maxent in order to compare different models, the "add samples to background" option should be disabled, shouldn't it? I've read that in your Ecol Appl paper, but it is not in the "protocol" above.

Cheers,

Pedro

Dan Warren, February 9, 2011 at 6:40 AM
We disabled that option on the advice of Steven Phillips, specifically because we were using simulated data, which has no spatial sampling bias. As that is not the case with real data, I don't think that it is generally necessary to disable this option.

Naturaleza_Cehegín, February 9, 2011 at 8:41 AM
Ok. Thanks!

ladybluedevil, June 1, 2012 at 11:02 AM
Hey Dan - is there any reason why I would get X's instead of likelihood values when I *know* that my sample size is greater than the number of model parameters? I have 595 sample points (none are duplicated) and 87 model parameters. I'd love to use this tool but I can't figure out what's going on! Any help would be GREATLY appreciated! :)

~Elizabeth

ladybluedevil, June 1, 2012 at 11:03 AM
p.s. I am running ENMTools_1.3 for OSX.

Unknown, July 29, 2012 at 10:31 PM
Dear Dan:
I am trying to use ENMTools to test niche identity for a few plant species. I have installed the new version of ENMTools (1.3) and the latest version of Perl, and I am using MaxEnt 3.3.1. I was able to calculate I and D, but when I try to run the identity or background test I get the same error:

Can't open H:/ENMTools_1.3/Phyllanthus debilis_rep0.asc!!

Can't open H:/ENMTools_1.3/Phyllanthus debilis_rep0.asc!!

while executing
"::perl::CODE(0x5642e4c)"
invoked from within
".b5 invoke "
invoked from within
".b5 instate !disabled { .b5 invoke } "
invoked from within
".b5 instate pressed { .b5 state !pressed; .b5 instate !disabled { .b5 invoke } } "
(command bound to event)

Can you please help me solve this problem?

Unknown, November 13, 2012 at 9:32 AM
Dear Dan,

I am using the ENMTools model selection tool to compare Maxent habitat suitability models, and wondered whether it is ok to use an occurrence file that has duplicate occurrence points in it (as I did not remove duplicates in my Maxent models, for reasons I won't go into here!). I know the instructions say to remove them, but I'd like to know if the tool will run ok and produce sensible AIC/BIC results with them kept in? Your advice would be much appreciated.

Thanks,
Anna

Dan Warren, November 13, 2012 at 12:22 PM
If you constructed your models without removing duplicates, it's probably better to evaluate them without using duplicates. Cheers!

Unknown, November 14, 2012 at 4:21 AM
Hi Dan,

Thanks for your quick reply. Just to clarify, are you saying that I should evaluate the models without using duplicates (i.e., remove them), even though I kept them in to build the models? Thanks!

Dan Warren, November 14, 2012 at 11:50 AM
Yes, but I'm assuming that you kept them in because you have some compelling reason to believe that the duplicate occurrences are not simply due to sampling bias.
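As an illustration of the duplicate removal discussed in this exchange and in step 2 of the post, here is a minimal Python sketch that drops repeated rows from an occurrence table. It is not part of ENMTools. The function name is hypothetical, and it assumes each row is a list of text fields such as species, longitude, and latitude. It only catches rows whose fields match exactly, and it does not attempt to detect distinct coordinates that fall in the same grid cell.

```python
def drop_duplicate_points(rows):
    """Keep the first copy of each row, preserving the original order.

    rows -- list of rows, each a list of text fields, for example
            [species, longitude, latitude] (an assumed layout).
    """
    seen = set()
    unique = []
    for row in rows:
        # Compare on whitespace-trimmed text, so "1.0" and "1.00"
        # are still treated as different values.
        key = tuple(field.strip() for field in row)
        if key not in seen:
            seen.add(key)
            unique.append(row)
    return unique
```

The rows could come from Python's `csv.reader`, with the header line set aside before the call and written back afterwards.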

E., February 13, 2013 at 8:40 AM
Dear Dan, I am using ENMTools to compare models run with Maxent, but I have one question. Sorry if it is too obvious.
Is the set of occurrence points to be compared the set of points used to build the model (training data), or do you use all the data (training + test data)?
And do you use only presence data, or presence + absence?
Thanks a lot!
Elena

Dan Warren, February 13, 2013 at 12:25 PM
It uses all data, presence only.

E., February 15, 2013 at 4:17 AM
Thanks a lot Dan!!

Batista, March 10, 2013 at 4:46 PM
Dear Dan,
I'm starting to use ENMTools to evaluate my Maxent models and I have a question about the process: is it necessary to standardize all the .asc files before running the Model Selection tool, or does the tool do it during the process? I ask because I'm very green in this area and need to be certain of my final scores and how to interpret them correctly. All my final scores seem to be extremely high (AICc of ~106820793909), but perhaps that is normal. Sorry for this basic question, and many thanks in advance.
Leonel

Dan Warren, March 10, 2013 at 4:49 PM
ENMTools standardizes the models. However, the AICc score you present is a bit weird. Can you send me one of your models, or are they too big?

E., April 3, 2013 at 4:47 AM
Hi again Darren! When running ENMTools with my models I get this error:
Can't take log of -9.23358e-054 at ENMTools_3-17-2011.pl line 2116.

Can't take log of -9.23358e-054 at ENMTools_3-17-2011.pl line 2116.

while executing
"::perl::CODE(0x13233a4)"
(menu invoke)

The output of my models is RAW.
What could be wrong? Thanks a lot!

Dan Warren, April 7, 2013 at 3:56 PM
The error simply means you can't take the log of a negative number. I can't really tell from this why your models might be producing negative probabilities, but obviously that's a problem regardless.

Unknown, April 16, 2013 at 3:29 PM
Hi Dan,

First, let me say you are an absolute gem for troubleshooting in your blog comments for years. Much appreciated.

Second, I cannot convince the Model Selection tool to find any of the files needed to compare models--the points, the prediction raster, or the lambdas. I get this message for the files in each of the six models (Auto features used as an example):

Can't find c:\temp\camas_trimmed.csv c:\temp\auto.asc c:\temp\auto.lambdas!
Can't find !
Can't find !

I started out with all my files in places with quite complicated paths and, as I got more and more errors, copied and renamed them until they sit in this little temp folder on my C drive. I have gotten the same error with ENM 1.4 and 1.3 independently. A ...model_selection.csv file is written, but it only contains the column headers--no data. I am happy to send you my script or whatever you need to diagnose the problem.

Dan Warren, April 16, 2013 at 5:40 PM
My guess is that there's something up with the .csv script file. Could you send it to me?

E., October 1, 2013 at 9:21 AM
Hi Dan, I am using ENMTools to compare my Maxent models and select the best one. I am getting this message in the black command window: "Found probability of -9999". I was wondering whether that indicates an error or is just a normal message. I am using raw output.
Thanks in advance.

Dan Warren, October 1, 2013 at 2:07 PM
That just means that one of your points falls on a nodata value in the raster for some reason. It's not anything to worry about unless you have a whole lot of them.
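To find out how many points are affected, one option is to look up each occurrence point in the .asc grid yourself. The Python sketch below is only an illustration and is not how ENMTools does it. All function names are hypothetical. It assumes a plain ESRI ASCII grid whose header uses `xllcorner`/`yllcorner` (not `xllcenter`/`yllcenter`), and it counts points that land outside the grid along with points on nodata cells.

```python
def read_ascii_grid(lines):
    """Parse an ESRI ASCII grid given as an iterable of text lines."""
    header = {}
    rows = []
    for line in lines:
        parts = line.split()
        if not parts:
            continue
        if parts[0][0].isalpha():
            # Header lines look like "ncols 2" or "NODATA_value -9999".
            header[parts[0].lower()] = float(parts[1])
        else:
            rows.append([float(value) for value in parts])
    return header, rows


def value_at(header, rows, x, y):
    """Return the cell value at map coordinate (x, y), or None if outside."""
    cell = header["cellsize"]
    col = int((x - header["xllcorner"]) // cell)
    # Row 0 is the northernmost row, so count down from the top edge.
    top = header["yllcorner"] + header["nrows"] * cell
    row = int((top - y) // cell)
    if 0 <= row < len(rows) and 0 <= col < len(rows[row]):
        return rows[row][col]
    return None


def count_nodata_points(header, rows, points):
    """Count (x, y) points that hit a nodata cell or miss the grid."""
    nodata = header.get("nodata_value", -9999.0)
    return sum(
        1 for x, y in points if value_at(header, rows, x, y) in (None, nodata)
    )
```

A typical call would read the file with `open(path)` and pass the file object straight to `read_ascii_grid`, then pass a list of (longitude, latitude) pairs to `count_nodata_points`.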

Unknown, January 28, 2016 at 11:27 AM
Hello,

I've done some searching and haven't found a specific answer to this question. I know I need a line in the model selection script for each csv/asc/lambdas set. So if I'm running multiple replicates (say 10) for one model, will my input script look like the following, and so on for each replicate of this model?

c:\data\points.csv,c:\data\species_0.asc,c:\data\species_0.lambdas
c:\data\points.csv,c:\data\species_1.asc,c:\data\species_1.lambdas
c:\data\points.csv,c:\data\species_2.asc,c:\data\species_2.lambdas

Will I then include similar lines for a separate model that also has 10 replicates?

Thanks!

Dan Warren, January 28, 2016 at 11:52 AM
Yes indeed!
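Writing those lines by hand gets tedious with many replicates, so here is a minimal Python sketch that generates them. It is not part of ENMTools. The function name is hypothetical, and the `name_0.asc` / `name_0.lambdas` naming pattern is an assumption taken from the question above, so adjust it to match your actual Maxent output. It uses `PureWindowsPath` so the Windows-style paths are handled the same way on any operating system.

```python
from pathlib import PureWindowsPath


def replicate_script_lines(points_csv, model_dir, model_name, n_replicates):
    """Build one 'points,asc,lambdas' script line per replicate."""
    points = PureWindowsPath(points_csv)
    folder = PureWindowsPath(model_dir)
    # The post says relative paths will not work, so refuse them here.
    if not (points.is_absolute() and folder.is_absolute()):
        raise ValueError("script entries must be fully qualified paths")

    lines = []
    for rep in range(n_replicates):
        stem = f"{model_name}_{rep}"  # assumed replicate naming pattern
        asc = folder / f"{stem}.asc"
        lambdas = folder / f"{stem}.lambdas"
        lines.append(f"{points},{asc},{lambdas}")
    return lines
```

Joining the returned lines with newlines and saving them as, say, myscript.csv gives a script file in the format described in step 3. A second model's replicates can be appended by calling the function again with a different model name.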

Unknown, February 24, 2016 at 4:59 PM
Hello,

I am trying to use the background test in ENMTools, but keep getting the same error (below). It appears that the program is creating a number of empty files in my output folder and then not being able to reopen them. My input files are .csv files with two columns with the headers "LAT" and "LONG". Any idea what the problem might be? Thanks in advance for any help that you can give!

Can't open C:/Users/rtelemeco/Documents/Alligator Lizards/Niche Modelling/ENMTools_ElgariaOutput/34.23442.asc!!

Can't open C:/Users/rtelemeco/Documents/Alligator Lizards/Niche Modelling/ENMTools_ElgariaOutput/34.23442.asc!!

while executing
"::perl::CODE(0x36b5578)"
invoked from within
".b5 invoke "
invoked from within
".b5 instate !disabled { .b5 invoke } "
invoked from within
".b5 instate pressed { .b5 state !pressed; .b5 instate !disabled { .b5 invoke } } "
(command bound to event)

Unknown, July 16, 2016 at 2:00 PM
Hi, I know it is a simple question, but I just started to use ENMTools. When I try to build a null model using resample from raster, what should I put for the number of points per replicate? I have one species (different wolves) and 25,000 data points (GPS coordinates).

Dan Warren, July 16, 2016 at 2:49 PM
What are you building the null model for?

CAP, July 18, 2016 at 8:18 AM
Hi Dan
For some reason I don't know yet, I can't get a niche identity test to work using ENMTools v1.3 on Windows. I could do it last year, but now when I add two files for 25 replicates, I get the message "Niche identity test are finished" very quickly but I don't get any results. Can you please help me and let me know what I might have done wrong? Thanks!

Dan Warren, July 18, 2016 at 2:19 PM
Hard to diagnose just from this, but my guess is that Maxent isn't actually being run. Do you see the GUI pop up?

nishma, November 18, 2016 at 8:56 AM
This comment has been removed by the author.

Anna Namyatova, February 19, 2020 at 7:15 AM
I have a question about the application of ENMTools model selection. Does it make sense to compare models with different numbers of variables, or is it only meant for comparing models with different parameters? Are there any limitations on model comparisons with this tool?

Debanjan Sarkar, May 21, 2020 at 1:52 AM
This comment has been removed by the author.