User's Guide for MCestimate Matthew Etterson U.S. Environmental Protection Agency 12 September 2013 ------- Contents 1. Preliminaries 3 1.1 Disclaimer & distribution 3 1.2 What is MCestimate and why is it needed? 3 1.3 Installation 4 2. Data format 5 2.1 Required data 5 2.2 Formatting requirements 5 2.3 Example: SimData.xls 6 3. Getting your data into MCestimate 7 3.1. The File Menu 8 3.2 Importing nest data 9 3.3 Viewing import errors 13 3.4 Re-describing the data after fixing errors 14 3.5 Opening a saved nest file 14 3.6. Displaying summary information 15 3.7 Exporting a file for analysis in program MARK 15 4. Modeling nest failure 16 4.1 Specifying the response distribution 16 4.2 Overview: designing models 17 4.3 Building models by specifying covariates 17 4.4 Importing models from a file 21 4.5 Running the models 22 4.6 The Pending models context menu 22 ------- 5. Interpreting model results 24 5.1 Model convergence 26 5.2 Parameter estimates 26 5.3 Goodness-of-fit 30 5.4 Example. Assessing Goodness-of-fit 32 5.5 Exporting MCestimate results 35 5.6 Visualizing MCestimate results 37 6. Managing results tables and model lists 38 6.1 Saving an AIC Table 39 6.2. Saving a model list 39 6.3 Managing lists 39 Acknowledgments 40 Literature cited 40 1. Preliminaries 1.1 Disclaimer & distribution Thank you for your willingness to try MCestimate. This is a work in progress, so expect some bugs! If you report them I will do my best to fix them quickly. However, your use of this program is at your own risk. Matt Etterson etterson.matthew@epa.gov 218-529-5158 1.2 What is MCestimate and why is it needed? MCestimate is a tool for estimating probabilities of failure in the presence of competing risks. It was originally developed to facilitate the study of causes of avian nest-failure. However, competing risks arise in virtually all stochastic processes and MCestimate can be applied in many contexts outside of avian nest failure. For example, carcass distribution trials are a commonly used technique for understanding either scavenging rates or searcher efficiency to better understand animal mortality as a result of collisions with anthropogenic structures such wind turbines, automobiles, power lines, or 3 ------- windows. The two fates (scavenged versus discovered) can be viewed as competing risks. Similarly, standard toxicity tests generally include a control treatment in which some animals die due to natural causes or experimental stress that is not related to the toxicity of the test material. These two sources of mortality (intoxication versus control) are competing risks. MCestimate can be (indeed is being) used to understand both of these processes. However, for historical reasons and for simplicity of presentation, this user's guide, and many of the menus in MCestimate explicitly refer to the analysis of nest survival data. MCestimate uses nest-monitoring data to estimate daily and overall probabilities of nest-failure due to specific causes (competing risks). It is programmed in MATLAB (Mathworks 2012a) and compiled as a standalone program. However, other good computer programs (White and Burnham 1999, Dinsmore et al. 2002) and code for statistical software (Stanley 2000, 2004a; Shaffer 2004) already exist for estimating daily survival rates for nests when nest outcomes are classified binomially (success/fail). Fewer methods exist for nests with multinomial outcomes (success, depredated, adverse weather, abandonment, etc.). Multinomial methods are slightly trickier than their binomial counterparts because estimating the probabilities of failure due to a given cause, requires that you correctly consider the probabilities of not failing due to all other possible causes. When combined with the need to account for nests not being monitored daily, this gets trickier still. MCestimate does this using the Markov chain algorithms described by Etterson et al. (2007a, 2007b). Like other current nest survival methods, MCestimate is a generalization of Mayfield (1961, 1975) methods for estimating daily probabilities of failure. The primary generalizations of MCestimate over the Mayfield (1961, 1975) methods are: 1. MCestimate does not require exact knowledge of the timing of failure events (i.e., daily nest-monitoring is not required). 2. MCestimate allows separate estimation of daily probabilities of failure due to multiple causes (competing risks, or cause-specific failure). 3. MCestimate allows separate specification of covariates to different fates to explore hypotheses about the influence of environmental conditions on cause-specific failure rates. MCestimate is also a simple multistate model (see Hougaard 1999 and Pradel 2005 for details on multistate and multievent models). Other software for fitting multistate models is also available (Choquet et al. 2004, 2009), which can accommodate more general models than MCestimate. MCestimate has been designed for easy use with an intuitive graphical user interface. 1.3 Installation MCestimate installation requires 2 files. The MATLAB Compiler Runtime (MCRInstaller.exe) installs a library of MATLAB functions that are used by components built in the MATLAB environment. The MCR is specific to the version of MATLAB in which MCestimate was compiled. This build of MCestimate used version 7.14.0.739, which is associated with MATLAB 2012a (32-bit). This file can be downloaded from the Mathworks website (http://www.mathworks.com/products/compiler/mcr/). Installing the MCR is a one-time event and should not need to be repeated until MCestimate is compiled under a new version of Matlab. Note that to install the MCR you will need administrative rights to your computer. To install, double click on the file MCRInstaller.exe. This will call a setup program that you should click through. Once the MCR is installed you should be able to run MCestimate by double-clicking the file MCestimate.exe. The initial distribution ([insert URL for MED website, (pending)]) consists of the following four files: 4 ------- 1. MCestimate.exe 2. MCestimateUsersGuide.pdf 3. SimData.xls 4. SimDataBad.xIsx 2. Data format MCestimate is designed to import nest monitoring data from Microsoft® Excel. For importing, your nest data should be arranged with one nest per spreadsheet row. The order in which columns occur is somewhat flexible, but with the following rules: 1) the first (left-most) column is reserved for an alphanumeric nest identifier, and 2) the monitoring data (the actual nest visit dates) must be in the right-most columns. 2.1 Required data 1. Row of unique headers for all columns. 2. Column giving the ultimate fate of each nest (successful, depredated, weather, etc.). You may have as many fates as you wish. However, "unknown" is a treacherous one. Be careful with it! 3. Columns (at least two) giving the dates on which each nest was visited. a. The first column gives the first date on which the nest was observed to be active. b. The last column gives the date of the last visit to the nest. Because there are many ways to code and interpret nest-monitoring data, I recommend the following rules. Note that I define an "observation" on a nest to be a set of two successive nest visits. i. For all nests, the first observation should begin with the first visit on which a nest was known to have contents that could fail (eggs or nestlings). The first observation should not begin with nest construction. ii. For all nests, if a nest was sufficiently developed to have fledged during an observation, then the second visit of the observation should be discarded and the fate of the nest should be recorded as its state on the first visit of the observation (presumably "successful"). To see why this is important, read Stanley (2004b). Optional data include: 4. Columns giving covariate data (year, treatments, etc.) 5. Column giving the age of a nest on the first visit (where "first visit" is defined as 3.b.i., above). 2.2 Formatting requirements Two example data sets (SimData.xls & SimDataBad.xIsx) are provided in the dropbox foloder (Fig. 1). The data in SimData.xls were simulated using fewer covariates than are actually provided in the data. SimDataBad.xIsx was created from SimData.xls by changing two data values to produce two common 5 ------- types of errors. Each of the examples in this user's guide uses one of these two files, so you should be able to reproduce the examples exactly. You might also have some fun trying to figure out which covariates I actually used to generate the data (and what the "true" parameter values were). 1. Dates should be given in the following format "Ol-Apr-09." Deviations may work (e.g. "01/04/09"), but employ them at your own risk! 2. There must be no missing data from any columns used for analysis (except visit dates). For now this is a very strict limitation. Cells with missing data will be assigned to the status "NaN" (not a number). If you import a spreadsheet with missing data, the program will display an error message and you will be unable to perform any analyses until the missing data is supplied or the nests with missing data are removed. 3. For each nest, visits are numbered sequentially beginning with the first active visit to a nest and ending with the last visit. Thus there won't be the same number of visits to each nest. All cells to the right of the last visit to a nest should be left blank. 4. The nest data must be on the first worksheet in your Excel Workbook. Safest thing is to delete all other worksheets. 5. Data may be saved in Excel 1997-2003 Format (*.xls) or Excel 2007 Format (*.xlsx, default). 6. Headers cannot include an asterisk. In general I'd recommend avoiding characters other than numbers and letters. I'd also recommend avoiding having one header that is a proper subset of another (e.g., don't have "Fl" and "F12"). It would probably be fine, but it might not... 2.3 Example: SimData.xls Figure 1. Example data SimData.xls njj J""" ' —' Horn* Irv.-rt Pagt Urout Formula! Data RnWw unv Aoo.lm » 1 2 3 10 11 M =i a J ,^_ :: uJ^t "ft™* Cv££T s££n A B C D E F G nest late yr hi in age VI 1 depredated 1 14.9 to 6 8-May-01 2 weather 0 11 0 to 9 15-Jun-OO 3 weather 0 9 1 to 1 15-Jun-OO 4 survive 0 9 7 hi 15 1-Jul-OO 5 survive 1 11 1 hi 11 12-Jun-OI 6 depredated 0 64 to 2 12-Jun-OO 7 depredated 0 10.7 hi 2 3-Jun-OO 8 depredated 0 13 4 hi 11 14-Jun-OO 9 survive 0 4 7 to 2 3-Jun-OO 10 depredated 1 14 5 to 12 28-Jun-01 12 11 survive 0 11 hi 14 12-JuKJO 13 12 weather 0 11 2 to 12 9-Jun-OO 14 13 survive 0 13 8 hi 11 2-Jul-OO 15 16 tee 187 IB 189 19 491 492 i-i - 494 495 191 497 196 499 ro- 501 502 503 14 depredated 0 63 to 5 4-Jun-OO 15 depredated 1 4 4 to 14 7-Jun-01 485 depredated 1 12.7 to 12 21-May-01 486 depredated 0 3 7 hi 6 19-Jun-OO 487 weather 1 13 1 hi 14 9-Jun-01 488 depredated 1 69 to 1 6-May-01 489 survive 1 5 8 to 8 24-Jun-01 490 depredated 1 80 to 12 19-Jun-01 491 depredated 0 94 to 4 22-May-OO 492 depredated 1 11 1 hi 6 24-May-01 493 survive 0 7 3 to 8 5-Jul-OO 494 survive 0 39 hi 15 15-Jul-OO 495 survive 0 13 5 hi 13 18-Jun-OO 496 survive 0 3 6 hi 3 22-Jun-OO 497 depredated 1 10-5 hi 5 ir>May-Ol 498 survive 0 10 9 to 4 30-Jun-OO 499 depredated 1 1 1 7 hi 7 7-Jul-01 500 survive 1 6 6 to 2 20-Jun-01 unuUBv HJUig! _ . H V2 12-May-01 19-Jun-OO 19-Jun-OO 4-JUI-OO 14-Jun-01 16-Jun-OO 7-Jun-OO 1 7-Jun-OO 7-Jun-OO 2-Jul-DI 15-Jul-OO 1 3-Jun-OO 6-Jul-OO 8-Jun-OO 9-Jun-01 23-May-01 21 -Jun-00 1 3-Jun-01 9-May-01 27-Jun-01 21-Jun-01 25-May-OO 26-May-01 7 Jul-00 18-Jul-OO 22-Jun-OO 24-Jun-OO 12-May-01 3-Jul-OO 10-Jul-01 24-Jun-01 i ;: . V3 21-Jun-OO 21 -Jun-00 8-Jul-OO 17-Jun-01 18-Jun-OO 9-Jun-OO 21 Jun-00 11-Jun-OO 6-Jul-01 1 9-Jul-OO 15-Jun-OO 9-Jul-OO 11-Jun-OO 12 JuiiO! 25-Jun-OO 16-Jun-01 1-Jul-Q1 24-Jun-01 27-May-OO 20 l.'ay 01 9-Jul-OO 21-JuUX) 26-Jun-OO 27-Jun-OO 14-May-oi 7-Jul-OO 215 .inn 01 V4 25-Jun-OO 21-Jun-01 22-Jun-OO 23-Jun-OO 1 3-Jun-OO 1 7-Jun-OO 12-JUr-OO 27-Jun-OO 5-JuM31 27-Jun-01 29-May-OO 1-Jun-01 12-Jut-OO 24-Jul-OO 29-Jun-OO 1 -JuMJO 16- May -01 10-Jul-OO 1-Jul-01 • K L M N 0 P V5 V6 W V8 V9 V10 28-Jun-OO 2-Jul-OO 4-Jul-OO 7-Jul-OO 24-Jun-01 27-Jun-OO 16-Jun-OO 18-Jun-OO 20-Jun-OO 23-Jun-OO 20-Jun-OO H 1-JuHX) 5-Jul-OQ 9-Jul-01 29-Jun-01 2-Jun-OO 5-Jun-01 15-Jul-OO 18-Jul-OO 20-Juf-OO 3-Jul-OO 7-Jul-OO 11-Jul-OO 13-JuMX) 19-May-O! 22-May-O! 25-May-Ol 29-May-Ol 1 3-Jul-OO 1 7-Jul-OO 20-Jul-OO 3-Jul-01 6-Jul-01 10-Jul-01 ------- Fig. 1, above, shows portions of the sample data file SimData.xls. The sample file has 16 columns of data as described in Table 1. Table 1. Simulated nest-monitoring data in SimData.xls Column header Description nest Unique identification number for a nest. Can be any combinations of letters, spaces, and numbers. fate Ultimate fate of the nest (assumed to be determined on last visit). yr Year of study in which nest occurred, to be used as a covariate. ht Nest height (m) above ground. trt A treatment thought to influence predation rate (hi vs. lo). age Age (d) of the nest on the first visit (VI). V1-V10 Columns of dates of nest-visits to nest. VI is the first visit on which nest could have failed (i.e. the first visit on which the nest had eggs or nestlings observed). 3. Gettingyour data into MCestimate Start MCestimate by double-clicking MCestimate.exe. You should now be looking at the main MCestimate window (Fig. 2). Figure 2. The main MCestimate window at startup File Analysis About Expiration date: - Status Ready Fitted models: Decimal precision: 2 Response Variable: Interrupt O Binomial (success/fail) O Multinomial Pending models: Model name ML AICc | delta AICc | Weight ------- Operations concerned with importing, managing, and formatting data for MCestimate are initiated via the File menu in the upper left corner (Table 2). The Analysis menu is described in Section 4: Modeling nest failure. After analysis, most operations are initiated via the Fitted models context menu (Section 5). The About menu provides information on the build date of the current version. Ultimately it will also provide a recommended citation. Table 2. MCestimate menus Menu Item Description File Analysis About Set of commands for importing and formatting data for MCestimate. Set of commands for performing analysis and managing output. Information about MCestimate. 3.1. The File Menu The File menu in MCestimate has 8 choices most of which are designed to assist with importing, formatting , viewing, and exporting data (Table 3, Fig. 3). Table 3. The File menu Menu Item Description Set working directory Import data View file Open data file Display summary info Export MARK file Clear data Exit Sets the default directory for files for a given project. Opens a file browser to select a data file for importing. If a file is chosen, subsequently launches the DescribeData GUI (Fig. 4, below). Displays the MCestimate version of the Excel file. This is used for fixing data errors. Opens a properly formatted MCestimate file created via the Import data option. Displays summary information about the current data file. Exports an encounter history file (*.inp) for analysis in program MARK. Clears all nest data from memory. Do this before opening another nest file. Exits MCestimate. ------- Figure 3. The Main MCestimate window showing the File menu Analysis About Set working directory Import data View Pile Open data file Display summary info Export MARK file Clear data -Status- Ready j Decimal precision: 2 Interrupt Model name Model name delta AICc Weight 3.2 Importing nest data To import a spreadsheet into MCestimate, choose the Import data option from the File menu. This will call up a typical windows file browser (Fig. 4) that will display only Excel 2007 (*.xlsx) files. Excel 1997- 2003 (*.xls) files can be viewed by changing the type at the bottom of the file browser (Files of type:). Figure 4. Select your data file browser Select your data file. f?l(E] Look in: a My Recent Documents \J Desktop » My Documents ;l My Compute* ^ My Network Places LJ MCestimale _~] •- S) d* B- S}simData.xls "S]5imDataBad.xlsx File name: _^j Open Files of type: | (".*"! ^\ Cancel ------- After you choose your file and click open, MCestimate will automatically open the DescribeData window (Fig. 5). If you load SimData.xls, your DescribeData window should look like Fig. 5. Figure 5. The DescribeData window Variables: nest fate y ht lit age V1 V2 V3 V4 V5 V6 V7 V8 V9 V10 * - Observed States: State variable » | Monitoring data: 1 1 All Fates: A v Success » Column indicating first observation » Age of subject on first observation - Save to MCestimate file: Source file: — Independent variables (covariates)- Categorical predictors: Save MCestimate file Clicking the above button will open a standard dialog box for saving files. Reset Close When the DescribeData window loads, the data columns of your spreadsheet should be listed in the Variables listbox and all other boxes should be empty. Before you can analyze your data, you must identify its structure to MCestimate. Not all of the controls on this page must be used. The required controls are: 1. You must identify the column containing nest outcomes (State variable). In SimData.xls (Figs. 1, 5) it is the variable labeled fate (but you may use any label you wish). Enter it by selecting the variable in the Variables box and clicking the Fate variable » button. Once you do so, the All Fates listbox will contain a list of all entries in that column of the spreadsheet. If you make a mistake, don't worry. In most cases you can simply highlight the correct variable and click the button again and the incorrect entry will be replaced. If you manage to screw up completely, then you can click the Reset button and start over! 2. You must indicate which one of the fates corresponds to a successful nest. Do this by highlighting that fate in the All Fates listbox and clicking the Success » button. 10 ------- 3. You must identify the monitoring data by indicating which column contains the first nest visit. This is done by highlighting the proper variable (in SimData.xls it is VI, though again you may use any headers you wish) and clicking the button Column indicating first observation. 4. You must name the data and save it as a MCestimate file. Do this by clicking the Save MCestimate file button. This will close the DescribeData window and you should see a message in the main MCestimate window Status box "Parsing data ..." If you have a lot of data this could take a minute or two. After parsing, the DescribeData window will close and a file browser will open to allow you name and save the file. Once the file is saved, a status message will be displayed (Fig. 6). If the parsing algorithm fails, the file browser will not open and a status message will let you know about data formatting problems. Dealing with failed imports is the topic of Sections 3.3 and 3.4 below. Figure 6. Status message after a successful import CHI The optional controls are: 5. Age of subject at first visit: If this is known it can be entered using the button of the same name. 6. Categorical predictors: This listbox is used to identify categorical covariates. 7. Quantitative predictors: This listbox is used to identify quantitative covariates. Note: Integer- coded covariates may be entered as either categorical, quantitative, or both. If you code an integer-valued variable in both ways for the same analysis MCestimate will either append (c) for categorical, or (q) for quantitative to the name of the variable. For most integer-coded variables, the choice will be obvious about whether to specify that variable as categorical versus quantitative. However, for ordinal variables, in which the codes 1, 2, 3,..., n indicate a meaningful progression (e.g., over time, as in years, or from small to big, as in size classes), you might wish to explore both kinds of effect. The best example I can think of to justify exploring both kinds of effects in a single variable is with a "year" effect. Years may randomly fluctuate in predation pressure or weather events, arguing for a categorical coding. On the other hand, years may follow a trend, arguing for a continuous effect. In the end the choice should best reflect why you want to use the variable as a covariate. Example: importing SimData.xls From the File menu choose Import Data and open SimData.xls. Then, using the DescribeData window, add variables following the rules above until the DescribeData window looks like Fig. 7. Then click Save MCestimate file. 11 ------- Figure 7. The DescribeData window for the importing data example Variables: inest tfl ht age V4 r JV6 V7 JVB V9 V10 Observed States: - Monitoring data: - State variable » Column indicating tirst observation » Age of subject on first observation » | - Independent variables (covariates) - Save to MCestimate file: - Source (lie: Save MCestimate tile Clicking the above button will open a standard dialog box for saving files. Example: importing SimDataBad.xlsx This example is designed to illustrate the primary causes for failures in data importing. They are 1) nest observations lasting zero or negative days, and 2) missing data. The parsing algorithm searches for both kinds of errors. If either occurs then the status dialog reports the identification numbers of all nests having one or the other error. For an example, follow the steps above to import data, but this time choose SimDataBad.xlsx. After parsing, the status dialog should look like Fig. 8 (below). Figure 8. Status message after a failed import ImportErrors Your data contain errors that will crash MCnestimate. Nests with missing data: Nests wJh observations < 1 day: 1 1 1 11 1 12 ------- 3.3 Viewing import errors If you have many errors, you will probably want to edit your Excel file and re-import the data. However, if there are only a few errors, or you don't wish to alter your Excel file, you may do some limited editing in MCestimate. This is done by choosing View file under the File menu. Doing so for this example will produce a new window that should resemble Fig. 9. Figure 9. The imported nest file for SimDataBad.xIsx Edi 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 123456789 nest fate yr ht trt age VI V2 V3 1 depredated 1 14.861410 65/8/2001 5/12/2001 NaN 2 weather 0 10.967410 96/15/2000 6/14/2000 6/21/2000 3 weather 0 9.0823 lo 16/15/2000 6/19(2000 6/21/2000 4 survive 0 9. 6552 hi 157/1/2000 7/4/2000 7/8/2000 5 survive 1 11. 11 72 hi 116/12/2001 6/14/2001 6/17/2001 6 depredated 0 6.4027IO 26/12/2000 6/16/2000 6/18/2000 /depredated 0 10.6757 hi 26/3/2000 6/7/2000 6/9/2000 8 depredated 0 13.4008 hi 116/14/2000 6/17/2000 6/21/2000 9 survive 0 4.6667IO 26/3/2000 6/7/2000 6/11/2000 10 depredated 1 14.516510 126/28/2001 7/2/2001 7/6G001 11 survive 0 NaNh] 1 4 7/1 2/2000 7/15/2000 7f1 9/2000 12 weather 0 11210110 126/9/2000 6ft 3/2000 6/15/2000 3 survive 0 13 8447 hi 11 71212000 7 IS/2000 7/9/2000 4 depredated 0 6.275210 56/4/2000 6/8/2000 6/11/2000 5 depredated 1 4.361110 146/7/2001 6/9/2001 6/1 2/2001 6 weather 1 5 6608 hi 37/2/2001 7/5/2001 7/8/2001 7 depredated 0 14 0732 lo 46/18/2000 6/22(2000 NaN 8 survive 0 14. 5480 lo 116/24/2000 6/27/2000 6/30/2000 19 weather 1 9.616710 116/28/2001 7/1/2001 7/3/2001 20 survive 1 13.847210 55/12/2001 5f!5/2001 5/17/2001 21 survive 0 58240IO 37(2/2000 7/6/2000 7/9/2000 22 survive 1 13.3541 hi 86/5/2001 6/12/2001 6/15/2001 23 survive 0 8. 41 95 hi 87/4/2000 7/8/2000 7/12/2000 24 survive 1 13.513810 105/14/2001 5/18/2001 5/21/2001 25 depredated 1 14. 8732 hi 136/8/2001 6/10/2001 6/12/2001 26 depredated 0 13. 6630 hi 55/16/2000 5/19/2000 NaN 27 survive 1 9.201 2 hi 147/3/2001 7/5/2001 7/7/2001 < 10 V4 NaN 6/24/2000 6/25/2000 NaN 6/21/2001 6/22/2000 NaN 6G3/2000 6/1 3/2000 NaN NaN 6/17/2000 7/12/2000 6/15/2000 NaN 7/1 1 /2001 NaN 7/2/2000 7/6/2001 5/21 /2001 7/13/2000 6/18/2001 7/15/2000 5/23/2001 6/16/2001 NaN 7/10/2001 11 12 13 V5 V6 V7 NaN NaN NaN NaN 6/28/2000 7/2/2000 7/4/2000 NaN NaN 6/24/2001 NaN NaN NaN NaN NaN 6/27/2000 NaN 6/16/2000 6/18/2000 6/20/2000 NaN NaN NaN NaN 6/20/2000 NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN 7.6/2000 NaN 7/10/2001 NaN S/23/2001 5/27/2001 5/31 /2001 7/17/2000 7/21/2000 6/22/2001 NaN 7/1 7/2000 7/20/2000 5/27/2001 NaN NaN NaN NaN NaN 7/1 2/2001 NaN t v > •* The status message (Fig. 8) reported an observation lasting zero or fewer days for nest 2. Looking across the visit dates for nest 2, you can see that the date in row 3, column 8 is a day earlier than that in row 3, column 7, which will generate an observation lasting -1 days. Similarly, the status message reported missing data for nest 11. MCestimate represents missing data as NaN (not a number). Looking across row 12, NaN appears in column 4 (ht). The correct values for these cells are 6/19/2000 and 7.2, respectively (Fig. 1). These values can be typed directly into the cells. Alternatively you can use the Edit menu (Fig. 10). If you simply want to remove all nests with errors, you can use the first two choices. If you want to look at each error individually you can use the third or fourth choices Delete current cell and Delete current row(s). In particular, try selecting the cell in row 3, column 8 and choose Delete current cell. This will remove the value in the selected cell and shift leftward by one cell all cells to the right of the selected cell. Because the incorrect date is nested within a sequence of successful (survived) observations, deleting that particular cell will not affect the estimation results (but this will change in future versions of MCestimatel). Choosing Delete row will remove the entire row (nest) from the data. Deleting a row will always change the estimation results. 13 ------- Figure 10. The Edit menu in the NestFile window Remove all nests with missing covariates Remove all nests with an observation < 1 day Delete current cell Delete current rowi(s) 3674 lo .0823 lo 6 5)8/2001 96/15/2000 1 6/15/2000 5)12/2001 6)14)2000 6)19)2000 6/21/2000 6/21/2000 6/24/2000 6125/2000 NaN NaN NaN NaN 0 depredated 0 depredated 0 1 depredated 0 1 survive 0 depredated 1 1 survive 0 weather 0 1 survive 0 1 depredated 0 depredate.:! 1 weather 1 depredated 0 1 i survive 0 1 9 weather 1 20 survive 1 1 21 survive 0 22 survive 1 1 23 survive 0 24 survive 1 1 25 depredated 1 1 26 depredated 0 1 .6552 hi .4027 lo .6757 hi .4008 hi .666710 .516510 NaN hi .2101 lo .8447 hi .2752 lo .361 1 lo .6608 hi .0732 lo .5480 lo .616710 .8472 lo .8240 lo .3541 hi .41 95 hi .51 38 lo .8732 hi .6630 hi 27 survive 1 9.201 2 hi 157/1/2000 26/12/2000 2 6/3/2000 11 6/14/2000 2 6/3/2000 126)26/2001 147/12/2000 126)9/2000 11 7/2/2000 5 6/4/2000 146/7/2001 3 7/2/2001 4 6/1 8/2000 11 6/24/2000 11 6/28/2001 55/12/2001 3 7/2/2000 8 6)8/2001 8 7/4/2000 105/14/2001 136/8/2001 55/16/2000 14 7)3/2001 7/4/2000 6/16/2000 8)7)2000 8)17/2000 6/7)2000 7/2/2001 7/15)2000 6/13/2000 7)5/2000 6)8/2000 6)9/2001 7)5/2001 6/22/2000 6/27/2000 7)1/2001 5)15/2001 7)6/2000 6/1 2)2001 7)8)2000 5/18/2001 6/10/2001 5/19)2000 7)5/2001 7/8/2000 6/18/2000 6/9GOOO 6/21/2000 6/11/2000 7/6/2001 7/19/2000 6/15/2000 7/9/2000 6/1 1 /2000 en 2/2001 7/8/2001 6/30/2000 7/3/2001 5/17/2001 7/9/2000 6/15/2001 7/12/2000 5/21/2001 6/12/2001 7/7/2001 NaN NaN NaN 6)22/2000 NaN NaN NaN NaN NaN 6)23)2000 6/27QOOO NaN 6/13)2000 6/16/2000 6/18)2000 6/20/2000 NaN NaN NaN NaN NaN NaN 6/1 7)2000 6/20/2000 NaN 7/12)2000 NaN NaN 6/1 5/2000 NaN NaN NaN NaN NaN 7/11/2001 NaN NaN NaN NaN NaN NaN 7)2/2000 7)6)2000 NaN 7)6/2001 7/1 0/2001 NaN 5/21 Q001 5/23/2001 5/27/2001 5)31/2001 7/1 3)2000 7/1 7/2000 7)21 )2000 6/18/2001 6/22Q001 NaN 7/15/2000 7/17/2000 7/20/2000 5/23/2001 5)27/2001 NaN 6/16/2001 NaN NaN NaN NaN NaN NaN 7/1CW2001 7/12)2001 NaN Some final notes about data errors: 1) MCestimate will only look for errors in columns that you specify for analysis. It ignores other columns. For example, had you omitted specifying ht as a covariate in the above example, no error would have been generated for the missing value in the cell in row 12, column 4 (though nest 2 would still have generated an error for an observation of less than 1 day). However the displayed nest file would be unchanged (i.e., it would still look like Figs. 9 & 10). 2) Another possible error, having non-numeric data in a column specified for a quantitative covariate, will crash MCestimate. 3.4 Re-describing the data after fixing errors If you make changes to the data file in the NestFile window MCestimate will automatically try to parse the data again, and if you haven't fixed the errors (or if you've created new ones!) then MCestimate will display the same error again. You cannot perform any analyses until you successfully navigate the import screens and get notification of a successful import (e.g., Fig. 6). 3.5 Opening a saved nest file Once you've successfully described your data, the MCestimate window will display a file name under Data file. The file has been saved to your hard drive and, in future sessions, can be opened directly in MCestimate (without going through the import and describe data processes above). Or you can begin analysis immediately. If you wish to use a previously imported data set choose Open nest file from the File menu. 14 ------- 3.6. Displaying summary information If you have a nest file open that has been successfully parsed, then you can quickly view summary information about the file by choosing Display summary info from the File menu. Doing so for the imported data SimData.xls should look like Fig. 11. Figure 11. The summary information screen for SimData.mat ,' SimData.mat Source file = SirnData. Number of subjects = 500. Fledge = 211 depredated = 191 weather = 98 Number of observations = 1737. Number of exposure days = 5113. Effective sample size = 4538 days. Beginning of first observation = 03-May. End of last observation = 23-Jul. Length of season = 83 days. Duration (mean W- sd) of observations = 2.94 W- 0.32 days. Number of categorical covariates = 2. Number of quantitative covariates = 4. Was information provided on age?: VE9. 3.7 Exporting a file for analysis in program MARK You may wish to see whether you get the same answer in MCestimate that you get using program MARK (White and Burnham 1999). For binomial fates the two programs should give virtually identical results. I've tried to make this an easy comparison by adding the Export MARK file option. Choosing this option will prompt you for a file name (say FileName), then create two files, FileName.inp and FileName.txt. FileName.inp is an encounter history file formatted for immediate import into MARK. FileName.txt gives you some supporting information, including the number of encounter occasions (length of the nest season, defined from first active date to last nest-check) and the names of covariates (if you specified any in the DescribeData GUI). You will need to input the season length value into the Encounter occasions box in MARK. You may wish to add the covariate names under the Enter Ind. Cov. Names option in MARK. 15 ------- 4. Modeling nest failure Analysis is done through the Analysis menu, which has five options (Fig. 12, Table 4). Figure 12. The Analysis menu .' MCestimate d r_ Design models Load models from file Load AIC table Status Ready. c LLoad model list Manage lists Fitted models: Decimal precision: 2 Run models Interrupt Response Variable: O Binomial (success/fail) 0 Multinomial Pending model;: Model name Model name AICc delta AICc Weight Table 4. The Analysis menu Menu Item Description Design models Load models from file Load AIC table Load model list Manage lists Opens the ModelDesign window (Section 4.2 below). Imports a list of models from Microsoft Excel (Section 4.3 below). Opens a window for displaying previously saved results (Section 6 below). Opens a window for adding a previously specified list of models to run (Section 6 below). Opens a window displaying both previous results tables and model lists. This choice is for deleting saved results (Section 6 below). 4.1 Specifying the response distribution Before designing models for analysis you must specify whether you wish to analyze binomial or multinomial responses. The default is multinomial. If your fate variable contained only two fates to begin with then the two options will provide identical results. A note of caution! Changing the response variable during analysis will delete the current pending models list and results table (you will get a 16 ------- warning first). This is to prevent you from inadvertently creating an AIC table comparing multinomial and binomial models. 4.2 Overview: designing models Primary analysis is done using the ModelDesign window, which is accessed using the Design models option of the Analysis menu. With the ModelDesign window open (Fig. 13) you can design multiple models and then run them from the main MCestimate window. The default model is the Intercept-only model (no covariates, all failure rates temporally constant). MCestimate uses fairly conventional notation to name models. Each failure rate is designated as a basic model parameter and is specified as a function of specific covariates using parentheses. If no covariates are used then the parentheses brackets contain only a dot - indicating that the basic parameter is assumed constant across all nests and over time. This model, the intercept-only model, can be added to the list immediately by clicking the button Add to pending models. In the example above this will add the model weather(.)depredated(.) to the Model Set listbox. Figure 13. The ModelDesign window for SimData.mat as created in Section 3.2 Response Failure slates: Model Name: i.lfF.'rfiJ.-tfiJ r.|i.'eather(.) Add 1o pending models Remove from pending models Pending models: * v Covariates: yr c it hi Age | ^ | « Remove Main effects: ^^^™± nter act ions: J Set interaction ] Remove interaction 4.3 Building models by specifying covariates Your specified covariates appear in the Covariates listbox of MCestimate. Note that there are two covariates (Date and Age) in that box beyond what you specified in the DescribeData step. MCestimate creates Date automatically and Age if an Age at first visit column is specified on the DescribeData GUI. Fates may be modeled using different covariates. To specify which fate you are currently working on, 17 ------- click that fate in the Failure states listbox. This will set the title of the box below to: Predictors for fate: [fate]. Any covariates you specify will always be for the currently selected fate. The buttons for specifying a covariate to a fate change depending on whether you have selected a categorical or a quantitative covariate. Fig. 13 shows the display for a categorical covariate. If you click on a quantitative covariate in the Covariates listbox the display will look like Fig. 14. With the exception of Date and Age all covariates are assumed to be nest specific (i.e., constant within nests). Figure 14. The ModelDesign window when a quantitative covariate is selected. Model Name: Add to pending models Remove from pending models Pending models: Covariates: yr_c trt y^i 5ate Age v I Power » | 1 Log» ] e \ | — NegExp» ] e « Remove Main effects: V nteractions: • Set interaction Remove interaction 4.3.1 Adding a categorical covariate Categorical covariates may be added to a model by selecting them in the Covariates listbox and clicking Add ». 4.3.2 Adding a quantitative covariate Quantitative covariates may be added by selecting them in the Covariates listbox and clicking the button corresponding to the desired transformation. 18 ------- Power transformations To add a power of a quantitative covariate click Power». MCestimate applies the indicated power transformation (edit box to the right of Power ») to the quantitative covariate at run-time. The default is untransformed (power = 1). Any power may be entered. Note, however, that entering zero will simply generate a second intercept. Multiple powers of a given covariate may be entered sequentially (e.g., {height+heightA2...}, etc.). Log transformations Log transformations work in analogous fashion to the power transformations (described above). The edit box is for entering the desired base. The default is the natural logarithm (e). With the exception of e, only strictly positive numeric values can be entered in the base edit box. Another restriction is that the base cannot be 1. I'd recommend sticking to the bases 2, e, and 10. MCestimate will not add an arbitrary constant to zeros to allow log transformations. Instead, if you attempt to log-transform any quantitative covariate that contains data that are not strictly positive, you will get an error message and you will not be allowed to use the log transformation. Negative exponential transformations The NegExp transformation is intended to allow you to model thresholds in a quantitative covariate (i.e., a steep slope at small values but negligible additional effect at large values). The edit box is for entering the desired base. The default is the natural logarithm (e). The same restrictions apply to the base as with the log transformation. However, there is no restriction on the value of the covariate, except that very large numbers will result in "inf" errors that will crash MCestimate. The Date covariate MCestimate uses the dates you entered for visits (VI - V10 in simulated data, Fig. 1) to generate an ordinal design variable (always called Date) to enable you to model within-experiment temporal effects. It is the numbered day within the year (i.e., where day 1 = 1 January). Date is a quantitative covariate that varies within nests with every day of the season (whereas other quantitative covariates above are assumed fixed for a given nest). Ordinal dates are assumed to be non-leap year. In creating Date MCestimate removes the year (i.e., 2 July 2006 and 2 July 2007 would have the same ordinal date: 184). If you want a year covariate you must add it as a separate column in your Excel file. You could then ask whether the effect of date differs by year by specifying an interaction between the Date and year covariates. The Age covariate If you specify an age at first visit column, MCestimate generates an ordinal Age covariate in the same way that it generated Date. Thus, like Date, the Age covariate will change with every day within a nest. 19 ------- 4.3.3 Interactions Once you've added at least two covariates for a given fate to the Main effects listbox, you may specify an interaction. This is done by selecting all covariates you wish to be in the interaction in the Main effects listbox and clicking Set interaction. If you select more than two (say n) covariates then MCestimate builds all lower level interactions when generating the n-way interaction. If you remove a main effect that is specified in an interaction MCestimate will crash at run time. 4.3.4 Finalizing a model To build models you move covariates from the Covariates listbox to the Main effects listbox using the rules described above and specifying any desired interactions. As you do so, the Model Name box in the Models frame will keep track of the covariates you are adding and removing. When you are satisfied with the model for the current fate you can select another fate in the Failure states listbox and continue. Once you have specified the model you wish for all fates, then you can add it to the set for analysis by clicking the Add to pending models button in the Models frame. If you change your mind after adding the model to the model set, then simply select it and click Remove from pending models. After you select Add to pending models the ModelDesign window reverts to the opening state, with the default model being the intercept model. Fig. 15 shows the ModelDesign window with five models designed and ready for analysis. Figure 15. Some models designed in the ModelDesign window Model Name: depredatedC.JweatherQ | Add to pending models [ Pending models: Remove (rom pending models depredated!;.) weatherf.) A depredatedi; .)weather(yr_c) depredated(yr_c)weather(yi-_c) depredded(1rt)weatJTer(yrc)^ H Covariates: yT_c trt it Date 4ge fl |— Categorical Add» 1 « Remove Main effects: A v Ferncve interaction 20 ------- 4.4 Importing models from a file Models may also be typed into Excel and then imported to MCestimate. As an example, the above models have been typed into the file "ModelRunList.xIsx" (Fig. 16). To do so, Choose Load models from file from the Analysis menu and navigate the desired file. The resulting models will be loaded into the pending models window. In the model list file the fates may be listed in any order, however the syntax must match exactly the syntax that MCestimate uses for building files. Any models that do not conform to this syntax are ignored. Figure 16. Example model list for import to MCestimate » ModelRunList.xIsx - Microsoft Excel Home Insert Page Layout •^ j(, Calibri '111 LU ^ IB i nH^V7 Paste 3 bid H.l<3»~ Clipboard ^ Font Review View Add-Ins W - = Alignment Number A Styles Cells Z - Editing M < > M Sheetl Sheet2 Sheets Probably the easiest way to learn MCestimate model syntax is to build a few models using the model builder (Section 4.3). Here are the basic rules. Main effects are listed by name (yr_c, trt, Date, etc.). Additive models are indicated by separating main effects with a plus sign (+). Interactions are indicated by separating main effects by an asterisk (*). Intercept-only models are indicated with a period (.). Transformations of quantitative covariates are indicated differently, depending on the transformation. Power transformations of quantitative covariates are indicated with a caret symbol, followed by the desired power (e.g., DateA2). Log transformations differ depending on the desired base. If the natural log is desired, then the "In" notation is used, whereas if any other base is desired, then the notation logX(covariate) is used, where X is the base (e.g., loglO(Date)) indicates a common log of the variable Date). Similarly, the negative exponential transformation notation differs depending on whether the desired base is e or not. When the base is e, then the negative exponential transformation is indicated as exp(-covariate). For example, a negative exponential transformation of the Date variable is indicated as exp(-Date). However, if a different base is desired, the notation used is, for example, baseA(-covariate). The resulting transformations can be combined into additive and interactive models as described in the previous paragraph. 21 ------- 4.5 Running the models Once you've specified as many models as you wish, select Close to return to the main MCestimate window. You might wish to build the models in Fig. 15 to verify that you will get the same results from running MCestimate, and so you will be able to follow the results menus in the subsequent material. Upon closing the ModelDesign window, the main MCestimate window should look like Fig. 17, with the table of pending models appearing on the left of the window. To run the models click Run models. As each model runs, it will be removed from the Pending models table to the Fitted models table, which is a fairly typical AIC table. If you've specified too many models and want your computer back you can click Interrupt to stop the analysis. As each model is added to the AIC table, the table is reordered relative to the model with the lowest AIC value. The AIC values are corrected for sample size, where sample size is defined as the sum of the number of known days survived and the number of failed observations (Rotella et al. 2004). Figure 17. The main MCestimate window with models ready to run File Analysis About Expiration date: Status - Ready Decimal precision: Run models Interrupt Response Variable: — O Binomial (success/fail) © Multinomial Pending models: depredated(.)weather(.) depredated(.)weather(yr_c) depredated(yr_c)weather(yr_c) depredated(trt)weather(yr_c) 5 |depredated(Date+yr_c)weatherO Model name Model name NLL AICc delta AICc Weight 4.6 The Pending models context menu Before clicking Run models or while running is paused by clicking the Interrupt button, models can be removed from the Pending models table using the pending models context menu. This menu is accessed by selecting a cell in the Pending models table and right-clicking, which will bring up four choices (Fig. 18). Choosing Clear pending models will delete all of the pending models from the table. They cannot be recovered, except by going back to the DesignModels GUI and rebuilding them (or reimporting from a file). Choosing Export model list to Excel will open a dialog box for saving the Pending models list to an Excel file for later import. This might be useful, for example, if you wish to run 22 ------- the same set of models on a different data set. Choosing Save model list will first prompt you for a name for the model list then save the list for later use (Section 6.2 below) to the same MCestimate file in which your data are saved. This option may be removed in future editions as it is now redundant with the ability to save model lists to a distinct Excel file. Finally, the Delete model choice will delete all highlighted models in the Pending models table. Figure 18. The Pending models context menu > MCestimate File Analysis About Expiration date: - Status - Ready. Fitted models: Decimal precision: Interrupt Response Variable: — 0 Binomial (successrtail) <•} Multinomial Pending models: Model name Jepiedateduweatherf.) depredatedf ,)weather(yr_c) depredated(yr_c)weather(yr_c) depredated(trt)weamer(yr_c) depredated(Date+yr_c)weather(.) Mudel ri delta AICc Weight Clear pending models Export model list to Excel Save model list Delete model 23 ------- Figure 19. Models after they have been run File Analysis About - Expiration date' — 31-Dec-2014 Data file: SimData.mat Status - Ready Run models Interrupt Response Variable: O Binomial (success/fail) 0 Multinomial Pending models: Model name Decimal precision: depredated(Dste*yr_c)weather(.) depredated(trt)weather(yr_c) depredated( -)weather(yr_c) depredated(yr_c)weather(yr_c) 5 depredatedf ,)weather(.) Model name AICc delta AIO: Weight 962.01 968.77 970.46 970.23 973.72 1932.03 0 1945.54 13.51 1946.93 14.9 1948.47 16.44 1951.45 19.42 5. Interpreting model results Once you've run the model, the output table (Fitted models) gives you the negative log-likelihood (NIL) of the model, the corrected AIC value (AICC), the delta AICC value relative to the lowest AICC value in the set, the model Weight (calculated following Burnham and Anderson 2002), and the number of estimated parameters (K) for the model (Fig. 19). Models are sorted from lowest AICC to highest. The results given in the Fitted models table are only a start. Much more information can be gleaned using the context menu. To enable this menu you must select a row (any cell) in the Fitted models table and then right-click your mouse. This will bring up a context menu with two choices Model (Table 4, Fig. 20) and Table (covered in Section 6, below). Most of these options are fairly self-explanatory and can be understood by reading the descriptions in Tables 4 and 5. The Show parameter estimates window will probably be the most useful and I describe this option at length below. 24 ------- Table 4. The Model context menu in the Fitted models table Menu Item Description Rerun model Show convergence stats Show parameter estimates Assess goodness of fit Export estimates Delete model This reruns the model with a different (randomly chosen) starting point for the numerical optimization. This brings up a message box giving the convergence statistics for that particular model. This brings up a new window with each of the estimated parameters (Figs. 20-22 below). This brings up a new window that allows you to run a goodness-of-fit test. I have just added four goodness-of-fit algorithms (see Section 5.3). Exports the maximum likelihood parameter estimates, covariance matrix and the predicted probabilities for each nest over the period of observation (not over the life of the nest) to an Excel file. Deletes the models selected in the Fitted models table. Figure 20. The Model context menu File Analysis About Expiration date: Status Ready Decimal precision: Run models Interrupt - Response Variable: — O Binomial (success/fail) 0 Multinomial Pending models: Model name Model name NIL delta AICc Weight _ ted(trt)we; Bfc^JJ jBJdepredatedQweat Table > redated(yr_c)weather(yr_c ^Sjdepredatedf Jweather(.) Rerun model Show convergence stats Show parameter estimates Assess goodness of fit Export estimates 1932.03 1945,54 194693 1948.47 1951.45 0 13.51 14.9 16.44 19.42 Delete Model 25 ------- 5.1 Model convergence Occasionally MCestimate will fail to converge on a solution. This is more likely to happen the greater the dimensionality (number of parameters) of the model that is designed. Indicators of failed convergence include NaN in the NIL column of the Fitted models table or multinomial logit scale parameters that are equal to 10 (or minus 10), indicating that the optimization ran to a maximum (minimum). The latter is most likely to occur when all nests share the same fate. In particular, if you attempt to optimize a categorical covariate with many levels, it may occur by chance that all nests in that category share the same fate, which may result in the optimization running to a boundary condition (depending on what other covariates are in the model). If the optimization failed to converge due to a poor starting point you may try re-optimizing it from a different starting point. To rerun a model, select that model and chose model, rerun model in the Fitted models context menu. If you have reached a boundary condition due to invariant fates in a given category this will not help. In the latter case, your data do not provide sufficient information for estimation and you should not try to fit that model. The convergence criteria for each model are also available as a MCestimate menu option. To see the convergence statistics for a given model choose Show convergence stats under the model option in the Fitted models context menu. 5.2 Parameter estimates The most useful context menu choice is the Show parameter estimates choice under Models (Fig. 20), which loads the Farms window (Fig. 21). In the Farms window, parameter estimates are given at three scales (Multinomial logit scale, daily probabilities, and overall probabilities). Figure 21. The Parms window depredated(trt)weatherCyr_c) rWK, — =. - 9 Fate Parameter Estimate 5E mfri max fit 1 depredated Intercept -3.3040 0.1076 1 1 1 3 weather Intercept -4.0861 0.1 484 1 1 1 * weather yr_c: 1 0 .5227 0 .2028 0 1 0 O Date calculator z -value tor confidence intervals:! 1.96 ) decimal precision: I 4 1 Parameter MLE 5E L_CL U_CL Jlsuccess 0.9492 0.0043 0.9365 0.9594 2 depredated 0.0349 0.0036 0.0283 0.0423 jTJweather o .01 60 0 .0023 0 .01 1 9 0 .021 4 i start j 1~ ] days. enttf l| | days. decimal precision : 4 Outcome Estimate SE 1 hedge 0.9492 0.0043 2 depredated 0.0349 0.0036 3 jweather 0 .01 60 0 .0023 26 ------- 5.2.1 Multinomial logit scale estimates The logit scale estimates (the "betas") are difficult to convert into survival probabilities because the probabilities depend not only on the betas, but also on particular covariate values. The fit column of the first table shows which covariate values are used for fitting. When the Farms window is initially opened, the fit column is set to 1 for all intercept parameters, 0 for all categorical covariate levels (representing the reference category(s), which is lumped in with the intercept), and the mean value of a covariate for all quantitative covariates. For interactions, the product of the fit values for each main effect in the interaction is taken. These fit values are then used to generate the probabilities reported in the Daily estimates table, which should contain one row for each fate (including success). 5.2.2 Daily Estimates Daily survival and failure rates are reported in the Daily estimates table. This table has five columns and as many rows as nest outcomes (including success). The column MLE gives the fitted probabilities corresponding to the values of covariates in the fit column of the Multinomial logit scale estimates table. Similarly the subsequent columns give the standard error of the fitted probability and lower and upper confidence limits of the parameter. The standard error is calculated using the delta method (Seber 1982). Confidence limits are calculated following the strategy outlined by Sambamoorthi et al. (1999). They are symmetrical around the linear portion of the model (prior to transformation via the multinomial logit) and become asymmetrical under the transformation. The z-value specifies the desired nominal coverage of the confidence interval. The default is z = 1.96 corresponding to an approximate 95% confidence interval. Changes in the fit column of the Multinomial logit scale estimates table are immediately applied to the Daily estimates and Overall estimates tables. 5.2.3 Overall Estimates The third table, Overall estimates, extrapolates the daily probabilities for a single nest over the length of a cycle defined by the entries in the start and end edit boxes. The extrapolation to overall rates is done under the assumption that the particular covariate values in the fit column of the Multinomial logit scale estimates table pertain to the nest at Age = 1 day. Overall estimates are calculated for the period lasting from Age = start through Age = end, assuming the nest is still "alive" at age start. If you do not specify an Age covariate, the actual values of start and end are irrelevant and all that matters is their difference (the number of days for which an "Overall" estimate is desired). In this case you should always specify Ifor start. The ability to specify start is designed to allow you to obtain "Overall" estimates for a given developmental stage (e.g., the nestling phase). Temporal covariates (Age and Date) are assumed to increment by 1 each day from start until end is reached. All other covariates are assumed fixed for that particular nest. 5.2.4 Exploring the parameter space Of course, the mean of quantitative covariates and the reference category of categorical covariates are not necessarily the only portions of the covariate space of interest. Therefore the fit column in the first table allows limited interaction. All main effect values can be edited. Intercept values and interactions 27 ------- cannot be edited. Intercepts must always be "1" and interactions will change with changes in the corresponding main effects. 5.2.5 Design variables for quantitative covariates All quantitative covariates are rescaled (by dividing by their standard deviation) when they are converted to design variables. MCestimate retains the standard deviations in memory so that the covariance matrix for untransformed variables can be estimated from the covariance matrix for the transformed variables. 5.2.6 Design variables for categorical covariates All categorical covariates are converted to indicator variables (columns of ones and zeros) indicating whether a particular nest is in that category. Each categorical covariate with n levels generates n-1 indicator variables. The other category is lumped into the intercept. MCestimate arbitrarily allocates the first level it encounters of any categorical covariate into the intercept. In the example above the level "hi" of the categorical variable trt was lumped into the intercept. 5.2.7 Effects of changing covariate values on other covariates in the model In some cases, changing a covariate value in the fit column will automatically change the value of other covariates in the model. When a level of a categorical covariate is set to "1" all other levels of that covariate are set to zero (except the intercept). When a main effect fit value is changed, any interactions involving that main effect will use the new value of that covariate, so the interaction values will change also. Similarly, when multiple powers (or other transformations) of a quantitative covariate are simultaneously included in a model, changing the fit value for one transformation will result in a corresponding change to the fit values of the other (i.e., if "variable" is set to "3" then "variableA2" will automatically change to "9" and vice versa). Example 1. Interpreting parameter estimates To get a sense for how to use the Farms window, click on a cell in the second row of the output table on the main MCestimate window (i.e., highlighting the model: depredated(trt)weather(yr(c))), which is the model hypothesizing that the nest-predation rate depends on treatment (a categorical covariate) and that the weather-related failure rate depends on year (a categorical covariate). Then right-click and choose Model > Show parameter estimates. The Farms window should look like Fig. 21 (above). Suppose you were interested in the predation rate under the treatment condition lo. Then click that cell (row 2, column 7) in the fit column, type "1" and hit the "Enter" key . The new Farms window should look like Fig. 22. Note the changes in daily and overall parameters that have occurred in the second table (compare Fig. 21 to Fig. 22). 28 ------- Figure 22. Parms window from Fig. 21 with trt:lo set to "1" — Model name depredsrted(trt)weather(yrj;) Multinomial loglt scale estimates: - Fate | Parameter depredated Intercept depredated iitlo weather Intercept weather yr_c:1 -3.3040 0.2727 -4.0861 0.5227 0.1087 0.1474 01541 0.2087 O Date calculator Daily estimate?. z-value for eonSctenoe Hwvats: 1.96 | Parameter 1 [success 2 depredated MLE SE LJO. 0.9389 0.0049 0.9247 0.0453 0.0043 0.0373 0.0158 00024 0.0116 U_CL 0.9505 0.0548 0.0214 - Overall estimates: — start:[ 1 i days. Outcome 1 JFIedge 2 depredated 3 [weather i Estimate 0.9389 0.0453 0.0158 5E 0.0049 0.0043 0.0024 In Fig. 22, above, the overall rates are equal to the daily estimates because the start and end edit boxes are set to 1. Setting these values to 1 and 25, respectively (i.e., a 25-day nest cycle, from first egg to fledging) should give you the result displayed in Fig. 23. Figure 23. Parameter estimates from Example 1 for a 25-day nest cycle Model name depredated(trt)weather(yr_c) Fate | Parameter i Estimate 5E min max fit 1 depredated Intercept -3.3040 0.1076 1 1 1 2 depredated trtlo 0.2727 0.1463 0 1 1HHKI 3 weather Intercept -4.0861 0.1484 1 1 1 4 weather yr_c:1 0.5227 0.2028 0 1 0 O Date calculator z-value for confidence intervals:]"! ,95 | decimal precision: | 4 Parameter MLE SE L_CL U_CL _Msuccess 0.9389 0.0048 0.9251 0.9502 _2_idepredated 0.0453 0.0042 0.0375 0.0546 3 jweather 0.01 58 0.0023 0.01 1 7 0.021 2 1 !tat| 1 days. end: 25 days. decimal precision: | 4 Outcome Estimate SE 1 Fledge 0.2068 0.0267 2 depredated 0.5883 0.0340 3 [weather 0.2049 0.0267 29 ------- Example 2. Interpreting parameter estimates when Date is included Close the Parms window and go back to the main MCestimate screen. Choose the "best" model (depredated(Date+yr)weather(.)) and display its parameter estimates. The numbers in the min, max, and fit columns for the Date covariate are not easily interpreted because they are ordinal dates. The Date calculator allows conversion between calendar dates and ordinal dates (and their transformations). It can be displayed by clicking the Date calculator radio button below the Multinomial logit scale estimates table (Fig. 24). Typing "167" into the Ordinal: edit box should result in "15-Jun" appearing in the Date (dd-mmm): edit box. If you've used a transformation of Date you can also accommodate for it using the appropriate controls to the right and the transformed design variable will be displayed (on its native scale, not divided by its standard deviation). Changes to any of the three edit boxes (Date (dd-mmm):, Ordinal:, or Design variable:) will carry through to the other two. Note that this calculator cannot currently handle negative exponential transformations. Figure 24. Using the Date calculator to interpret models with "Date" as a covariate depredated(Date+yr_c)weather(.) Fate Parameter Estimate 3 depredated yr_c:1 0.0798 •* weather Intercept -3.8314 Date (dd-mmm): O ® Date calculator SE min max fit 0.1438 0 1 0 0-1023 1 1 1 dinai: Design variable: ® Power transformation power; J L 1 J U °9 ansor n | 1 ; i-value for confidence intervals: 1 .96 decimal precision: 4 Parameter MLE SE L CL U CL _J_|success 0.9431 0.0040 0.9314 0.9529 2 (depredated 0.0365 0.0036 0.0300 0.0443 3 jweather 0.0204 0-0020 0.01 67 0.0250 1ait| 1 jdays, end: 1 days. decimal precision: | 4 Outcome Estimate 5E _1_ Fledge 05431 0.0040 2 depredated 0.0365 0.0036 3 weather 0.0204 D.0020 5.3 Goodness-of-fit Assessing the fit of nest survival models is challenging (Sturdivant et al. 2007). MCestimate uses several different algorithms, including a multinomial extension of the Hosmer-Lemeshow test (Hosmer & Lemeshow 2000), a similar method described by Pigeon and Heyse (1999), and two methods based on 30 ------- parametric bootstrap (White et al. 2001). The latter algorithms rely on repeatedly generating sets of data from the maximum likelihood estimates of the linear coefficients and fitting the same model to the new sets of data. In this way, a null distribution of a given test statistic is generated under the hypothesis that the data arose from the fitted model parameterized with the estimated MLEs. The complement of the proportional rank of the test statistic is taken to be the p-value for that test statistic. Several test statistics are available for use in the parametric bootstrap procedure, including the deviance from a saturated model (Deviance) and various forms of the power-divergence family of statistics (Read & Cressie 1988). The latter family includes the Pearson X2 statistic and the Freeman-Tukey statistic. I recommend that you use the parametric bootstrap option with caution! It is very slow and often does not exhibit improved performance compared to the Hosmer-Lemeshow and Pigeon-Heyse tests, which are assessed against chi-square distributions with known degrees of freedom (hence do not require lengthy simulations). To use these algorithms choose Assess goodness of fit under the model option in the Fitted models context menu (Table 4). This will load the GoF GUI (Fig. 25). Figure 25. The GoF GUI when first loaded depredated(Date+y rjiOweatheri;.) Groups: 1 Test Statistic Hosmer-Lemeshow 14 4505 Pigeon-Heyse 14.4704 I dr Prob 18 0.6993 13 0.6980 Power diver yen.? :t View contributions lambda: I 1 Run 1 2 3 Fate Interrupt Parameter Estimate Continue — Iterations: — | 100 | SE (hessian) SE (bootstrap) 1 o.e 0.6 0.4 0.2 °[ Status: Current deration: Overdispersion (c hat): Observed Current Value Probability 10 20 30 40 50 60 70 80 90 10C At the top of the GoF GUI the name of the model to which the current goodness-of-fit results pertain is displayed. Immediately below the model name is a panel (Goodness-of-fit based on grouping strategies) showing p-values for the Hosmer-Lemeshow test and the Pigeon-Heyse test. The default value for the number of groups to consider is 10 (following guidance from Hosmer and Lemeshow 2000), but you may enter other values. Typically the Hosmer-Lemeshow p-value is very close to that for the 31 ------- Pigeon-Heyse statistic given the same number of groups. Below these results is the Parametric Bootstrap panel. To run a parametric bootstrap test you must choose how many iterations you wish to simulate and click Run. Usually 100 iterations is enough to figure out whether you should be concerned about goodness-of-fit, but if so, then you should run at least 500 iterations (better yet 1,000). Plan on having something else to do during that time! Above and to the right of the Parametric Bootstrap panel is a small panel labeled Power divergence statistic. This box has two uses. First, the parameter lambda determines which member of the Power divergence family is used as the test statistic in the Parametric bootstrap. A value of lambda = 1 corresponds to Pearson's X2 statistic. A value of lambda = -0.5 corresponds to the Freeman-Tukey statistic. Reed and Cressie (1988) recommend a value of lambda ~ %. You don't necessarily need to decide in advance because you can change the value of lambda on the fly (example below). The ability to change the value of lambda is intended to allow you to ensure that your data do not exhibit lack-of-fit under different statistics. It is NOT intended to allow you to shop for a statistic under which you get an acceptable p-value! The second use for the Power divergence statistic panel is to allow you to view the contributions from each particular observation in your data set to the Power divergence statistic (with parameter lambda). To do so, enter the desired value of lambda and choose View contributions. This will bring up a table of contributions (described in example below). If you run a parametric bootstrap test, in addition to the chosen power divergence statistic, you will also be provided with goodness-of-fit results for the deviance of the current model compared to a saturated model. You will also generate estimates of overdispersion and bootstrap estimates of the standard errors of the linear coefficients. Do not be too alarmed if these disagree with the estimates generated from the Hessian matrix. It can take many iterations for the bootstrap estimates to converge. However, if after many iterations the estimates still do not agree, then the bootstrap estimates are probably more reliable, especially at small sample sizes. Finally, the parametric bootstrap test will also generate a graphical display showing the current value and cumulative probability trajectory of the goodness-of-fit statistics. This is intended to help you decide when enough simulations have been run. 5.4 Example. Assessing Goodness-of-fit If you have imported SimData.xls as described above and fitted the Models from Section 4, then load the GoF GUI so that it looks like Fig. 25. Preliminary indications from the Hosmer-Lemeshow and Pigeon- Heyse tests suggest little evidence for lack-of-fit (Fig. 25). For further verification, a bootstrap goodness- of-fit test could be run. To do so, simply click Run within the Parametric Bootstrap panel. Fig. 26 shows the bootstrap results after 100 iterations (note this took about 30 minutes to generate). 32 ------- Figure 26. Bootstrap goodness-of-fit results for the depredated(Date+Yr_c)weather(.) model depredated(Date+yr_c)weatherC.) Groups: Hosmer-Lemeshow Pigeon-Heyse Test Statistic c 144505 14.4704 f Prob 18 0.6993 18 0.6980 Power divergence statistic - View contributions , 1 Parametric Bootstrap depredated weather r Iterations: — Interrupt Continue | I — I Parameter Estimate SE (hessian) SE (bootstrap) d Intercept -0.0156 0.6632 0.6816 d Date -0.0193 0.0040 0.0338 Intercept -3.8314 0.0991 0.1039 1 O.B 0.6 04 3atus: Current iteration: 1 Overdispersion (c_hat): Observed Current Value Probability Deviance 1924 1816 0.23 \Vvs_ 0.2 10 20 30 40 50 BO 70 BO 90 10c| According to the table showing goodness-of-fit statistics (above the panel graph of probabilities), the observed deviance (compared to a saturated model) was 1,924, which was well within the distribution of simulated deviances (Fig. 26). However, the Pearson X2 statistic was higher than almost all simulated values, indicating lack-of-fit. The probability is 0.02 in this case because the observed value of X2 from the empirical data is taken to be the first member of the simulation set and one iteration of the simulation (around iteration 16) produced a value of X2 greater than the observed value. Thus 2 out of 100 values were greater than or equal to the observed value. To see which observations are contributing the most to the test statistic, you can click View contributions in the Power divergence statistic panel. Doing so in this case, with lambda=l, should produce a table looking like Fig. 27. 33 ------- Figure 27. Table of observation-specific contributions to the Peason X2 statistic 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 1234 Nest Date Length Outcome 39 17-May-2QOO 2 weather 201 1 8-May-2000 2 weather 105 23-May-20QO 2 weather 166 28-May-2QOO 2 weather 450 27-May-2000 2 weather S4 Q2-Jun-200Q 2 weather 375 Q5-Jun-200Q 2 weather 241 06-Jun-2000 2 weather 142 Q3-Jun-2QOO 2 weather 362 Q8-Jun-20QQ 2 weather 127 09-Jun-2000 2 weather 41 Q5-Jun-200Q 2 weather 337 1Q-Jun-200Q 2 weather 349 13-Jun-2000 2 weather 3S1 21 -Jun-2000 2 weather 230 22- Jun-2000 2 weather 260 23-Jun-2000 2 weather 361 23-Jun-200Q 2 weather 369 26- Jun-2000 2 weather 130 29-Jun-2000 2 weather 231 25-Jun-2000 2 weather 186 07-Jul-2000 2 weather 250 04-JUI-2000 2 weather 442 04-Jul-2000 2 weather 183 15-Jul-200Q 2 weather 399 1 1 -Jul-2000 2 weather 455 11 -Jul-2000 2 weather 49 18-Jul-2000 2 weather 44 22-JUI-2000 2 weather 126 07-Jul-2000 2 depredated 100 06-Jul-2000 2 depredated 400 06-JUI-2000 2 depredated 412 06-Jul-2000 2 depredated 224 01 -Jul-2000 2 depredated 5 6 survived depredated 0.3333 0.12S9 03440 01173 0.3457 0.1161 0.3552 0.1063 03607 01005 0.3639 0.0972 0.3688 0.0922 0 3704 0 0905 0.3722 0.0887 0.3735 0.0873 0 37SO 0 0853 0.3752 0.0855 0.3765 0.0842 0 3864 0 0740 0.3912 0.0690 0.3924 0.0677 0 3936 0 0665 0.3936 0.0665 0.3971 0.0629 0 9003 0 0593 0.9005 0.0594 0.9083 0.0514 0 9093 0 0503 0.9093 0.0503 0.9151 0.0443 09152 00442 0.9152 0.0442 0.9175 0.0419 0 9204 0 0389 0.9119 0.047S 0.9111 0.0485 09111 00485 0.9073 0.0523 0.9065 0.0532 7 weather 0.0378 0.0382 0.03S3 0.0386 0.0388 0.03S9 0.0390 0.0391 0.0391 0.0392 0.0392 0.0392 0.0393 0.0396 0.0398 0.0398 0.0399 0.0399 0.0400 0.0401 0.0401 0.0403 0.0404 0.0404 0.0406 0.0406 0.0406 0.0407 0.0407 0.0405 0.0404 0.0404 0.0403 0.0403 8 PDS contrib. 25.4264 25.1777 25.1 385 24.9231 24.7934 24.7279 24.6197 24.5850 24.5463 24.5176 24.4848 24.4B03 24.4528 24.2410 24.1 382 24.1128 24.0879 24.0B79 24.0161 23.9433 23.9453 23.7859 23.7847 23.7647 23.6468 23.6446 23.6446 23.6000 23.5416 20.0134 19.6266 19.6266 1 8.1 029 17.8008 As you can quickly determine by inspection, the greatest contributions to the Peason X2 statistic in this case are coming from observations on weather failure. Scrolling further down (not shown) you would find that the next largest contributions come from observations on predation events, followed by observations over which nests survived. This is typical, as the occurrence of rare events will dominate the Pearson X2 statistic when all cells have exactly one observation (hence the need to evaluate the distribution of the test statistic via parametric bootstrap). Changing the value of lambda in the Power divergence statistic panel will change the test statistic and will retroactively apply the changes to the results of the Parametric Bootstrap. For example, the next figure shows the results for the Freeman-Tukey statistic (Fig. 28). 34 ------- Figure 28. Parametric bootstrap results for the Freeman-Tukey statistic OWE depredated (Date+yr_c)weatherO Groups: Hosrner-Lemeshow Pigeon-Heyse Test Statistic c 14.4505 14.4704 f Prob 13 0.6993 18 0.6980 Power divergence statistic - View contributions lambda: I -°-5l — Parametric Bootstrap Fate depredated depredated weather Interrupt Parameter d Intercept d Date d yr_c:1 Intercept Estimate Continue SE(hessian) SE(b -0.0156 0.6632 -0.0193 0.0040 0.0798 0.1487 -3.8314 0.0991 ,— Iterations: — 100 ootstrap) 0.661 6 0.0338 0.1564 0.1039 — Status: — Current deration: 100 Overdispersion (c_hat): Observed Curren Deviance 1924 1318 Freernan-Tukey 2596.6 24543 t Value Probability 0.23 0.4 - Freeman-Tukey 10 20 30 40 50 60 70 BO 90 100 Unlike the Pearson X2 statistic, the Freeman-Tukey statistic does not suggest lack-of-fit (Fig. 28). This should be interpreted with caution! Limited power analyses suggest that the Freeman-Tukey statistic rarely has much power to detect lack-of-fit (for these kinds of data) and that the Pearson X2 statistic tends to reject at the nominal rate when the model fits the data. Thus I do not recommend adoption of the Freeman-Tukey statistic without good a priori justification. Fig. 28 also shows that by 100 iterations, the bootstrap estimates of standard errors are fairly close to those obtained from the Hessian (with the exception of the estimate for Date). This happened fairly rapidly in this case and you should not expect such rapid agreement! Finally, note also that overdispersion is small (c —1.0223) and can probably be ignored. 5.5 Exporting MCestimate results All tables in MCestimate can be copied and pasted into other programs. To do so, select the cells you wish to copy and press ctrl-c to copy the cells into the windows buffer. They can then be pasted using ctrl-v or the edit-paste menus of most programs. MCestimate provides two additional ways to export results. The first is Export estimates under the Model choice of the Fitted models context menu. Choosing this option will bring up a file save dialog 35 ------- (Fig. 29) with a default name being the name of the model under which the estimates are derived (though you may change it). Figure 29. The file save dialog that appears after choosing Export estimates. > Output file nlfx Provide a file name: |pepredated(Date+yr(c))we9l OK Can EZH ;el This option saves an Excel workbook with three worksheets. Sheetl will contain the MLEs (the betas) and the covariance matrix. SheetZ will contain fitted probabilities for each nest from the time of discovery to the end of monitoring (note these are not fitted probabilities over the life of the nest!). Sheets will contain fitted probabilities for each observation on each nest. I recommend caution interpreting the values on SheetZ and Sheets. They are generally equivalent to neither daily nor overall values. This option is included to facilitate a do-it-yourself goodness-of-fit test. Another way to export results from MCestimate is to use the File menu on the Farms GUI (e.g., Fig. 22, above). To use this function you must first select the name of a quantitative covariate in the Multinomial logit scale estimates window. MCestimate will then generate 101 values of the selected quantitative covariate evenly spaced between the minimum and maximum value of the covariate. It then uses those 101 values to fit daily and overall probabilities as described above (Sections 5.2.2 and 5.2.3). The resulting fitted probabilities are exported to an Excel file of the same name as the selected covariate (in this case Date.xls, Fig. 25). This function is intended to help with graphing the relationship between the fitted probabilities and continuous covariates. 36 ------- Figure 30. A portion of the file Date.xls, sheet = weather,created using the Export series button a Paste J ">- ' Cahb -J Copy J Format Painter B IVbo.ra Ai I A j B 1 [Dale Idaily est. 2 124 0.0194 3 124.81 0.0194 4 125.62 0.0194 5 126.43 0.0194 6 127.24 0.0195 7 128.05 0.0195 8 128.86 0.0195 9 129.67 0.0195 10 130.48 0.0196 11 131.29 0.0196 12 132.1 0.0196 13 132.91 0.0196 14 133.72 0.0197 15 134.53 0.0197 16 135.34 0.0197 17 136.15 0.0197 IS 136.96 0.0197 19 137.77 0.0198 20 138.58 0.0198 21 139.39 0.0198 22 140.2 0.0198 23 141.01 0.0199 24 141.82 0.0199 25 142.63 0.0199 26 143.44 0.0199 - C | •" - 1 A" »' ' — " s Font l". f. Date D E F S dailySELCI UCI overall est. overall SE 0.0020 0.0159 0.0236 0.0194 0.0020 0.0020 0.0159 0.0236 0.0194 0.0020 0.0020 0.0159 0.0237 0.0194 0.0020 0.0020 0.0160 0.0237 0.0194 0.0020 0.0020 0.0160 0.0237 0.0195 0.0020 0.0020 0.0160 0.0237 0.0195 0.0020 0.0020 0.0160 0.0238 0.0195 0.0020 0.0020 0.0160 0.0238 0.0195 0.0020 0.0020 0.0161 0.0238 0.0196 0.0020 0.0020 0.0161 0.0239 0.0196 0.0020 0.0020 0.0161 0.0239 0.0196 0.0020 0.0020 0.0161 0.0239 0.0196 0.0020 0.0020 0.0161 0.0239 0.0197 0.0020 0.0020 0.0162 0.0240 0.0197 0.0020 0.0020 0.0162 0.0240 0.0197 0.0020 0.0020 0.0162 0.0240 0.0197 0.0020 0.0020 0.0162 0.0241 0.0197 0.0020 0.0020 0.0162 0.0241 0.019S 0.0020 0.0020 0.0162 0.0241 0.0198 0.0020 0.0020 0.0163 0.0241 0.0198 0.0020 0.0020 0.0163 0.0242 0.0198 0.0020 0.0020 0.0163 0.0242 0.0199 0.0020 0.0020 0.0163 0.0242 0.0199 0.0020 0.0020 0.0163 0.0242 0.0199 0.0020 0.0020 0.0163 0.0242 0.0199 0.0020 « a P ' SjJ Wrap Text General 31 Merge & Center - $ - % » rrt '• number H 1 j K m Exes! ft I M 5.6 Visualizing MCestimate results The File menu on the Parms window also contains the option Plot series. As above, to use this function you must first select the name of a quantitative covariate in the Multinomial logit scale estimates window. MCestimate will then generate 101 values of the selected quantitative covariate, as described above and plot them for you with confidence intervals (the latter using specified z-values). For example, with the Parms window open for the model depredated(Date+Yr(c))weather(.) and the name Date highlighted in the Multinomial logit scale estimates table, choosing File/Plot series will produce three figures that resemble Figs. 31-33, respectively. Figure 31. Daily survival rate, with Date in the depredated(Date+Yr(c))weather(.) model 130 140 150 37 ------- Figure 32. Daily depredation rate, with Date in the depredated(Date+Yr(c))weather(.) model 7r riff i Figure 33. Daily rate of failure due to adverse weather, with Date in the depredated(Date+Yr(c))weather(.) model 11 ! J . 0022 190 200 210 I would welcome suggestions and thoughts on how to improve these graphing functions! 6. Managing results tables and model lists Constantly clicking through the creation of models in MCestimate can be tedious. Thus model lists and AIC tables can be saved, re-opened, and subsequently modified (add or remove models). The easiest way to do this is through the Table options in the context menu of the Fitted models table (Table 5, Fig. 34). Table 5. The Table context menu in the Fitted models table Menu Item Description Save AIC Table Save as model list Clear AIC Table Saves the AIC table so that it can be loaded later. Saved AIC tables are automatically saved to the data file currently in use. Saves the models in the AIC table as a list that can be loaded and run at a later time. Saved model lists are automatically saved to the data file currently in use. Deletes the Fitted models table. It cannot be recovered. 38 ------- Figure 34. The Table context menu File Analysis About Expiration date; - Ready Decimal precision: Run models Interrupt Response Variable: — O Binomial (success/fail) <•;• Multinomial Pending models: Model name AICc delta AICc Weight 2 |depredated(trt)weather(yr j jjdepredated( .)weather(yr_c; | t depredated(yr_c)weather(yr_c) 5 ldecredated(.)weather(.) 962.01 QRR77 Save AIC Table Save as model list 1932.03 0 1945.54 13.51 1946.93 14.9 1948.47 16.44 1951.45 19.42 Clear AIC Table 6.1 Saving an AIC Table Choosing Save AIC Table under the Table option in the Fitted models context menu (Fig. 34) will load a dialog box for specifying a name for the AIC Table. Note: spaces are not allowed in the table name. If you include a space, the table will be saved, but will generate an error when you attempt to load the table subsequently. Names of AIC Tables and model lists (below) should always begin with a letter. When you save an AIC table, MCestimate automatically saves it to the current data file on disk. 6.2. Saving a model list In some cases you may wish to save a model list to run in the future. This can be done in two ways. An existing AIC table can be saved as a model list by choosing Save as model list under the Table option in the Fitted models context menu. As with saving an AIC table, this will bring up a dialog asking you to name the model list. Also as above, the name cannot include spaces. The second way to save a model list is to choose Save model list in the Pending models context menu. 6.3 Managing lists Once you've saved an AIC table or a model list it can be subsequently loaded under the Analysis menu using the analogous load command (Load AIC table or Load model list). In either case, the choices you will be offered to load will depend on the response variable that is currently selected in the radio button on the main MCestimate page. MCestimate will only allow you to load model lists or AIC results that 39 ------- match the selected response variable (binomial or multinomial). As you save model lists and AIC tables you may wish to delete lists that are no longer needed. You can do so under the Analysis menu choice Manage lists. Saved model lists and AIC tables may be compiled into larger lists by loading them into the main MCestimate window. Whenever you load a list or an AIC table you have the option of either appending or replacing the existing table. Acknowledgments Thanks to T. Johnson, University of Wyoming (first MCestimate beta tester!) for suggesting some great changes to the program and user's guide. Thanks also Allyson Jackson of Biodiversity Research Institute, for using MCestimate to analyze Carolina Wren data and making many good suggestions for model improvements. Finally, thanks to R. Greenberg of the Smithsonian Migratory Bird Center for patiently waiting for ten years for this two-year project to finish. A. Kuhn and M. Starus provided valuable editorial comments on this guide. Literature cited Burnham, K., and D.R. Anderson. 2002. Model selection and multimodel inference: A practical information-theoretic approach, 2nd Ed. Springer-Verlag, NY. Choquet, R. A.-M. Reboulet, R. Pradel, O. Gimenez, and J.-D. Lebreton. 2004. M-SURGE: new software specifically designed for multistate capture-recapture models. Animal Biodiversity and Conservation 27.1: 207-215. Choquet, R., L. Rouan, and R. Pradel. 2009. Program E-SURGE: a software application for fitting multievent models. Environmental and Ecological Statistics 3:8-39. Dinsmore S.J., G.C. White, and F.L Knop. 2002. Advanced techniques for modeling avian nest survival. Ecology 83:3476-3488. Etterson, M.A., L.R. Nagy and T. Rodden-Robinson. 2007a. Partitioning risk among different causes of nest failure. Auk 124:432-443. Etterson M.A., B.J. Olsen, and R. Greenberg. 2007b. The analysis of covariates in multi-fate Markov chain nest-failure models. Studies in Avian Biology 34:55-64. Hosmer, D.W. and S. Lemeshow. 2000. Applied Logistic Regression. John Wiley & Sons, Inc. New York, NY. Hougaard, P. 1999. Multi-state models: A review. Lifetime Data Analysis 5:239-264. Mathworks. 2009. MATLAB: The language of technical computing. The Mathworks, Inc. Natick, MA. Mayfield, H.F. 1961. Nesting success calculated from exposure. Wilson Bulletin 73:255-261. Mayfield, H.F. 1975. Suggestions for calculating nesting success. Wilson Bulletin 87:456-466. 40 ------- Pigeon, J.G. and J.F. Heyse. 1999. An improved goodness of fit statistic for probability prediction models. Biometrical Journal 41:71-82. Pradel, R. 2005. Multievent: an extension of multistate capture recapture to uncertain states. Biometrics 61:442-447. Read, T.R.C., and N.A.C. Cressie. 1988. Goodness-of-Fit Statistics for Discrete Multivariate Data. Springer- Verlag. New York, NY. USA. Rotella J., S.J. Dinesmore and T.L Shaffer. 2004. Modeling nest-survival data: A comparison of recently developed methods that can be implemented in MARK and SAS. Animal Biodiversity and Conservation 27:187-205. Sambamoorthi, N., V.J. Ervin and G. Thomas. 1994. Simultaneous prediction intervals for multinomial logistic regression models. Communications in Statistics -Theory and Methods 23:815-829. Seber, G.A.F. 1982. The Estimation of Animal Abundance and Related Parameters, 2nd Ed. Blackburn Press. Caldwell, NJ. Shaffer, T.L. 2004. A unified approach to analyzing nest success. Auk 121:526-540. Stanley, T.R. 2000. Modeling and estimation of stage-specific daily survival probabilities of nests. Ecology 81: 2048-2053. Stanley, T.R. 2004a. Estimating stage-specific survival probabilities of nests when nest age is unknown. Auk 121:134:147. Stanley, T.R. 2004b. When should Mayfield data be discarded? Wilson Bulletin 116:267-269. Sturdivant, R.X., J.J. Rotella, and R.E. Russell. 2010. A smoothed residual based goodness-of-fit statistic for nest-survival models. Studies in Avian Biology 34:45-54. White, G.C. and K.P. Burnham. 1999. Program MARK: Survival estimation from populations of marked animals. Bird Study 46:S120-S129. See also: http://warnercnr.colostate.edu/~gwhite/mark/mark.htmtflntroduction White, G.C, K.P. Burnham, and D.R. Anderson. 2001. Advanced features of program MARK. Pages 368- 377 in Wildife, Land, and People: Priorities for the 21st Century. Proceedings of the 2nd International Wildlife Management Congress. R. Field, R. J. Warren, H. Okarma, and P. R. Sievert, Eds. The Wildlife Society. Bethesda, MD. 41 ------- |