Start cricketsim by double-clicking the icon for it. The main window appears:
Make sure the Output->Options window has these settings (should be the
defaults):
"Hybridity file output sampling frequency" is how often hybridity data
collection occurs (e.g., the distribution of A's and B's in the
population). In this case, hybridity data are collected every 100
timesteps. The other settings should not affect any relevant
datafiles.
Make sure the Output->Data Displays window has these settings (should
be the defaults):
Either set the parameters manually, using the Params menu options, or load in a set of parameters using the File->"Load param file" option. When loading a parameter file, browse until you find a file whose name ends in ".prm" (the default parameter files all end this way, but any parameter files you save can be found, no matter what they end in). Similarly, if you have some parameter set you wish to save to use later, or to make minor variations from, simply choose File->"Save param file" and choose a name for the parameter file. Be sure to put ".prm" on the end of the file name to help you remember that it's a parameter file.
Once the parameters have been set, choose the number of repetitions for this experiment from the main window, "# run repetitions". Since the simulation involves many random elements, each repetition can produce different results. Each experimental test ("run") will produce a set of data stored in its own unique folder. Each folder will be stored inside another folder whose name is specified by the general parameter, "Experimental condition subdirectory/folder". For example, if the Main data directory/folder is set to "CSData" and the Experimental condition subdirectory/folder is set to "test", then each run of the experiment will be stored inside a folder called "cs-YYYY-MMDD-hhmmss" where the YYYY, MM, DD etc. are the date and time the experiment was begun. This keeps experiments separate from each other with unique names. This folder will be inside the "test" folder, which will be inside the "CSData1.2" folder (the version of the program, 1.2, is automatically added to the end of the Main data directory/folder.
Phenotype parameters specify the layout of the genome, indicating how many loci per trait (pulse rate and pulse rate preference, age/terrain fitness, sex). In the example below, each genome will have 10 loci coding for male pulse rate. These are all specified, and a list of 10 locations follows. The same is true for the pulse-rate preference trait, although the locations are different (which they must be - no two loci can occupy the same map location, of course). Again, the age trait has 10 loci. Finally, the sex locus is located in the exact center of the genome. All of these traits can vary from 1 to any positive number of loci, and pulse-rate can have a different number of loci than pulse-rate preference, etc. NOTE: locus and map location are different concepts. If there are 3 age loci, located at map locations 2.3, 30.5, and 91.2, the 2nd locus is found at map location 30.5.
Here is an example of a phenotype similar to the above one, but with some randomly-placed alleles. Random alleles will be the same for all crickets across all species so that interbreeding is possible. Otherwise, if a sex gene were located in one place for one cricket but another place for a different cricket, then their offspring would have 2 sex loci (which is undefined for this simulator). "random" means that the simulator chooses random locations for these at the start, and all crickets will have these random values. So the locations are only random across runs, not between individual crickets within the same run.
8 random 7 specified 1 19.2 54.3 56.2 59.3 90 95 21 random randomHere, the phenotype has 8 randomly-located pulse-rate loci, 7 loci for pulse-rate preference with user-specified locations, 21 randomly-located age loci and a randomly-located sex locus. For 10 different runs under this phenotype, the populations will have different loci for pulse-rate, age and sex traits, but all populations will have the same pulse-rate preference loci.
Genotype parameters specify which marker to use for each genotype and then give initial values which are used in construction genotypes for the founding populations in the simulation. For example, if you wanted to run an experiment with A's and B's, you could specify an "A" genotype and a "B" genotype here.
Example:
genotype1 A 1 0.05 0.005 0.072 0.001This genotype is called "genotype1". The name is important since the species parameters must refer to different genotypes for different species. The marker on all initial alleles created for this genotype (at the start of the experimental run) will be "A". Thus, initial genomes of type "genotype1" will be 100 percent pure A's. The "1" that follows The next 4 numbers describe "pr-mean", "pr-sd", "prPref-mean" and "prPref-sd". These are values per allele used in the construction of the initial population's genomes. Any cricket with genotype1 will given initial values in each of its alleles according to these means and standard deviations. The first 2 parameters describe the normal distribution used to select initial pulse-rate alleles from. In the example, pr-mean=0.05 and pr-sd=0.005. This means that on average, each allele for pulse rate will have a value of .05 with a standard deviation of .005. If there are 10 loci for pulse rate (the number loci are specified in the phenotype parameters), then there would be 20 alleles for pulse rate (since genomes are diploid) with an average value of 0.05 each. The same rules apply to the last 2 numbers, except that these code for pulse-rate preference. In the example, initial cricket pulse rates will be on average lower than their species' pulse-rate preferences, but pulse rates will vary more (higher sd) than the preferences (sd of .001).
Genomes are diploid. Traits are expressed by summing values found at each locus. If there were 2 loci for pulse rate, then there would be 4 alleles (2 per locus since genomes are diploid). If those alleles were .03, .02, .045 and .033, then the pulse rate would be determined by adding these up, for a value of .128.
Each genome specifies both male and female traits (pulse rate and pulse-rate preference). Males express the pulse-rate trait but ignore the pulse-rate preference alleles (except when passing them on to offspring of course). Females express pulse-rate preference but not pulse-rate.
Values must be separated by spaces, TABs or new lines - any "whitespace" will do.
Species parameters determine what actual populations will appear in the world. Each line of the "Species parameters" window describes a particular species. So, if there are 3 lines describing 3 species, then there will be 3 distinct populations during the simulation run. In the example below, the "cerasina" population will begin the run with 200 males, 200 females all using genotype1. They will be located around cell 5,5 with a standard deviation of 1.5 in the x and y directions. This should give a reasonable clustering without being too overly crowded. The second species, "nigra" differs only in their genotype and in their initial locations.
Files that will appear after a run has completed:
cs-graphs.gp cs-hybrid-graphs.gp data.html data3.html data4.html hybrid-A-1.data hybrid-A-100.data hybrid-B-1.data hybrid-B-100.data mate-succ-pr.data mate-succ-prpref.data params.prm popavg.data pr-A-stats.data pr-A.data pr-B-stats.data pr-B.data pr-stats.data pr.data prpref-A-stats.data prpref-A.data prpref-B-stats.data prpref-B.data prpref-stats.data prpref.data region-hybridity-avgs.data region-hybridity-diffs.data spatial-hybrid-A-1.data spatial-hybrid-A-100.data spatial-hybrid-B-1.data spatial-hybrid-B-100.data spatial-hybrid-summary-1.data spatial-hybrid-summary-100.dataThese files are all inside of a folder called cs2003-0714-114427, and this folder is inside of a folder called "test" (the "Experimental condition subdirectory/folder" parameter), which is inside of the CSData1.2 folder. In the cs-2003-0714-114427 folder is also a "csdata.html" file. Opening this in Netscape Navigator or Internet Explorer will allow a view of all of the plots that have been created. There will also be many other files, ending in .jpg which are graphs/plots. These can be viewed with the web browsers mentioned above or inserted into an electronic document.
Of the files in the data folder listed above, all of the ones ending in .html and .gp can be ignored. "params.prm" is the parameter file - it lists the parameters used during the run. The other files, all ending in .data, are the data files.
1 0.13999999999999999 1 0.0 1 0.0 1 0.0 1 0.01111111111111111 1 0.0 1 0.0 1 0.13333333333333333 100 0.12452262836878222 100 0.142461704961705 100 0.13068754232721846 100 0.10665902724726255 100 0.1403209193906868 100 0.15687699397376817
dominant genetic markers in the world at time 1100 NOTE: values are not averaged over the period, just sampled at this period, so they may not represent trends A - B B B B B B B B B B B A B B B A B A A B A A A B B B B B B A A B B A A A B B A B A B B A B B B B A A A B A B B B B B B B A B B B B B B B A A A A B B A A B B A A A A B B A B B B A A B B A A B B A A A A A B A A A B B B A A A B A A A A B B A A A A B B A B B B A A B B A A B A B B A A A A A A B B A B A A A A A A A A A A hybridity (list form): x y hybridity 0, 0 A 0, 1 - 0, 2 B 0, 3 B 0, 4 B 0, 5 B 0, 6 B 0, 7 B 0, 8 B 0, 9 B 1, 0 B 1, 1 B 1, 2 B 1, 3 A 1, 4 B 1, 5 B 1, 6 B 1, 7 A 1, 8 B 1, 9 A 2, 0 A 2, 1 B 2, 2 A 2, 3 A 2, 4 A 2, 5 B 2, 6 B 2, 7 B 2, 8 B 2, 9 B 3, 0 B 3, 1 A 3, 2 A 3, 3 B 3, 4 B 3, 5 A 3, 6 A 3, 7 A 3, 8 B 3, 9 B 4, 0 A 4, 1 B 4, 2 A 4, 3 B 4, 4 B 4, 5 A 4, 6 B 4, 7 B 4, 8 B 4, 9 B 5, 0 A 5, 1 A 5, 2 A 5, 3 B 5, 4 A 5, 5 B 5, 6 B 5, 7 B 5, 8 B 5, 9 B 6, 0 B 6, 1 B 6, 2 A 6, 3 B 6, 4 B 6, 5 B 6, 6 B 6, 7 B 6, 8 B 6, 9 B 7, 0 A 7, 1 A 7, 2 A 7, 3 A 7, 4 B 7, 5 B 7, 6 A 7, 7 A 7, 8 B 7, 9 B 8, 0 A 8, 1 A 8, 2 A 8, 3 A 8, 4 B 8, 5 B 8, 6 A 8, 7 B 8, 8 B 8, 9 B 9, 0 A 9, 1 A 9, 2 B 9, 3 B 9, 4 A 9, 5 A 9, 6 B 9, 7 B 9, 8 A 9, 9 A 10, 0 A 10, 1 A 10, 2 A 10, 3 B 10, 4 A 10, 5 A 10, 6 A 10, 7 B 10, 8 B 10, 9 B 11, 0 A 11, 1 A 11, 2 A 11, 3 B 11, 4 A 11, 5 A 11, 6 A 11, 7 A 11, 8 B 11, 9 B 12, 0 A 12, 1 A 12, 2 A 12, 3 A 12, 4 B 12, 5 B 12, 6 A 12, 7 B 12, 8 B 12, 9 B 13, 0 A 13, 1 A 13, 2 B 13, 3 B 13, 4 A 13, 5 A 13, 6 B 13, 7 A 13, 8 B 13, 9 B 14, 0 A 14, 1 A 14, 2 A 14, 3 A 14, 4 A 14, 5 A 14, 6 B 14, 7 B 14, 8 A 14, 9 B 15, 0 A 15, 1 A 15, 2 A 15, 3 A 15, 4 A 15, 5 A 15, 6 A 15, 7 A 15, 8 A 15, 9 A
average hybridities for marker A in the world at time 1100 A 0.0 means that crickets in the region are purely type A NOTE: values are not averaged over the period, just sampled at this period, so they may not represent trends 0.500 0.750 0.750 0.769 0.636 0.800 0.700 0.636 0.667 0.700 0.667 0.769 0.667 0.500 0.571 0.727 0.750 0.429 0.875 0.500 0.500 0.545 0.250 0.500 0.500 0.778 0.750 0.533 0.667 0.909 0.588 0.500 0.444 0.636 0.625 0.200 0.250 0.500 0.600 1.000 0.364 0.750 0.462 0.556 0.636 0.429 1.000 1.000 0.909 0.545 0.375 0.455 0.286 0.636 0.500 0.571 0.889 0.600 0.750 0.800 0.615 0.545 0.333 0.611 0.733 0.700 0.889 0.636 0.818 0.667 0.300 0.429 0.312 0.444 0.727 0.667 0.500 0.455 0.545 1.000 0.400 0.429 0.364 0.333 0.538 0.636 0.500 0.556 0.800 0.857 0.417 0.143 0.615 0.667 0.273 0.500 0.545 0.700 0.333 0.429 0.417 0.231 0.385 0.714 0.333 0.500 0.300 0.769 1.000 0.667 0.077 0.200 0.333 0.750 0.500 0.333 0.364 0.091 0.750 0.533 0.375 0.333 0.250 0.500 0.571 0.583 0.333 0.600 0.667 0.643 0.273 0.444 0.538 0.625 0.400 0.250 0.700 0.500 0.571 0.833 0.200 0.500 0.417 0.222 0.333 0.417 0.571 0.625 0.333 0.545 0.188 0.429 0.385 0.500 0.357 0.455 0.364 0.500 0.286 0.125
jm-pre-cluster-t1 jm-pre-sd1-cluster-t1 jm-pre-sd1-cluster-t1-bestprop jm-pre-sd1-cluster-t1-bestinvprop jm-pre-sd1-cluster-t1-bestfix
The following instructions are only needed if cricketsim does not produce plots or data analysis files at the end of an experimental run. As installed, the program should have no problems finding its proper settings.
From the main window, select the Output menu, then choose Data Analysis:
The data tools directory is a full path (including the drive designator, usually C:\ on a typical windows installation). The cricketsim tools consist of several perl and gnuplot scripts. "cs-analysis.pl" is one of these tools. Browse to whatever folder holds these tools.
Perl and Gnuplot are used by cricketsim to create graphs (gnuplot does this with perl's help) and to analyze some of the raw data files to produce more readable ones. The gnuplot command and perl directory must be full paths to these other software components. The perl directory must be the location of the "perl.exe" program. The gnuplot command must be the location of the gnuplot program "wgnupl32.exe" or some other gnuplot program. This command must include the program's name as well (should be "wgnupl32.exe" with the original installation). If these are not set correctly, data analysis (plots, tables) will not be performed and the program will hang. The raw data will always be, but they must be analyzed manually if perl and gnuplot cannot be found. Use the "browse" button to set these values if needed.