READ ME FILE

Overview:

Population determination is usually based upon geographical origin of samples or phenotypes but due to unidentified barriers to gene-flow they are not discretely distributed.

The software "Goaty Le" is based on population genetics for assigning breeds among 3 selected population and identifying immigrants.

The software mainly performs the task of demonstrating the presence of
population structure, assigning individuals to population and identifying migrants.

At back end it uses another software for carrying out the above task.
Software named "STRUCTURE 2.1" is used for calculating Fst values and finding Maximum Likelihood
for the population data. It mainly uses Bayesian approach to find Maximum Likelihood and Fst values which then forms a cluster of a population.

A model is assumed in which there are K population (K is probability of an individual to fall under any
one population), each of which is characterized by a set of allele frequencies at each locus(Marker).

It is assumed that the loci are at Hardy-Weinberg equilibrium within population .
This model assumes that markers are not in linkage disequilibrium (LD) within subpopulation so that one
can deal with weakly linked markers.Data here taken are from diploid organism.

Software used here does all the task by using the below methods:

1.Maximum Likelihood.

2.F-Statistics.

3.MCMC Algorithm.

4.Drichilet Distance.

5.Gamma Distribution.

Parameters for all this methods used in the software are set as default along with its data (dinucleotide OR trinucleotide).

 

To ensure sensible result some care is to be taken while entering data and running the
software.

About the Software:

The software package "Goaty Lè" merges two interface COMMAND LINE & FRONT END.
Structure is executed on command line and Data entering & Result interpretation is displayed on front-end.

"For accuracy in the result obtained by using this software , run your data with each population about 10-20 times, and the maximum value obtained(~=0.99 probability) gives the assigned breed for an individual."


Entering Data:

Data entering requires a proper experimentally derived result and proper know-how about all the range for a given marker, any random data are not acceptable and so is not recommended.

In an exceptional case without a prior knowledge of data and its range is designed where for testing purpose press"Load Data " which will enter default data in each field,this is mainly done for getting acquainted with the software (see sample data on the main page).

An outline of a data and its range are :
Individual should be a Diploid organism (2n), and for each loci (Marker position) its corresponding
AMPLICON FRAGMENT is measured .

Three main parameters (which are non-genotypic values) are to be taken into consideration:
1.Population ID (3 digit numeric value).
2.Population No(1,2,or 3 for Zalawadi,Gohilwadi,and Surti respectively).
3.Population Flag(0 or 1 for using population no in the data).

Genotypic values are entered into the text boxes given near each of the marker name.
Each value enetered is specific for an individual and locus.

Marker Name ILsts029 Ilstst019 Jmp29 ILsts033 ILsts030 ILsts082 HH64 Rm088 ILsts065 Eth225 ILsts002 ILsts005 ILsts008 ILsts034 ILsts049 ILsts087 Omhc1 Rm04
Range 144-164 145-159 112-118 144-176 146-172 108-128 122-130 114-140 110-130 138-156 106-130 164-188 166-184 150-180 167-179 136-158 182-200 106-118

Data entered within this range would give nearly accurate result and for missing data enter -9 for both the genotypes of a particular marker

Result:

Result page consists of maximum probable value for an individual to fall under a population.
First line shows all the values for each population, while next line shows highest probability
among them, and the last line shows highlighted population name.

The first line also indicates probable parentage and grand-parentage of an individual where highest probability indicates the population assigned to the individual.

The program will attempt to provide some indication about the nature of any problems that exist.

 

**For getting accurate result POP FlAG must be kept "1" every time you run the program.

Back to Main Page