|
||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||
java.lang.ObjectSIGRS.KAStatistics
public class KAStatistics
SIGRS is a collection of routines used in searching for regions of contrasting composition (CCRs) in sequence files using a partial sum process. Significance of segments is evaluated using Karlin-Altschul statistics and specifically an extension by Karlin-Dembo allowing for nucleotides to have a Markov-dependence (see e.g. Karlin & Altschul (1993) and Karlin & Dembo (1992)
The routines are provided as is and no guarantee regarding stability etc. is given so use at your own risk!
See publication Larsson, P., Hinas, A., Ardell, D.H., Kirsebom, L.A., Virtanen, A. and Söderbom, F. De novo search for non-coding RNA genes in the AT-rich genome of Dictyostelium discoideum: performance of Markov-dependent genome feature scoring
Questions and comments can be directed to Pontus.Larsson@icm.uu.se
| Constructor Summary | |
|---|---|
KAStatistics()
|
|
| Method Summary | |
|---|---|
static double |
cutoff(double alpha,
int N,
double K,
double L)
Determines the bit score cutoff at significance level alpha. |
static double |
entropy(double[][] s,
double[][] p,
double L)
Calculates the entropy of a scoring matrix |
static double |
estimateK(double[][] s,
double[][] p,
double L,
double H)
Estimates the parameter K for the independant nucleotides case For details, see page 8 in BLAST scoring parameters |
static double |
expect(double score,
double N,
double K,
double L)
Calculates the expect value of a score according to Karlin & Altschul (1993) p.5875 [y=K*N*exp(-lambda*x)] |
static double |
gcd(double a,
double b)
Finds the greatest common divisor of two numbers using the Euclidian algorithm |
static double |
lambda(double[][] s,
double[][] p)
Estimates lambda by an iterative Newton-Rhapson until convergence Terminates execution if convergence is not reached within 10000 iterations |
static double[][] |
reshape(double[][] s,
double[][] p)
Returns a matrix spanning the scores in s where the first column of each row is a score and the second column is the total probability of observing that score |
| Methods inherited from class java.lang.Object |
|---|
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Constructor Detail |
|---|
public KAStatistics()
| Method Detail |
|---|
public static double cutoff(double alpha,
int N,
double K,
double L)
alpha - The significance levelN - The search space sizeK - The parameter KL - The parameter lambda
public static double entropy(double[][] s,
double[][] p,
double L)
s - The input score matrixp - The probabilities associated with the score matrixL - The estimated lambda parameter for s and p
public static double estimateK(double[][] s,
double[][] p,
double L,
double H)
s - Score matrixp - Probability matrix associated with scoresL - The estimated lambdaH - The calculated entropy of the score matrix
public static double gcd(double a,
double b)
a - b -
public static double expect(double score,
double N,
double K,
double L)
score - The bit score to calculate expect value forN - The search space sizeK - The parameter KL - The parameter lambda
public static double lambda(double[][] s,
double[][] p)
s - Score matrixp - Probability matrix associated with scores
public static double[][] reshape(double[][] s,
double[][] p)
|
||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||