EMfit_binom.Rd
EMfit_binom
estimates, per locus, parameters of an assumed binomial mixture model via expectation maximization (genotype
frequencies, allelic bias i.e. heterozygous p-parameter).
Through this fit, per-samples genotype probabilities are obtained as well. A fit assuming no allelic bias (heterozygous p-parameter = 0.5) is
performed as well, for the purpose of significant allelic bias detection via likelihood ratio test.
EMfit_binom(
data_counts,
SE,
allelefreq = 0.5,
inbr = 0,
dltaco = 0.001,
HWE = FALSE,
p_InitEst = FALSE
)
Data frame. Data frame of a SNP with reference and variant counts ("ref_count" and "var_count", respectively) for each sample ("sample").
Number. Sequencing error rate.
Number. Allele frequency. Only used when pInitEst is TRUE (default = 0.5)
Number. Degree of inbreeding (default = 0).
Number. Minimal difference between 2 iterations (default = 0.001).
Logical. Should HWE be used for allele frequency estimation, not recommended (default = FALSE).
Logical. Calculate initial estimates of pr, pv and prv from allelefreq, not recommended (default = FALSE).
A list containing the following components:
The estimated allelic shift.
The test statistic of the likelihood ratio test.
The p-value of the likelihood ratio test.
The goodness-of-fit value based on the corrected likelihood.
The number of iterations.
Indicates the quality of a locus. An "!" indicates bad data or a bad fit due to no apparent heterozygosity.
The variant (homozygote) genotype probability for the SNP.
The reference (homozygote) genotype probability for the SNP.
The heterozyous genotype probability for the SNP.
The variant (homozygote) genotype probability with Allelic Bias = 0.5.
The reference (homozygote) genotype probability with Allelic Bias = 0.5.
The heterozyous genotype probability with Allelic Bias = 0.5.
Data frame. Input data frame with extra columns: allelefreq, genotype and genotype probabilities (prr, prv, pvv) per sample.