R/pcs.R
pcs.prepare.Rd
Prepares principal components of two datasets for colocalisation testing.
pcs.prepare(X1, X2, impute = TRUE)
X1 | Either a SnpMatrix or numeric matrix of genetic data. Columns index SNPs, rows index samples. |
---|---|
X2 | as X1 |
impute | if TRUE (default), impute missing genotypes |
a colocPCs
object.
If X1
and X2
are SnpMatrix
objects, they are checked for
missing data, and any missing values imputed by repeated use of
impute.snps
from the snpStats
package.
Columns with common names are rbind
ed together and principal
components calculated using prcomp
.
pcs.model
can then be invoked to create glm
objects.
Wallace et al (2012). Statistical colocalisation of monocyte gene expression and genetic risk variants for type 1 diabetes. Hum Mol Genet 21:2815-2824. http://europepmc.org/abstract/MED/22403184
Plagnol et al (2009). Statistical independence of the colocalized association signals for type 1 diabetes and RPS26 gene expression on chromosome 12q13. Biostatistics 10:327-34. http://www.ncbi.nlm.nih.gov/pubmed/19039033
## simulate covariate matrix (X) and continuous response vector (Y) ## for two populations/triats Y1 and Y2 depend equally on f1 and f2 ## within each population, although their distributions differ between ## populations. They are compatible with a null hypothesis that they ## share a common causal variant, with the effect twice as strong for ## Y2 as Y1 set.seed(1) X1 <- matrix(rbinom(5000,1,0.4),ncol=10) Y1 <- rnorm(500,apply(X1[,1:2],1,sum),2) X2 <- matrix(rbinom(5000,1,0.6),ncol=10) Y2 <- rnorm(500,2*apply(X2[,1:2],1,sum),5) ## generate principal components object colnames(X1) <- colnames(X2) <- make.names(1:ncol(X1)) pcs <- pcs.prepare(X1,X2) ## generate glm objects m1 <- pcs.model(pcs, group=1, Y=Y1)#> selecting 8 components out of 10 to capture 0.8299584 of total variance.#> selecting 8 components out of 10 to capture 0.8299584 of total variance.#> eta.hat chisquare n p.value.chisquare #> 2.8822102 8.7378708 8.0000000 0.2720277