Epicenter Software




Products >> Genetrix >> Data Management and Organization
Epicenter Software Genetrix

Data Management and Organization


Data Input

Expression data
Import expression array data from Excel spreadsheet, dChip output, MAS 5.0, delimited text file, or using Probe Profiler (Corimbia, Inc) that models probe intensities across all samples simultaneously to provide improved signal to noise ratios, estimates of standard errors, identification of outliers and adjustments for scanner saturation.
SNP data
Import Affymetrix SNP array data and link to expression array data
Covariate data
Import other biological, clinical and/or demographic/descriptive data from an Excel spreadsheet, delimited text files or entered directly with a data editor.

Data Management

Impute missing
Impute missing data using a k-nearest neighbor algorithm.
Transformations
Provides a comprehensive set of tools for standardization, normalization, log transform and/or data permutation, inlcuding the ability to randomize samples or genes to determine the null distribution of statistics.
Presence/absence
Assign presence/absence labels using MAS and/or customized P/A calls.
Data edit
View/edit sample covariate and gene covariate data, gene expression values, their S.E.s and P/A calls. Covariates can be given text labels for easy interpretation.
Recode
Assign values to sample (or gene) covariates based on arithmetic or logical combinations of values of other covariates.
Sort
Multi-key sort of samples or genes.
Labels & Notes
Genes and samples can be conveniently renamed,as required. The sample (or gene) name may incorporate selected covariate information. All projects, samples, genes, gene and sample subsets and metacluster analysis files may include a free-format text annotation.
Survival time definition
Define right censored outcomes in terms of start and end dates/times and a censor date/time or indicator.
Replicates
Experimental replicates can be combined in a user-specified manner, as can data from multiple probe sets that represent a single gene target.
Subsets
Subsets are used to define sets of genes and/or samples to work with. Genes may be selected on their attributes (e.g. pathway membership, molecular function), quality of the data (e.g. proportion of samples with outliers), cluster membership etc. Samples may be selected according to a covariate value, or randomly chosen (to create a test data set). Venn diagrams permit analysis of the relationship between subsets.

Output

Log
An external XML Log file keeps track of each step in an analysis. This file records screen graphics, gene lists and sample lists and can be viewed and edited using a separate viewer. Displays can be captured into a jpg file or the clipboard or sent directly to a printer.
HTML output
User-selected portions of the Log can be reformatted as an HTML file for dissemination.



Home | Products | Buy | Support | Contact Us | All contents ©2004-2007 Epicenter Software. All rights reserved. Epicenter Software