Home

Download Download GeneXPress Download Analysis Packages

Tutorials

Frequently Asked Questions

People


Previous     Next

Finding significant functional groups in clusters

From the Attributes menu, choose Load > From File.  Then select the appropriate attributes file (.gxa).  Attributes files are available for download in the Download section.

 

Analyze Attributes

From the analyze menu, choose Attributes.  The above menu will appear.

Here, you can select which of the loaded attributes you wish to analyze.  Usually, you will be interested in gene attributes, as you will have loaded an annotations file that describes the genes of your particular organism.

Minimum hits means that the attribute must be present at least that many times in the cluster in order for the cluster to be analyzed for enrichment of that attribute.  This prevents, for example, a small cluster with only one occurance of an attribute from showing that that attribute is significant in the data.

Similarly, minimum # genes indicates that only clusters of a certain minimun size will be analyzed.  Again, this is to prevent a cluster with extremely few genes from showing up as enriched for attributes.  A cluster that only has with 50% of its genes having a certain attribute tends to be much more meaningful in a cluster with 200 genes than in a cluster with 2 genes.

Maximum p-value sets a threshold p-value.  Only p-values more significant that the selected p-value will register in the analysis.

Pearson Cutoff is only available if your file is a TreeView format file, and your are analyzing "All Clusters". This option allows you to set a Pearson correlation threshold, and it will return you a disjoint set of clusters created by walking down each branch of the tree and finding the first cluster on the branch that whose Pearson correlation is above the threshold. If this cluster meets the other criteria, it will be included. Otherwise, it will not be included. No other cluters on the branch will be searched. walking down the hierarchy at each split cross this threshold

Spreadsheet vs. Graphical: Allows you to select whether you want to view your results quantitatively or graphically.

Click Analyze to begin.

Results

The program generates a table.  For each attributes, we see the following:

Cluster: The cluster in which the attribute was detected.

P-val: The probability that the distribution of the attribute in this cluster versus the dataset as a whole was due to chance.

Hits: Number of times the attribute occurs in this cluster.

Total: Number of times the attribute occurs in the dataset.

Genome Total: Number of times the attribute occurs in the genome.

#Genes, #Expts, mean, STD: Information about the cluster.

The table is sortable by colums.  Click on a header to sort descending, and hit Shift+click to sort ascending.





Click to continue:
  • Next:
  • Find significant motif binding sites in your clusters
  • Previous:
  • Bookmark your favorite clusters
  • Home:
  • Tutorials Home