From 3ec000e099e35b359996b824aabeaee632628deb Mon Sep 17 00:00:00 2001
From: amwaterhouse
Principal components analysis is a technique for examining the -structure of complex datasets. The components are a set of dimensions -formed from the measured values in the dataset, and the principle +structure of complex data sets. The components are a set of dimensions +formed from the measured values in the data set, and the principle component is the one with the greatest magnitude, or length. The sets of measurements that differ the most should lie at either end of this principle axis, and the other axes correspond to less extreme -patterns of variation in the dataset. +patterns of variation in the data set.
In this case, the components are generated by an eigenvector @@ -29,21 +29,21 @@ href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=pubmed&dopt= href="http://industry.ebi.ac.uk/SeqSpace/">http://industry.ebi.ac.uk/SeqSpace) at the EBI.
-The PCA Viewer
+The PCA Viewer
This is an interactive display of the sequences positioned within the similarity space. The colour of each sequence point is the same - as the sequence group coloring, white if no colour has been + as the sequence group colours, white if no colour has been defined for the sequence, and green if the sequence is part of a the currently selected group.
The 3d view can be rotated by dragging the mouse with the left mouse button pressed. The view can also be zoomed in and out with the up and down arrow - keys.
+ keys.A tool tip gives the sequence ID corresponding to a point in the space, and clicking a point toggles the selection of the - corresponding sequence in the alignment window. Rectanglar region - based selection is also possible, by holding the 's' key whilst + corresponding sequence in the alignment window. Rectangular region + based selection is also possible, by holding the 'S' key whilst left-clicking and dragging the mouse over the display.
Initially, the display shows the first three components of the diff --git a/help/html/calculations/redundancy.html b/help/html/calculations/redundancy.html index e97e3b0..1f3d464 100755 --- a/help/html/calculations/redundancy.html +++ b/help/html/calculations/redundancy.html @@ -3,13 +3,9 @@
Removing redundancy
Selecting this option brings up a window asking you to select a threshold. - If the percentage identity between two sequences (based on the - current alignment) exceeds this value one of the + If the percentage identity between two sequences exceeds this value one of the sequences (the shorter) is discarded. The redundancy calculation is done when - the Apply button is pressed, and the undo button recovers the - original set or any previously calculated subset. The detection of - redundant sequences involves a pairwise comparison of all the - sequences, so you may have to wait a few moments when applying this to - large sequence sets.
+ the Apply button is pressed. For large numbers of sequences this can take a + long time as all pairs have to be compared.