From 9807ea8cc4e867378c52d51b832168d4bc611d29 Mon Sep 17 00:00:00 2001 From: jprocter Date: Thu, 16 Mar 2006 10:24:50 +0000 Subject: [PATCH] added description of alignment quality scores and separated description of conservation calculation from the instructions for colouring alignments by it. --- help/help.jhm | 7 ++++++- help/helpTOC.xml | 6 ++++-- help/html/calculations/conservation.html | 27 ++++++++++++++++++++++++ help/html/calculations/quality.html | 32 +++++++++++++++++++++++++++++ help/html/colourSchemes/conservation.html | 18 ++++++++-------- 5 files changed, 77 insertions(+), 13 deletions(-) create mode 100755 help/html/calculations/conservation.html create mode 100755 help/html/calculations/quality.html diff --git a/help/help.jhm b/help/help.jhm index 231bcbc..5167e2c 100755 --- a/help/help.jhm +++ b/help/help.jhm @@ -42,7 +42,12 @@ - + + + + diff --git a/help/helpTOC.xml b/help/helpTOC.xml index 1530b05..8402add 100755 --- a/help/helpTOC.xml +++ b/help/helpTOC.xml @@ -9,8 +9,10 @@ - - + + + + diff --git a/help/html/calculations/conservation.html b/help/html/calculations/conservation.html new file mode 100755 index 0000000..cf7faf5 --- /dev/null +++ b/help/html/calculations/conservation.html @@ -0,0 +1,27 @@ + +Alignment Conservation Annotation +

Alignment Conservation Annotation

+

This is an automatically calculated quantitative alignment +annotation which measures the number of conserved physico-chemical +properties conserved for each column of the alignment. Its calculation +is based on the one used in + the AMAS method of multiple sequence alignment analysis :
+

    Livingstone + C.D. and Barton G.J. (1993), Protein Sequence Alignments: A Strategy + for the Hierarchical Analysis of Residue Conservation.CABIOS Vol. 9 + No. 6 (745-756)). +
+

+

Conservation is measured as a numerical index reflecting the +conservation of physico-chemical properties in the alignment: +Identities score highest, and the next most conserved group contain +substitutions to amino acids lying in the same physico-chemical +class.

+ +

Colouring an alignment by conservation
+Conservation scores can be used to colour an alignment. This is +explained further in the help page for conservation colouring. +

+ + diff --git a/help/html/calculations/quality.html b/help/html/calculations/quality.html new file mode 100755 index 0000000..fcd97f9 --- /dev/null +++ b/help/html/calculations/quality.html @@ -0,0 +1,32 @@ + +Alignment Quality Annotation + +

Alignment Quality Annotation

+

Alignment Quality is one of the automatically calculated +quantitative alignment +annotations displayed below the columns of a multiple sequence +alignment (and can be used to shade the alignment). It is an ad-hoc +measure of the likelihood of observing the mutations (if any) in a +particular column of the alignment.

+

+More precisely, the quality score is inversely proportional to the +average cost of all pairs of mutations oberved in a particular column +of the alignment - a high alignment quality score for a column would +suggest that there are no mutations, or most mutations observed are +favourable. +

+ +

The Algorithm
+The quality score is calculated for each column in an alignment by +summing, for all mutations, the ratio of the two BLOSUM 62 scores for +a mutation pair and each residue's conservered BLOSUM62 score (which +is higher). This valueis normalised for each column, and then plotted +on a scale from 0 to 1. +

+

+Multiple alignment algorithms using the BLOSUM 62 substition matrices +should, in theory, maximise alignment quality for an un-gapped +alignment, and locally maximise quality for gapped alignments. +

+ + diff --git a/help/html/colourSchemes/conservation.html b/help/html/colourSchemes/conservation.html index 3ac2c13..59b3929 100755 --- a/help/html/colourSchemes/conservation.html +++ b/help/html/colourSchemes/conservation.html @@ -1,26 +1,24 @@ -Conservation Calculation +Colouring by Conservation

Colouring by Conservation

-

This is an approach to alignment colouring based on the one used in +

This is an approach to alignment colouring which highlights + regions of an alignment where physicochemical properties are + conserved. It is based on the one used in the AMAS method of multiple sequence alignment analysis (Livingstone C.D. and Barton G.J. (1993), Protein Sequence Alignments: A Strategy for the Hierarchical Analysis of Residue Conservation.CABIOS Vol. 9 - No. 6 (745-756)). + No. 6 (745-756)). See the conservation calculation help page for + a more thorough explanation of the calculation.

-

Conservation is measured as a numerical index reflecting the - conservation of physico-chemical properties in the alignment: - Identities score highest, and the next most conserved group contain - substitutions to amino acids lying in the same physico-chemical - class.

For an already coloured alignment, the conservation index at each alignment position is used to modify the shading intensity of the colour at that position. This means that the most conserved columns in each group have the most intense colours, and the least conserved are the palest. The slider controls the contrast between these extremes.

-

Conservation can be calculated over the whole alignment, or just - within specific groups of sequences (such as those defined by +

Conservation can be calculated over all sequences in an alignment, or just + within specific groups (such as those defined by phylogenetic tree partitioning). The option 'apply to all groups' controls whether the contrast slider value will be applied to the indices for the currently -- 1.7.10.2