help/html/webServices/proteinDisorder.html

   1 <html>
   2 <!--
   3  * Jalview - A Sequence Alignment Editor and Viewer ($$Version-Rel$$)
   4  * Copyright (C) $$Year-Rel$$ The Jalview Authors
   5  *
   6  * This file is part of Jalview.
   7  *
   8  * Jalview is free software: you can redistribute it and/or
   9  * modify it under the terms of the GNU General Public License
  10  * as published by the Free Software Foundation, either version 3
  11  * of the License, or (at your option) any later version.
  12  *
  13  * Jalview is distributed in the hope that it will be useful, but
  14  * WITHOUT ANY WARRANTY; without even the implied warranty
  15  * of MERCHANTABILITY or FITNESS FOR A PARTICULAR
  16  * PURPOSE.  See the GNU General Public License for more details.
  17  *
  18  * You should have received a copy of the GNU General Public License
  19  * along with Jalview.  If not, see <http://www.gnu.org/licenses/>.
  20  * The Jalview Authors are detailed in the 'AUTHORS' file.
  21  -->
  22 <head>
  23 <title>JABAWS Protein Disorder Prediction Services</title>
  24 </head>
  25 <body>
  26   <p>
  27     <strong>JABAWS Protein Disorder Prediction Services</strong> <br />
  28     The <strong>Web Services&rarr;Disorder</strong> menu in the
  29     alignment window allows access to protein disorder prediction
  30     services provided by the configured <a
  31       href="http://www.compbio.dundee.ac.uk/jabaws"
  32     >JABAWS servers</a>. Each service operates on sequences in the
  33     alignment or currently selected region (<em>since Jalview
  34       2.8.0b1</em>) to identify regions likely to be unstructured or
  35     flexible, or alternately, fold to form globular domains.
  36   </p>
  37   <p>
  38     Predictor results include both <a
  39       href="../features/seqfeatures.html"
  40     >sequence features</a> and sequence associated <a
  41       href="../features/annotation.html"
  42     >alignment annotation</a> rows. Features display is controlled from
  43     the <a href="../features/featureSettings.html">Feature Settings</a>
  44     dialog box. Clicking on the ID for a disorder prediction annotation
  45     row will highlight or select (if double clicked) the associated
  46     sequence for that row. You can also use the <em>Sequence
  47       Associated</em> option in the <a
  48       href="../colourSchemes/annotationColouring.html"
  49     >Colour By Annotation</a> dialog box to colour sequences according to
  50     the results of predictors shown as annotation rows.
  51   </p>
  52   <p>JABAWS 2.0 provides four disorder predictors which are
  53     described below:</p>
  54   <ul>
  55     <li><a href="#disembl">DisEMBL</a></li>
  56     <li><a href="#iupred">IUPred</a></li>
  57     <li><a href="#ronn">RONN</a></li>
  58     <li><a href="#globplot">GlobPlot</a></li>
  59   </ul>
  60   <p>
  61     <strong><a name="disembl"></a><a href="http://dis.embl.de/">DisEMBL
  62         (Linding et al., 2003)</a> </strong> <br /> DisEMBL is a set of
  63     machine-learning based predictors trained to recognise
  64     disorder-related annotation found on PDB structures.
  65   </p>
  66   <table border="1">
  67     <tr>
  68       <td><strong>Name</strong></td>
  69       <td><strong>Annotation type</strong></td>
  70       <td><strong>Description</strong></td>
  71     </tr>
  72     <tr>
  73       <td><strong>COILS</strong></td>
  74       <td>Sequence Feature &amp;<br />Annotation Row
  75       </td>
  76       <td>Predicts loops/coils according to DSSP definition<a
  77         href="#dsspstates"
  78       >[1]</a>.<br />Features mark range(s) of residues predicted as
  79         loops/coils, and annotation row gives raw value for each
  80         residue. Value over 0.516 indicates loop/coil.
  81       </td>
  82     </tr>
  83     <tr>
  84       <td><strong>HOTLOOPS</strong></td>
  85       <td>Sequence Feature &amp;<br />Annotation Row
  86       </td>
  87       <td>&quot;Hot loops constitute a refined subset of <strong>COILS</strong>,
  88         namely those loops with a high degree of mobility as determined
  89         from C&alpha; temperature factors (B factors). It follows that
  90         highly dynamic loops should be considered protein
  91         disorder.&quot;<br /> Features mark range(s) of residues
  92         predicted to be hot loops and annotation row gives raw value for
  93         each residue. Values over 0.6 indicates hot loop.
  94       </td>
  95     </tr>
  96     <tr>
  97       <td><strong>REMARK465</strong></td>
  98       <td>Sequence Feature &amp;<br />Annotation Row
  99       </td>
 100       <td>&quot;Missing coordinates in X-ray structure as defined
 101         by remark465 entries in PDB. Nonassigned electron densities most
 102         often reflect intrinsic disorder, and have been used early on in
 103         disorder prediction.&quot;<br /> Features gives range(s) of
 104         residues predicted as disordered, and annotation row gives raw
 105         value for each residue. Value over 0.1204 indicates disorder.
 106       </td>
 107     </tr>
 108   </table>
 109
 110   <p>
 111     <a name="dsspstates"></a>[1]. DSSP Classification: &alpha;-helix
 112     (H), 310-helix (G), &beta;-strand (E) are ordered, and all other
 113     states (&beta;-bridge (B), &beta;-turn (T), bend (S), &pi;-helix
 114     (I), and coil (C)) considered loops or coils.
 115   </p>
 116
 117
 118   <p>
 119     <strong><a name="ronn"></a><a
 120       href="http://www.strubi.ox.ac.uk/RONN"
 121     >RONN</a></strong> <em>a.k.a.</em> Regional Order Neural Network<br />This
 122     predictor employs an approach known as the 'bio-basis' method to
 123     predict regions of disorder in sequences based on their local
 124     similarity with a gold-standard set of disordered protein sequences.
 125     It yields a set of disorder prediction scores, which are shown as
 126     sequence annotation below the alignment.
 127   </p>
 128   <table border="1">
 129     <tr>
 130       <td><strong>Name</strong></td>
 131       <td><strong>Annotation type</strong></td>
 132       <td><strong>Description</strong></td>
 133     </tr>
 134     <tr>
 135       <td><strong>JRonn</strong>[2]</td>
 136       <td>Annotation Row</td>
 137       <td>RONN score for each residue in the sequence. Scores above
 138         0.5 identify regions of the protein likely to be disordered.</td>
 139     </tr>
 140   </table>
 141   <p>
 142     <em>[2]. JRonn denotes the score for this server because JABAWS
 143       runs a Java port of RONN developed by Peter Troshin and
 144       distributed as part of <a href="http://www.biojava.org/">Biojava
 145         3</a>
 146     </em>
 147   </p>
 148   <p>
 149     <strong><a name="iupred"></a><a
 150       href="http://iupred.enzim.hu/Help.php"
 151     >IUPred</a></strong><br /> IUPred employs an empirical model to estimate
 152     likely regions of disorder. There are three different prediction
 153     types offered, each using different parameters optimized for
 154     slightly different applications. It provides raw scores based on two
 155     models for predicting regions of 'long disorder' and 'short
 156     disorder'. A third predictor identifies regions likely to form
 157     structured domains.
 158   </p>
 159   <table border="1">
 160     <tr>
 161       <td><strong>Name</strong></td>
 162       <td><strong>Annotation type</strong></td>
 163       <td><strong>Description</strong></td>
 164     </tr>
 165     <tr>
 166       <td><strong>Long disorder</strong></td>
 167       <td>Annotation Row</td>
 168       <td>Prediction of context-independent global disorder that
 169         encompasses at least 30 consecutive residues of predicted
 170         disorder. Employs a 100 residue window for calculation.<br />Values
 171         above 0.5 indicates the residue is intrinsically disordered.
 172       </td>
 173     </tr>
 174     <tr>
 175       <td><strong>Short disorder</strong></td>
 176       <td>Annotation Row</td>
 177       <td>Predictor for short, (and probably) context-dependent,
 178         disordered regions, such as missing residues in the X-ray
 179         structure of an otherwise globular protein. Employs a 25 residue
 180         window for calculation, and includes adjustment parameter for
 181         chain termini which favors disorder prediction at the ends.<br />Values
 182         above 0.5 indicate short-range disorder.
 183       </td>
 184     </tr>
 185     <tr>
 186       <td><strong>Structured domains</strong></td>
 187       <td>Sequence Feature</td>
 188       <td>Features highlighting likely globular domains useful for
 189         structure genomics investigation. <br />Post-analysis of
 190         disordered region profile to find continuous regions confidently
 191         predicted to be ordered. Neighbouring regions close to each
 192         other are merged, while regions shorter than the minimal domain
 193         size of at least 30 residues are ignored.
 194       </td>
 195     </tr>
 196   </table>
 197   <p>
 198     <strong><a name="globplot"></a><a
 199       href="http://globplot.embl.de/"
 200     >GLOBPLOT</a></strong><br /> Defines regions of globularity or natively
 201     unstructured regions based on a running sum of the propensity of
 202     residues to be structured or unstructured. The propensity is
 203     calculated based on the probability of each amino acid being
 204     observed within well defined regions of secondary structure or
 205     within regions of random coil. The initial signal is smoothed with a
 206     Savitzky-Golay filter, and its first order derivative computed.
 207     Residues for which the first order derivative is positive are
 208     designated as natively unstructured, whereas those with negative
 209     values are structured.<br />
 210   <table border="1">
 211     <tr>
 212       <td><strong>Name</strong></td>
 213       <td><strong>Annotation type</strong></td>
 214       <td><strong>Description</strong></td>
 215     </tr>
 216     <tr>
 217       <td><strong>Disordered Region</strong></td>
 218       <td>Sequence Feature</td>
 219       <td><br />Sequence features marking range(s) of residues
 220         with positive dydx values (correspond to the #Disorder column
 221         from JABAWS results)</td>
 222     </tr>
 223     <tr>
 224       <td><strong>Globular Domain</strong>
 225       <td>Sequence Feature</td>
 226       <td>Putative globular domains</td>
 227     </tr>
 228     <tr>
 229       <td><strong>Dydx</strong></td>
 230       <td>Annotation row</td>
 231       <td>First order derivative of smoothed score. Values above 0
 232         indicates residue is disordered.</td>
 233     </tr>
 234     <tr>
 235       <td><strong>Smoothed Score<br />Raw Score
 236       </strong></td>
 237       <td>Annotation Row</td>
 238       <td>The smoothed and raw scores used to create the
 239         differential signal that indicates the presence of unstructured
 240         regions.<br /> <em>These are hidden by default, but can be
 241           shown by right-clicking on the alignment annotation panel and
 242           selecting <strong>Show hidden annotation</strong>
 243       </em>
 244       </td>
 245     </tr>
 246   </table>
 247   <p>
 248     <em>Documentation and thresholds for the JABAWS Disorder
 249       predictors adapted from a personal communication by Nancy Giang,
 250       2012.</em>
 251   </p>
 252 </body>
 253 </html>