help/help/html/webServices/proteinDisorder.html

   1 <html>
   2 <!--
   3  * Jalview - A Sequence Alignment Editor and Viewer ($$Version-Rel$$)
   4  * Copyright (C) $$Year-Rel$$ The Jalview Authors
   5  *
   6  * This file is part of Jalview.
   7  *
   8  * Jalview is free software: you can redistribute it and/or
   9  * modify it under the terms of the GNU General Public License
  10  * as published by the Free Software Foundation, either version 3
  11  * of the License, or (at your option) any later version.
  12  *
  13  * Jalview is distributed in the hope that it will be useful, but
  14  * WITHOUT ANY WARRANTY; without even the implied warranty
  15  * of MERCHANTABILITY or FITNESS FOR A PARTICULAR
  16  * PURPOSE.  See the GNU General Public License for more details.
  17  *
  18  * You should have received a copy of the GNU General Public License
  19  * along with Jalview.  If not, see <http://www.gnu.org/licenses/>.
  20  * The Jalview Authors are detailed in the 'AUTHORS' file.
  21  -->
  22 <head>
  23 <title>JABAWS Protein Disorder Prediction Services</title>
  24 </head>
  25 <body>
  26   <p>
  27     <strong>JABAWS Protein Disorder Prediction Services</strong> <br />
  28     The <strong>Web Services&rarr;Disorder</strong> menu in the
  29     alignment window allows access to protein disorder prediction
  30     services provided by the configured <a
  31       href="http://www.compbio.dundee.ac.uk/jabaws">JABAWS
  32       servers</a>. Each service operates on sequences in the alignment or
  33     currently selected region (<em>since Jalview 2.8.0b1</em>) to
  34     identify regions likely to be unstructured or flexible, or
  35     alternately, fold to form globular domains.
  36   </p>
  37   <p>
  38     Predictor results include both <a
  39       href="../features/seqfeatures.html">sequence features</a> and
  40     sequence associated <a href="../features/annotation.html">alignment
  41       annotation</a> rows. Features display is controlled from the <a
  42       href="../features/featuresettings.html">Feature Settings</a>
  43     dialog box. Clicking on the ID for a disorder prediction annotation
  44     row will highlight or select (if double clicked) the associated
  45     sequence for that row. You can also use the <em>Sequence
  46       Associated</em> option in the <a
  47       href="../colourSchemes/annotationColouring.html">Colour
  48       By Annotation</a> dialog box to colour sequences according to the
  49     results of predictors shown as annotation rows.
  50   </p>
  51   <p>JABAWS 2.2 provides four disorder predictors which are
  52     described below:</p>
  53   <ul>
  54     <li><a href="#disembl">DisEMBL</a></li>
  55     <li><a href="#iupred">IUPred</a></li>
  56     <li><a href="#ronn">RONN</a></li>
  57     <li><a href="#globplot">GlobPlot</a></li>
  58   </ul>
  59   <p>
  60     <strong><a name="disembl"></a><a href="http://dis.embl.de/">DisEMBL
  61         (Linding et al., 2003)</a> </strong> <br /> DisEMBL is a set of
  62     machine-learning based predictors trained to recognise
  63     disorder-related annotation found on PDB structures.
  64   </p>
  65   <table border="1">
  66     <tr>
  67       <td><strong>Name</strong></td>
  68       <td><strong>Annotation type</strong></td>
  69       <td><strong>Description</strong></td>
  70     </tr>
  71     <tr>
  72       <td><strong>COILS</strong></td>
  73       <td>Sequence Feature &amp;<br />Annotation Row
  74       </td>
  75       <td>Predicts loops/coils according to DSSP definition<a
  76         href="#dsspstates">[1]</a>.<br />Features mark range(s)
  77         of residues predicted as loops/coils, and annotation row gives
  78         raw value for each residue. Value over 0.516 indicates
  79         loop/coil.
  80       </td>
  81     </tr>
  82     <tr>
  83       <td><strong>HOTLOOPS</strong></td>
  84       <td>Sequence Feature &amp;<br />Annotation Row
  85       </td>
  86       <td>&quot;Hot loops constitute a refined subset of <strong>COILS</strong>,
  87         namely those loops with a high degree of mobility as determined
  88         from C&alpha; temperature factors (B factors). It follows that
  89         highly dynamic loops should be considered protein
  90         disorder.&quot;<br /> Features mark range(s) of residues
  91         predicted to be hot loops and annotation row gives raw value for
  92         each residue. Values over 0.6 indicates hot loop.
  93       </td>
  94     </tr>
  95     <tr>
  96       <td><strong>REMARK465</strong></td>
  97       <td>Sequence Feature &amp;<br />Annotation Row
  98       </td>
  99       <td>&quot;Missing coordinates in X-ray structure as defined
 100         by remark465 entries in PDB. Nonassigned electron densities most
 101         often reflect intrinsic disorder, and have been used early on in
 102         disorder prediction.&quot;<br /> Features gives range(s) of
 103         residues predicted as disordered, and annotation row gives raw
 104         value for each residue. Value over 0.1204 indicates disorder.
 105       </td>
 106     </tr>
 107   </table>
 108
 109   <p>
 110     <a name="dsspstates"></a>[1]. DSSP Classification: &alpha;-helix
 111     (H), 310-helix (G), &beta;-strand (E) are ordered, and all other
 112     states (&beta;-bridge (B), &beta;-turn (T), bend (S), &pi;-helix
 113     (I), and coil (C)) considered loops or coils.
 114   </p>
 115
 116
 117   <p>
 118     <strong><a name="ronn"></a><a
 119       href="http://www.strubi.ox.ac.uk/RONN">RONN</a></strong> <em>a.k.a.</em>
 120     Regional Order Neural Network<br />This predictor employs an
 121     approach known as the 'bio-basis' method to predict regions of
 122     disorder in sequences based on their local similarity with a
 123     gold-standard set of disordered protein sequences. It yields a set
 124     of disorder prediction scores, which are shown as sequence
 125     annotation below the alignment.
 126   </p>
 127   <table border="1">
 128     <tr>
 129       <td><strong>Name</strong></td>
 130       <td><strong>Annotation type</strong></td>
 131       <td><strong>Description</strong></td>
 132     </tr>
 133     <tr>
 134       <td><strong>JRonn</strong>[2]</td>
 135       <td>Annotation Row</td>
 136       <td>RONN score for each residue in the sequence. Scores above
 137         0.5 identify regions of the protein likely to be disordered.</td>
 138     </tr>
 139   </table>
 140   <p>
 141     <em>[2]. JRonn denotes the score for this server because JABAWS
 142       runs a Java port of RONN developed by Peter Troshin and
 143       distributed as part of <a href="http://www.biojava.org/">Biojava
 144         3</a>
 145     </em>
 146   </p>
 147   <p>
 148     <strong><a name="iupred"></a><a
 149       href="http://iupred.enzim.hu/">IUPred</a></strong><br />
 150     IUPred employs an empirical model to estimate likely regions of
 151     disorder. There are three different prediction types offered, each
 152     using different parameters optimized for slightly different
 153     applications. It provides raw scores based on two models for
 154     predicting regions of 'long disorder' and 'short disorder'. A third
 155     predictor identifies regions likely to form structured domains.
 156   </p>
 157   <table border="1">
 158     <tr>
 159       <td><strong>Name</strong></td>
 160       <td><strong>Annotation type</strong></td>
 161       <td><strong>Description</strong></td>
 162     </tr>
 163     <tr>
 164       <td><strong>Long disorder</strong></td>
 165       <td>Annotation Row</td>
 166       <td>Prediction of context-independent global disorder that
 167         encompasses at least 30 consecutive residues of predicted
 168         disorder. Employs a 100 residue window for calculation.<br />Values
 169         above 0.5 indicates the residue is intrinsically disordered.
 170       </td>
 171     </tr>
 172     <tr>
 173       <td><strong>Short disorder</strong></td>
 174       <td>Annotation Row</td>
 175       <td>Predictor for short, (and probably) context-dependent,
 176         disordered regions, such as missing residues in the X-ray
 177         structure of an otherwise globular protein. Employs a 25 residue
 178         window for calculation, and includes adjustment parameter for
 179         chain termini which favors disorder prediction at the ends.<br />Values
 180         above 0.5 indicate short-range disorder.
 181       </td>
 182     </tr>
 183     <tr>
 184       <td><strong>Structured domains</strong></td>
 185       <td>Sequence Feature</td>
 186       <td>Features highlighting likely globular domains useful for
 187         structure genomics investigation. <br />Post-analysis of
 188         disordered region profile to find continuous regions confidently
 189         predicted to be ordered. Neighbouring regions close to each
 190         other are merged, while regions shorter than the minimal domain
 191         size of at least 30 residues are ignored.
 192       </td>
 193     </tr>
 194   </table>
 195   <p>
 196     <strong><a name="globplot"></a><a
 197       href="http://globplot.embl.de/">GLOBPLOT</a></strong><br /> Defines
 198     regions of globularity or natively unstructured regions based on a
 199     running sum of the propensity of residues to be structured or
 200     unstructured. The propensity is calculated based on the probability
 201     of each amino acid being observed within well defined regions of
 202     secondary structure or within regions of random coil. The initial
 203     signal is smoothed with a Savitzky-Golay filter, and its first order
 204     derivative computed. Residues for which the first order derivative
 205     is positive are designated as natively unstructured, whereas those
 206     with negative values are structured.<br />
 207   <table border="1">
 208     <tr>
 209       <td><strong>Name</strong></td>
 210       <td><strong>Annotation type</strong></td>
 211       <td><strong>Description</strong></td>
 212     </tr>
 213     <tr>
 214       <td><strong>Disordered Region</strong></td>
 215       <td>Sequence Feature</td>
 216       <td><br />Sequence features marking range(s) of residues
 217         with positive dydx values (correspond to the #Disorder column
 218         from JABAWS results)</td>
 219     </tr>
 220     <tr>
 221       <td><strong>Globular Domain</strong>
 222       <td>Sequence Feature</td>
 223       <td>Putative globular domains</td>
 224     </tr>
 225     <tr>
 226       <td><strong>Dydx</strong></td>
 227       <td>Annotation row</td>
 228       <td>First order derivative of smoothed score. Values above 0
 229         indicates residue is disordered.</td>
 230     </tr>
 231     <tr>
 232       <td><strong>Smoothed Score<br />Raw Score
 233       </strong></td>
 234       <td>Annotation Row</td>
 235       <td>The smoothed and raw scores used to create the
 236         differential signal that indicates the presence of unstructured
 237         regions.<br /> <em>These are hidden by default, but can be
 238           shown by right-clicking on the alignment annotation panel and
 239           selecting <strong>Show hidden annotation</strong>
 240       </em>
 241       </td>
 242     </tr>
 243   </table>
 244   <p>
 245     <em>Documentation and thresholds for the JABAWS Disorder
 246       predictors adapted from a personal communication by Nancy Giang,
 247       2012.</em>
 248   </p>
 249 </body>
 250 </html>