X-Git-Url: http://source.jalview.org/gitweb/?a=blobdiff_plain;f=help%2Fhtml%2Fio%2Fmodellerpir.html;h=65e41bc7cd474b9edff9efdcb9f8875adaaebdde;hb=c53aeee9a688f68d64106a968876e542f701fbc6;hp=e18e273bc6e63d17a8266460e4a4915b193850f3;hpb=6ab4ef1cc71ff9d28a21a139db69e4a8351a3fb5;p=jalview.git diff --git a/help/html/io/modellerpir.html b/help/html/io/modellerpir.html index e18e273..65e41bc 100755 --- a/help/html/io/modellerpir.html +++ b/help/html/io/modellerpir.html @@ -16,3 +16,47 @@ * * You should have received a copy of the GNU General Public License along with Jalview. If not, see . --> + +Modeller PIR Format IO + + +

Modeller PIR Format IO

+

The homology modelling program, Modeller uses a special form +of the PIR format where information about sequence numbering and chain +codes are written into the 'description' line between the PIR protein +tag and the protein alignment entry:

+
>P1;Q93Z60_ARATH
+sequence:Q93Z60_ARATH:1:.:118:.:.
+----MASTALSSAIVSTSFLRRQQTPISLRSLPFANT-QSLFGLKS-STARGGRVTAMATYKVKFITPEGEQ
+EVECEEDVYVLDAAEEAGLDLPYSCRAGSCSSCAGKVVSGSIDQSD------QSFLD-D-------------
+---------------------*
+>P1;PDB|1FER|_
+structureX:1FER:1:.:105:.:.
+----------------------------------------------------AFVVTDNCIKCKY---TDCV
+EV-CPVDCFY----EGPNFLVIHPDECIDCALCEPECPAQAIFSEDEVPEDMQEFIQLNAELAEVWPNITEK
+KDPLPDAEDWDGVKGKLQHLE*
+
+

Jalview will attempt to parse any PIR entries conforming to the +Modeller/PIR format, in order to extract the sequence start and end +numbering and (possibly) a PDB file reference. The description line +information is always stored in the sequence description string - so +no information is lost if this parsing process fails.

+

The 'Modeller Output' flag in the 'Output' tab of the Jalview Preferences dialog +box controls whether Jalview will also output MODELLER style PIR +files. In this case, any existing 'non-modeller PIR' header +information in the description string of an alignment is appended to +an automatically generated modeller description line for that +sequence.

+

The general format used for generating the Modeller/PIR sequence +description line is shown below : +

>P1;Primary_Sequence_ID
+sequence or structureX:pdb-reference if
+  available:start residue:start chain code:end
+  residue:end chain code:. description text
+The first field is either sequence or structureX, depending upon the +presence of a PDB database ID for the sequence. If the protein has no +PDB reference, then the chain code is not specified, unless one +already existed when the sequence was imported into Jalview. + +