binaries/src/fasta34/fasts3.1

   1 .TH FASTS/TFASTSv3 1 local
   2 .SH NAME
   3 fasts3, fasts3_t \- compare several short peptide sequences against a protein
   4 database using a modified fasta algorithm.
   5
   6 tfasts3, tfasts3_t \- compare short pepides against a
   7 translated DNA database.
   8
   9 .SH DESCRIPTION
  10
  11 .B fasts3
  12 and
  13 .B tfasts3
  14 are designed to compare set of (presumably non-contiguous) peptides to
  15 a protein (fasts3) or translated DNA (tfasts3) database.
  16 fasts3/tfasts3 are designed particularly for short peptide data from
  17 mass-spec analysis of protein digests.  Unlike the traditional
  18 .B fasta3
  19 search, which uses a protein or DNA sequence,
  20 .B fasts3
  21 and
  22 .B tfasts3
  23 work with a query sequence of the form:
  24 .in +5
  25 .nf
  26 >tests from mgstm1
  27 MLLE,
  28 MILGYW,
  29 MGADP,
  30 MLCYNP
  31 .fi
  32 .in 0
  33 This sequence indicates that four peptides are to be used.  When this
  34 sequence is compared against mgstm1.aa (included with the
  35 distribution), the result is:
  36 .nf
  37 .ft C
  38 .in +5
  39 testf    MILGYW----------MLLE------------MGDAP-----------
  40          ::::::          ::::            :::::
  41 GT8.7  MPMILGYWNVRGLTHPIRMLLEYTDSSYDEKRYTMGDAPDFDRSQWLNEK
  42                10        20        30        40        50
  43
  44 testf  --------------------------------------------------
  45
  46 GT8.7  FKLGLDFPNLPYLIDGSHKITQSNAILRYLARKHHLDGETEEERIRADIV
  47                60        70        80        90       100
  48
  49                       20
  50 testf  ------------MLCYNP
  51                    ::::::
  52 GT8.7  ENQVMDTRMQLIMLCYNPDFEKQKPEFLKTIPEKMKLYSEFLGKRPWFAG
  53               110       120       130       140       150
  54 .in 0
  55 .ft P
  56 .fi
  57 .SH Options
  58 .LP
  59 .B fasts3
  60 and
  61 .B tfasts3
  62 can accept a query sequence from the unix "stdin" data stream.  This makes it much
  63 easier to use fasta3 and its relatives as part of a WWW page. To
  64 indicate that stdin is to be used, use "-" or "@" as the query
  65 sequence file name.
  66 .TP
  67 \-b #
  68 number of best scores to show (must be < -E cutoff)
  69 .TP
  70 \-d #
  71 number of best alignments to show ( must be < -E cutoff)
  72 .TP
  73 \-D
  74 turn on debugging mode.  Enables checks on sequence alphabet that
  75 cause problems with tfastx3, tfasty3, tfasta3.
  76 .TP
  77 \-E #
  78 Expectation value limit for displaying scores and
  79 alignments.  Expectation values for
  80 .B fasts3
  81 and
  82 .B tfasts3
  83 are not as accurate as those for the other
  84 .B fasta3
  85 programs.
  86 .TP
  87 \-H
  88 turn off histogram display
  89 .TP
  90 \-i
  91 compare against only the reverse complement of the library sequence.
  92 .TP
  93 \-L
  94 report long sequence description in alignments
  95 .TP
  96 \-m 0,1,2,3,4,5,6,9,10
  97 alignment display options
  98 .TP
  99 \-N #
 100 break long library sequences into blocks of # residues.  Useful for
 101 bacterial genomes, which have only one sequence entry.  -N 2000 works
 102 well for well for bacterial genomes.
 103 .TP
 104 \-O file
 105 send output to file
 106 .TP
 107 \-q/-Q
 108 quiet option; do not prompt for input
 109 .TP
 110 \-R file
 111 save all scores to statistics file
 112 .TP
 113 \-S #
 114 offset substitution matrix values by  a constant #
 115 .TP
 116 \-s name
 117 specify substitution matrix.  BLOSUM50 is used by default;
 118 PAM250, PAM120, and BLOSUM62 can be specified by setting -s P120,
 119 P250, or BL62.  With this version, many more scoring matrices are
 120 available, including BLOSUM80 (BL80), and MDM_10, MDM_20, MDM_40 (M10,
 121 M20, M40). Alternatively, BLASTP1.4 format scoring matrix files can be
 122 specified.
 123 .TP
 124 \-T #
 125 (threaded, parallel only) number of threads or workers to use (set by
 126 default to 4 at compile time).
 127 .TP
 128 \-t #
 129 Translation table - tfasts3 can use the BLAST tranlation tables.  See
 130 \fChttp://www.ncbi.nlm.nih.gov/htbin-post/Taxonomy/wprintgc?mode=c/\fP.
 131 .TP
 132 \-w #
 133 line width for similarity score, sequence alignment, output.
 134 .TP
 135 \-x "#,#"
 136 offsets query, library sequence for numbering alignments
 137 .TP
 138 \-z #
 139 Specify statistical calculation. Default is -z 1, which uses
 140 regression against the length of the library sequence. -z 0 disables
 141 statistics.  -z 2 uses the ln() length correction. -z 3 uses Altschul
 142 and Gish's statistical estimates for specific protein BLOSUM scoring
 143 matrices and gap penalties. -z 4: an alternate regression method.
 144 .TP
 145 \-Z db_size
 146 Set the apparent database size used for expectation value calculations.
 147 .TP
 148 \-3
 149 (TFASTS3 only) use only forward frame translations
 150 .SH Environment variables:
 151 .TP
 152 FASTLIBS
 153 location of library choice file (-l FASTLIBS)
 154 .TP
 155 SMATRIX
 156 default scoring matrix (-s SMATRIX)
 157 .TP
 158 SRCH_URL
 159 the format string used to define the option to re-search the
 160 database.
 161 .TP
 162 REF_URL
 163 the format string used to define the option to lookup the library
 164 sequence in entrez, or some other database.
 165
 166 .SH AUTHOR
 167 Bill Pearson
 168 .br
 169 wrp@virginia.EDU