binaries/src/fasta34/readme.pvm_3.4

   1
   2  $Name: fa_34_26_5 $ - $Id: readme.pvm_3.4,v 1.3 2001/09/17 21:18:19 wrp Exp $
   3
   4
   5 20-August-2001
   6
   7 The pvm/mpi complib programs have been substantially updated with
   8 release 3.4.  See readme.v34t0 for more information.  With version
   9 3.4, the MPI programs are mp34comp*, mu34comp*, etc.
  10
  11 A major effect of this change is to disable automatic sequence type
  12 (protein/DNA) recognition with pv34compfa/mp34compfa.  By default,
  13 protein libraries are assumed.  Thus, pv34compfa/mp34compfa require
  14 the "-n" command line option when running pv34compfa/mp34compfa on DNA
  15 sequence libraries.  This issue does not occur with the other
  16 programs, which will recognize the appropriate sequence type, because
  17 it is determined by the program (e.g. pv34compfx requires
  18 DNA:protein).
  19
  20 ================
  21 pv4comp* - July, August, 2000
  22
  23 As noted in readme.pvm_3.3 - the major problem that users have had
  24 with the PVM/MPI version of the programs is in reading database files
  25 on the nodes.  All previous versions of the program (pvcompfa,
  26 pv3compfa, etc) had the nodes read the databases in parallel. Thus,
  27 the database file had to be visible to the nodes, typically through
  28 NFS on modern clusters of workstations.
  29
  30 This strategy caused some problems. It did not work on beowulf-type
  31 systems, where most of the nodes are in an isolated local network and
  32 do not have NFS access to the outside world.  And it made it
  33 complicated to read more than one database file.  Because specialized
  34 functions were used, the nodes could not read the full set of library
  35 file formats available to the other fasta programs.
  36
  37 These problems have been addressed by significantly changing the the
  38 way the pv4comp*/mp4comp* programs read the second "reference"
  39 library.  With these versions, both databases, but specifically the
  40 reference library, are read by a manager process.  The manager process
  41 then sends the sequences to the workers.  This solves problems with
  42 NFS reads from the workers (they don't do any), and uses exactly the
  43 same functions as the other fasta programs, so the full set of
  44 database formats can be read. In addition, the FASTLIBS database
  45 abbreviations are available. This also should also solve problems with
  46 searches of very long sequences (bacterial genomes); they can now be
  47 broken up into smaller pieces with the -N ##### option, as with
  48 fasta33/tfastx33.
  49
  50 Thus, you are encouraged to use the pv4comp*/mp4comp* versions of the
  51 programs, which should run more like fasta33.
  52
  53 ================
  54 Program summary:
  55
  56 Programs to produce conventional scores and alignments:
  57
  58 pv4compfa       protein vs protein, DNA vs DNA
  59 pv4compsw       protein vs protein, DNA vs DNA
  60 pv4compfx/      DNA vs protein
  61 pv4comptfx/y    protein vs DNA
  62
  63 Programs to summarize the effectiveness of a search (require
  64 super-family-labeled databases):
  65
  66 ps4compfa       protein vs protein, DNA vs DNA
  67 ps4compsw       protein vs protein, DNA vs DNA
  68 ps4compfx/      DNA vs protein
  69 ps4comptfx/y    protein vs DNA
  70
  71 Programs to report the scores and alignments of the highest scoring
  72 unrelated sequence (require super-family-labeled databases). These
  73 programs are used to evaluate the super-family labeling.
  74
  75 pu4compfa       protein vs protein, DNA vs DNA
  76 pu4compsw       protein vs protein, DNA vs DNA
  77 pucompfx/       DNA vs protein
  78 pu4comptfx/y    protein vs DNA
  79
  80 ================
  81 Release notes:
  82
  83 --> Aug. 4, 2000
  84
  85 Compiled and tested mp4compfa/mp4compsw programs.
  86
  87 --> July 22, 2000
  88
  89 First release of restructured p2_complib.c/p2_workcomp.c, which use
  90 the manager program to read both sequence databases and send the
  91 "reference database" to the workers.
  92