X-Git-Url: http://source.jalview.org/gitweb/?a=blobdiff_plain;ds=sidebyside;f=wiki%2FRIO.wiki;h=e14e5d1c6bb5a5e7eab08f2c85179222d3a8008c;hb=a6bf619d06e064f2267586263723dbac73a42583;hp=144d2f136e6f4f2cb011fda20d46c21ec5ae5dd9;hpb=12dcbed140836c15a8294e2b575f72d86909bbd5;p=jalview.git diff --git a/wiki/RIO.wiki b/wiki/RIO.wiki index 144d2f1..e14e5d1 100644 --- a/wiki/RIO.wiki +++ b/wiki/RIO.wiki @@ -6,54 +6,39 @@ RIO (Resampled Inference of Orthologs) is a method for automated phylogenomics based on explicit phylogenetic inference. RIO analyses are performed over resampled phylogenetic trees to estimate the reliability of orthology assignments. + == Usage == {{{ -java -Xmx2048m -cp forester.jar org.forester.application.rio [options] [outfile] +java -Xmx2048m -cp forester.jar org.forester.application.rio [options] [logfile] }}} + === Options === + * `-f=` : first gene tree to analyze (0-based index) (default: analyze all gene trees) + * `-l=` : last gene tree to analyze (0-based index) (default: analyze all gene trees) + * `-r=` : re-rooting method for gene trees, possible values or 'none', 'midpoint', or 'outgroup' (default: by minizming duplications) + * `-o=` : for rooting by outgroup, name of outgroup (external gene tree node) + * `-b` : to use SDIR instead of GSDIR (faster, but non-binary species trees are disallowed) - * -co: cutoff for ortholog output (default: 50) - * -t : file-name for output table of all vs. all ortholgy support - - * -q : name for query (sequence/node), if this is used, `[`outfile`]` is required as well - - * -s : sort (default: 2) - - * -u : to output ultra-paralogs (species specific expansions/paralogs) - - * -cu: cutoff for ultra-paralog output (default: 50) - -==== Note ==== - -Either output of all vs. all ortholgy support with -t=`<`output table`>` and/or output for one query sequence with -q and a `[`outfile`]` are required. - -==== Sort ==== - - * 0: orthologies - * 1: orthologies > super orthologies - * 2: super orthologies > orthologies - ==== Gene trees ==== The gene trees ideally are in [http://www.biomedcentral.com/1471-2105/10/356/ phyloXML] format, with taxonomy and sequence data in appropriate fields; but can also be in New Hamphshire (Newick) or Nexus format, as long as species information can be extracted from the gene names (e.g. "HUMAN" from "BCL2_HUMAN") ([http://forester.googlecode.com/files/gene_trees_rio.nh example]). - -==== Species tree ==== -Must be in [http://www.biomedcentral.com/1471-2105/10/356/ phyloXML] format ([http://forester.googlecode.com/files/species_tree_rio.xml example]). +All gene trees must be *completely binary*. -=== Examples === -`rio gene_trees_rio.nh species_tree_rio.xml outfile -q=D_HUMAN -t=outtable -u -cu=10 -co=10` +==== Species tree ==== +The species tree ideally is in [http://www.biomedcentral.com/1471-2105/10/356/ phyloXML] format, but can also be in New Hamphshire (Newick) or Nexus format ([http://forester.googlecode.com/files/species_tree_rio.xml example]). +The species tree is allowed to have nodes with more than two descendants (polytomies), as long as the (slower) GSDIR ([GSDI GSDI] re-rooting) algorithm is used. -`rio gene_trees.nh species.xml outfile -q=BCL2_HUMAN -t=outtable -u -cu=60 -co=60` -`rio gene_trees.nh species.xml -t=outtable` +=== Example === +`rio gene_trees.nh species.xml outtable.tsv log.txt` +`rio gene_trees.nh species.xml outtable.tsv log.txt -r=outgroup -o=XVL1_ECOLI` === Example files === * [http://forester.googlecode.com/files/gene_trees_rio.nh gene trees file] * [http://forester.googlecode.com/files/species_tree_rio.xml species tree file] - * [http://forester.googlecode.com/files/rio_outfile.txt output] (for query "D_HUMAN") - + == References ==