Programs-5.1.2/ 0000755 0001750 0000156 00000000000 12306362647 012164 5 ustar bneron sis Programs-5.1.2/INSTALL 0000644 0001750 0000156 00000002063 11672707410 013212 0 ustar bneron sis INSTALLATION INSTRUCTIONS FOR PROGRAMS DEFINITIONS IN MOBYLE
************************************************************
1 - copy the xml programs definitions in MOBYLEHOME/Services/Programs
2 - copy the xml in Entities directory in MOBYLEHOME/Services/Programs/Entities
3 - copy the xml in Env directory in MOBYLEHOME/Local/Services/Programs/Env
4 - modify the content of each xml you have copied in MOBYLEHOME/Local/Services/Programs/Env
to fit to your installation
5 - deploy your services ( see configuration guide: 1.13 Services management )
if you experienced some trouble with some programs definitions you can contact us
by mail at the following adress: mobyle-support@pasteur.fr.
6 - Mailing list:
=================
There is a mailing list dedicated to Mobyle server administrators, called
"mobyle-users". This list discusses new releases, related software announcements,
administration and development issues, etc. This is a moderated and low traffic list.
You can subscribe to Mobyle users at:
http://sympa.pasteur.fr/wws/subrequest/mobyle-users
Programs-5.1.2/biosed.xml 0000644 0001750 0000156 00000017664 12072525233 014161 0 ustar bneron sis
biosedEMBOSS6.3.1EMBOSSEuropean Molecular Biology Open Software SuiteRice,P. Longden,I. and Bleasby, A.EMBOSS: The European Molecular Biology Open Software Suite (2000) Rice,P. Longden,I. and Bleasby, A. Trends in Genetics 16, (6) pp276--277http://emboss.sourceforge.net/downloadhttp://emboss.sourceforge.netbiosedReplace or delete sequence sectionshttp://bioweb2.pasteur.fr/docs/EMBOSS/biosed.htmlhttp://emboss.sourceforge.net/docs/themessequence:editbiosede_inputInput sectione_sequencesequence optionSequenceEMBLFASTAGCGGENBANKNBRFCODATARAWSWISSPROTGFF1,n("", " -sequence=" + str(value))[value is not None]1e_requiredRequired sectione_targetregionSequence section to matchStringN("", " -targetregion=" + str(value))[value is not None and value!=vdef]2e_deleteDelete the target sequence sectionsBoolean0("", " -delete")[ bool(value) ]3e_replaceReplacement sequence sectionStringnot e_deleteA("", " -replace=" + str(value))[value is not None and value!=vdef]4e_additionalAdditional sectione_positionSequence position to match (value greater than or equal to 0)Integer0("", " -position=" + str(value))[value is not None and value!=vdef]Value greater than or equal to 0 is requiredvalue >= 05e_outputOutput sectione_outseqName of the output sequence file (e_outseq)Filenamebiosed.e_outseq("" , " -outseq=" + str(value))[value is not None]6e_osformat_outseqChoose the sequence output formatChoiceFASTAEMBLFASTAGCGGENBANKNBRFCODATARAWSWISSPROTGFF("", " -osformat=" + str(value))[value is not None and value!=vdef]7e_outseq_outoutseq_out optionSequencee_outseqautoTurn off any promptingString" -auto -stdout"8
Programs-5.1.2/drawtree.xml 0000644 0001750 0000156 00000072224 11724156742 014532 0 ustar bneron sis
drawtreedrawtreePlots an unrooted tree diagramhttp://bioweb2.pasteur.fr/docs/phylip/doc/drawtree.htmlDRAWTREE interactively plots an unrooted tree diagram, with many options
including orientation of tree and branches, label sizes and angles, margin
sizes. Particularly if you can use your computer screen to
preview the plot, you can very effectively adjust the details of the plotting
to get just the kind of plot you want.phylogeny:displaydisplay:treedrawtreeString"drawtree <drawtree.params""drawtree <drawtree.params"0treefileTree File (intree)TreeNEWICK"ln -s $treefile intree && ""ln -s "+str( treefile ) + " intree && "-10Tree in Newick format.
(A,(B,(H,(D,(J,(((G,E),(F,I)),C))))));
screen_typeString"0\\n""0\n"-1drawtree.paramsoptionsDrawtree optionsplotterWhich plotter or printer will the tree be drawn onChoiceLLMJWKHDBECOTPXFAZVR(defined $value and $value ne $vdef) ? "P\\n$value\\n" : ""("" , "P\n" + str(value) + "\n")[ value is not None and value != vdef ]2drawtree.paramsbitmap_optionsBitmap options$plotter =~ /^[XW]$/plotter in [ "X" , "W" ]xresX resolution (in pixels)Integer500"$value\\n"str( value ) + "\n"X resolution cannot exceed 2500 pixels$value <= 2500value <= 25003drawtree.paramsxyresY resolution (in pixels)Integer500"$value\\n"str( value ) + "\n"Y resolution cannot exceed 2500 pixels$value <= 2500value <= 25004drawtree.paramslaserjet_optionsLaserjet options$plotter eq "J"plotter == "J"laserjet_resolutionLaserjet resolutionChoice3123"$value\\n"str(value) + "\n"3drawtree.paramspcx_optionsPaintbrush options$plotter eq "P"plotter == "P"pcx_resolutionPaintbrush PCX resolutionChoice3123"$value\\n"str(value) + "\n"3drawtree.paramspov_optionsPOVRAY options$plotter eq "V"plotter == "V"pov_validateString"Y\\n""Y\n"2000drawtree.paramsps_optionsPostScript options$plotter eq "L"plotter == "L"fontFont (F)ChoiceTimes-RomanCourierHelveticaHelvetica-BoldHelvetica-BoldObliqueHelvetica-ObliqueHersheyTimesTimes-BoldTimes-BoldItalicTimes-ItalicTimes-Roman(defined $value and $value ne $vdef) ? "F\\n$value\\n" : ""("", "F\n"+str(value)+"\n")[value is not None and value != vdef]5drawtree.paramsvrml_optionsVRML options$plotter eq "Z"plotter == "Z"vrml_validateString"Y\\n""Y\n"2000drawtree.paramsray_optionsRayshade options$plotter eq "R"plotter == "R"ray_validateString"Y\\n""Y\n"2000drawtree.paramspreviewString"V\\nN\\n""V\nN\n"1drawtree.paramsbranch_lengthsUse branch lengths (B)Boolean1($value) ? "" : "B\\n"("B\n" , "")[ value ]5drawtree.paramsangleAngle of labels (L)ChoiceMA"L\\nA\\n""L\nA\n"R"L\\nR\\n""L\nR\n"F""""M""""5drawtree.paramsfixed_angleFixed angle: Are the labels to be plotted vertically (90), horizontally (0), or downwards (-90) (L)?Float$angle eq "" or $angle eq "F"angle == "" or angle == "F"0.0(defined $value and $value != $vdef) ? "L\\nF\\n$value\\n" : ""( "" , "L\nF\n"+ str( value ) +"\n")[ value is not None and value != vdef ]The value must be comprised between -90.0 and 90.0$value >= -90.0 and $value <= 90.0value >= -90.0 and value <= 90.07drawtree.paramsrotationRotation of tree (in degrees from 360 to -360) (R)Float0.0(defined $value and $value != $vdef) ? "R\\n$value\\n" : ""("" , "R\n" +str( value ) +"\n" )[ value is not None and value != vdef]-3603605drawtree.paramsarcAngle of arc for tree (in degrees from 0 to 360) (A)Float0.0(defined $value and $value != $vdef) ? "A\\n$value\\n" : ""( "" , "A\n" + str( value ) +"\n" )[ value is not None and value != vdef]03605drawtree.paramsiterateIterate to improve tree (I)ChoiceEE""""B"I\\n""I\n"N"I\\nI\\n""I\nI\n"5drawtree.paramsscaleScale of branch length (S)Float(defined $value) ? "S\\n$value\\n" : ""( "" , "S\n"+ str( value )+ "\n")[ value is not None ]5Default value: Automatically rescaleddrawtree.paramslabel_overlapTry to avoid label overlap (D)Boolean$iterate ne "N"iterate != "N"0($value) ? "D\\n" : ""( "" , "D\n" )[ value ]5drawtree.paramshorizontal_marginsHorizontal margins (M)Float1.73(defined $value and $value != $vdef) ? "M\\n$value\\n$vertical_margins\\n" : ""("" , "M\n" + str( value ) + "\n" + str(vertical_margins ) + "\n")[ value is not None and value != vdef ]10drawtree.paramsvertical_marginsVertical margins (M)Float2.24""""9drawtree.paramscharacter_heightRelative character height (C)Float0.3333(defined $value and $value != $vdef) ? "C\\n$value\\n" : ""("", "C\n" + str( value ) + "\n" )[ value is not None and value != vdef ]5drawtree.paramsplotfileGraphic tree filePictureBinary$plotter !~ /^[LMWX]$/plotter not in [ "L" , "M" , "W", "X" ]"plotfile""plotfile"psfileGraphic tree file ( postscript format )PostScriptBinary$plotter eq "L"plotter == "L"" && ln -s plotfile plotfile.ps"" && ln -s plotfile plotfile.ps"10"plotfile.ps""plotfile.ps"pictfileGraphic tree file ( pict format )PictureBinary$plotter eq "M"plotter == "M"" && ln -s plotfile plotfile.pict"" && ln -s plotfile plotfile.pict"10"plotfile.pict""plotfile.pict"xbmfileGraphic tree file ( xbm format )PictureBinary$plotter eq "X"plotter == "X"" && ln -s plotfile plotfile.xbm"" && ln -s plotfile plotfile.xbm"10"plotfile.xbm""plotfile.xbm"bmpfileGraphic tree file ( bmp format )PictureBinary$plotter eq "W"plotter == "W"" && ln -s plotfile plotfile.bmp"" && ln -s plotfile plotfile.bmp"10"plotfile.bmp""plotfile.bmp"confirmString"Y\\n""Y\n"1000drawtree.params
Programs-5.1.2/findkm.xml 0000644 0001750 0000156 00000020214 11672346320 014150 0 ustar bneron sis
findkmEMBOSS6.3.1EMBOSSEuropean Molecular Biology Open Software SuiteRice,P. Longden,I. and Bleasby, A.EMBOSS: The European Molecular Biology Open Software Suite (2000) Rice,P. Longden,I. and Bleasby, A. Trends in Genetics 16, (6) pp276--277http://emboss.sourceforge.net/downloadhttp://emboss.sourceforge.netfindkmCalculate and plot enzyme reaction datahttp://bioweb2.pasteur.fr/docs/EMBOSS/findkm.htmlhttp://emboss.sourceforge.net/docs/themessequence:enzyme:kineticsfindkme_inputInput sectione_infileEnzyme kinetics data (application-specific) fileEnzymeDataAbstractText("", " -infile=" + str(value))[value is not None]1e_advancedAdvanced sectione_plotS/v vs sBoolean1(" -noplot", "")[ bool(value) ]2e_outputOutput sectione_outfileName of the output file (e_outfile)Filenamefindkm.e_outfile("" , " -outfile=" + str(value))[value is not None]3e_outfile_outoutfile_out optionFindkmReportReporte_outfilee_graphlbChoose the e_graphlb output formatChoicepngpnggifcpspsmetadata(" -graphlb=" + str(vdef), " -graphlb=" + str(value))[value is not None and value!=vdef]4xy_goutfileName of the output graphFilenamefindkm_xygraph("" , " -goutfile=" + str(value))[value is not None]5xy_outgraph_pngGraph filePictureBinarye_graphlb == "png""*.png"xy_outgraph_gifGraph filePictureBinarye_graphlb == "gif""*.gif"xy_outgraph_psGraph filePostScriptBinarye_graphlb == "ps" or e_graphlb == "cps""*.ps"xy_outgraph_metaGraph filePictureBinarye_graphlb == "meta""*.meta"xy_outgraph_dataGraph fileTexte_graphlb == "data""*.dat"autoTurn off any promptingString" -auto -stdout"6
Programs-5.1.2/rnapdist.xml 0000644 0001750 0000156 00000031164 12265777220 014540 0 ustar bneron sis
rnapdistRNApdistCalculate distances between thermodynamic RNA secondary structures ensemblesStadler, Hofacker, BonhoefferBonhoeffer S, McCaskill J S, Stadler P F, Schuster P, (1993) RNA multistructure landscapes, Euro Biophys J:22,13-24RNApdist reads RNA sequences from stdin and calculates structure distances between the thermodynamic ensembles of their secondary structures. To do this the partition function and matrix of base pairing probabilities is computed for each sequence. The probability matrix is then condensed into a vector holding for each base the proba-bilities of being unpaired, paired upstream, or paired downstream, respectively. These profiles are compared by a standard alignment algorithm.sequence:nucleic:2D_structurestructure:2D_structureRNApdistseqRNA Sequences FileDNASequenceFASTA" < $value" " < " + str(value) 1000comparison_optionsComparison options2compareWhich comparisons (-X)Choiceppmfc(defined $value and $value ne $vdef)? " -X$value" : ""("", " -X" + str(value) )[value is not None and value != vdef]alignment_fileAlignment file (-B)Filename(defined $value)? " -B $value" : ""("" , " -B " + str(value))[ value is not None ]Print an 'alignment' with gaps of the structures, to show matching substructures. ( ) essentially upstream (downstream) paired bases{ } weakly upstream (downstream) paired bases| strongly paired bases without preference, weakly paired bases without preference. essentially unpaired bases.others_optionsOther options2temperatureRescale energy parameters to a temperature of temperature Celcius (-T)Integer37(defined $value and $value != $vdef)? " -T $value" : ""( "" , " -T " + str(value) )[ value is not None and value != vdef]tetraloopsDo not include special stabilizing energies for certain tetraloops (-4)Boolean0($value)? " -4" : ""( "" , " -4" )[ value ]danglingHow to treat dangling end energies for bases adjacent to helices in free ends and multiloops (-d)Choice-d1-d1-d-d2(defined $value and $value ne $vdef)? " $value" : ""( "" , " " + str(value) )[ value is not None and value != vdef]How to treat 'dangling end' energies for bases adjacent to helices in free ends and multiloops: Normally only unpaired bases can participate in at most one dangling end. With -d2 this check is ignored, this is the default for partition function folding (-p). -d ignores dangling ends altogether. Note that by default pf and mfe folding treat dangling ends differently, use -d2 (or -d) in addition to -p to ensure that both algorithms use the same energy model. The -d2 options is available for RNAfold, RNAeval, and RNAinverse only.noGUDo not allow GU pairs (-noGU)Boolean0($value)? " -noGU" : ""( "" , " -noGU" )[ value ]noCloseGUDo not allow GU pairs at the end of helices (-noCloseGU)Boolean0($value)? " -noCloseGU" : ""( "" , " -noCloseGU" )[ value ]nspNon standard pairs (comma seperated list) (-nsp)String(defined $value)? " -nsp $value" : "" ( "" , " -nsp " + str(value) )[ value is not None ]Allow other pairs in addition to the usual AU,GC,and GU pairs. pairs is a comma seperated list of additionally allowed pairs. If a the first character is a '-' then AB will imply that AB and BA are allowed pairs. e.g. RNAfold -nsp -GA will allow GA and AG pairs. Nonstandard pairs are given 0 stacking energy.parameterParameter file (-P)EnergyParameterFileAbstractText(defined $value)? " -P $value" : ""( "" , " -P " + str(value) )[ value is not None ]Read energy parameters from paramfile, instead of using the default parameter set. A sample parameterfile should accompany your distribution. See the RNAlib documentation for details on the file format.readseqString"readseq -f=19 -a $seq > $seq.tmp && (cp $seq $seq.orig && mv $seq.tmp $seq) ; ""readseq -f=19 -a "+ str(seq) + " > "+ str(seq) +".tmp && (cp "+ str(seq) +" "+ str(seq) +".orig && mv "+ str(seq) +".tmp "+ str(seq) +") ; "-10psfilesPostscript output filePostScriptBinary"*.ps""*.ps"alnoutfileResult alignment fileAlignmentdefined $alignment_filealignment_file is not None"$alignment_file"str(alignment_file)
Programs-5.1.2/restover.xml 0000644 0001750 0000156 00000031157 12072525233 014556 0 ustar bneron sis
restoverEMBOSS6.3.1EMBOSSEuropean Molecular Biology Open Software SuiteRice,P. Longden,I. and Bleasby, A.EMBOSS: The European Molecular Biology Open Software Suite (2000) Rice,P. Longden,I. and Bleasby, A. Trends in Genetics 16, (6) pp276--277http://emboss.sourceforge.net/downloadhttp://emboss.sourceforge.netrestoverFind restriction enzymes producing a specific overhanghttp://bioweb2.pasteur.fr/docs/EMBOSS/restover.htmlhttp://emboss.sourceforge.net/docs/themessequence:nucleic:restrictionrestovere_inputInput sectione_sequencesequence optionDNASequenceEMBLFASTAGCGGENBANKNBRFCODATARAWSWISSPROTGFF1,n("", " -sequence=" + str(value))[value is not None]1e_datafileRestriction enzyme data file (optional)RestrictionEnzymeDataAbstractText("", " -datafile=" + str(value))[value is not None]2e_mfileRestriction enzyme methylation data fileRestrictionEnzymeMethylationDataAbstractText("", " -mfile=" + str(value))[value is not None ]3e_requiredRequired sectione_seqcompOverlap sequenceString("", " -seqcomp=" + str(value))[value is not None]4e_advancedAdvanced sectione_minMinimum cuts per re (value from 1 to 1000)Integer1("", " -min=" + str(value))[value is not None and value!=vdef]Value greater than or equal to 1 is requiredvalue >= 1Value less than or equal to 1000 is requiredvalue <= 10005e_maxMaximum cuts per reInteger2000000000("", " -max=" + str(value))[value is not None and value!=vdef]6e_singleForce single site only cutsBoolean0("", " -single")[ bool(value) ]7e_threeprimeUse 3' overhang e.g. bamhi has ctag as a 5' overhang, and apai has ccgg as 3' overhang.Boolean0("", " -threeprime")[ bool(value) ]8e_bluntAllow blunt end cuttersBoolean1(" -noblunt", "")[ bool(value) ]9e_stickyAllow sticky end cuttersBoolean1(" -nosticky", "")[ bool(value) ]10e_ambiguityAllow ambiguous matchesBoolean1(" -noambiguity", "")[ bool(value) ]11e_plasmidAllow circular dnaBoolean0("", " -plasmid")[ bool(value) ]12e_methylationUse methylation dataBoolean0("", " -methylation")[ bool(value) ]13If this is set then RE recognition sites will not match methylated bases.e_commercialOnly enzymes with suppliersBoolean1(" -nocommercial", "")[ bool(value) ]14e_outputOutput sectione_htmlCreate html outputBoolean0("", " -html")[ bool(value) ]15e_limitLimits reports to one isoschizomerBoolean1(" -nolimit", "")[ bool(value) ]16e_alphabeticSort output alphabeticallyBoolean0("", " -alphabetic")[ bool(value) ]17e_fragmentsShow fragment lengthsBoolean0("", " -fragments")[ bool(value) ]18e_outfileName of the output file (e_outfile)Filenamerestover.e_outfile("" , " -outfile=" + str(value))[value is not None]19e_outfile_outoutfile_out optionRestoverReportReporte_outfileautoTurn off any promptingString" -auto -stdout"20
Programs-5.1.2/extractseq.xml 0000644 0001750 0000156 00000016423 12072525233 015067 0 ustar bneron sis
extractseqEMBOSS6.3.1EMBOSSEuropean Molecular Biology Open Software SuiteRice,P. Longden,I. and Bleasby, A.EMBOSS: The European Molecular Biology Open Software Suite (2000) Rice,P. Longden,I. and Bleasby, A. Trends in Genetics 16, (6) pp276--277http://emboss.sourceforge.net/downloadhttp://emboss.sourceforge.netextractseqExtract regions from a sequencehttp://bioweb2.pasteur.fr/docs/EMBOSS/extractseq.htmlhttp://emboss.sourceforge.net/docs/themessequence:editextractseqe_inputInput sectione_sequencesequence optionSequenceEMBLFASTAGCGGENBANKNBRFCODATARAWSWISSPROTGFF1,1("", " -sequence=" + str(value))[value is not None]1e_requiredRequired sectione_regionsRegions to extract (eg: 4-57,78-94)String("", " -regions=" + str(value))[value is not None]2Regions to extract.
A set of regions is specified by a set of pairs of positions.
The positions are integers.
They are separated by any non-digit, non-alpha character.
Examples of region specifications are:
24-45, 56-78
1:45, 67=99;765..888
1,5,8,10,23,45,57,99e_additionalAdditional sectione_separateWrite regions to separate sequencesBoolean0("", " -separate")[ bool(value) ]3If this is set true then each specified region is written out as a separate sequence. The name of the sequence is created from the name of the original sequence with the start and end positions of the range appended with underscore characters between them, eg: XYZ region 2 to 34 is written as: XYZ_2_34e_outputOutput sectione_outseqName of the output sequence file (e_outseq)Filenameextractseq.e_outseq("" , " -outseq=" + str(value))[value is not None]4e_osformat_outseqChoose the sequence output formatChoiceFASTAEMBLFASTAGCGGENBANKNBRFCODATARAWSWISSPROTGFF("", " -osformat=" + str(value))[value is not None and value!=vdef]5e_outseq_outoutseq_out optionSequencee_outseqautoTurn off any promptingString" -auto -stdout"6
Programs-5.1.2/README 0000644 0001750 0000156 00000000463 11022262432 013030 0 ustar bneron sis
This package contains XML definitions, for bioinformatics software and
databanks, to be integrated into the Mobyle framework.
It is developped at the Institut Pasteur, in the Software and
Databanks group; and distributed under the LGPLv2 license.
For further inquiries, please contact mobyle@pasteur.fr.
Programs-5.1.2/etandem.xml 0000644 0001750 0000156 00000025423 12072525233 014321 0 ustar bneron sis
etandemEMBOSS6.3.1EMBOSSEuropean Molecular Biology Open Software SuiteRice,P. Longden,I. and Bleasby, A.EMBOSS: The European Molecular Biology Open Software Suite (2000) Rice,P. Longden,I. and Bleasby, A. Trends in Genetics 16, (6) pp276--277http://emboss.sourceforge.net/downloadhttp://emboss.sourceforge.netetandemFinds tandem repeats in a nucleotide sequencehttp://bioweb2.pasteur.fr/docs/EMBOSS/etandem.htmlhttp://emboss.sourceforge.net/docs/themessequence:nucleic:repeatsetandeme_inputInput sectione_sequencesequence optionDNASequenceEMBLFASTAGCGGENBANKNBRFCODATARAWSWISSPROTGFF1,1("", " -sequence=" + str(value))[value is not None]1e_requiredRequired sectione_minrepeatMinimum repeat size (Integer, 2 or higher)Integer10("", " -minrepeat=" + str(value))[value is not None and value!=vdef]Value greater than or equal to 2 is requiredvalue >= 22e_maxrepeatMaximum repeat size (Integer, same as -minrepeat or higher)Integere_minrepeat("", " -maxrepeat=" + str(value))[value is not None]3Same as -minrepeate_advancedAdvanced sectione_thresholdThreshold scoreInteger20("", " -threshold=" + str(value))[value is not None and value!=vdef]4e_mismatchAllow n as a mismatchBoolean0("", " -mismatch")[ bool(value) ]5e_uniformAllow uniform consensusBoolean0("", " -uniform")[ bool(value) ]6e_outputOutput sectione_outfileName of the report fileFilenamereport.tan("" , " -outfile=" + str(value))[value is not None]7e_rformat_outfileChoose the report output formatChoiceTABLEDASGFFDBMOTIFDIFFSEQEMBLEXCELFEATTABLEGENBANKGFFLISTFILEMOTIFNAMETABLECODATAREGIONSSEQTABLESIMPLESRSSWISSTABLETAGSEQ("", " -rformat=" + str(value))[value is not None and value!=vdef]8e_outfile_outoutfile_out optionTexte_rformat_outfile in ['DASGFF', 'DBMOTIF', 'DIFFSEQ', 'EMBL', 'EXCEL', 'FEATTABLE', 'GENBANK', 'GFF', 'LISTFILE', 'MOTIF', 'NAMETABLE', 'CODATA', 'REGIONS', 'SEQTABLE', 'SIMPLE', 'SRS', 'SWISS', 'TABLE', 'TAGSEQ']e_outfilee_origfileName of the output file (e_origfile)Filenameoutfile.oldtan("" , " -origfile=" + str(value))[value is not None]9e_origfile_outorigfile_out optionTandemReportReporte_origfileautoTurn off any promptingString" -auto -stdout"10
Programs-5.1.2/pima.xml 0000644 0001750 0000156 00000030760 12073003734 013630 0 ustar bneron sis
pima1.40PIMAPattern-Induced Multi-sequence Alignment programR. D. Smith and T. F. SmithR. D. Smith and T. F. Smith. Pattern-induced multi-sequence alignment (PIMA) algorithm employing secondary structure-dependent gap penalties for use in comparative modelling. protein Engineering, vol5, number 1, pp 35-41, 1992ftp://ftp.ebi.ac.uk/pub/software/unix/pima/alignment:multiplepimasequenceSequences fileSequenceIGGENBANKNBRFEMBLCODATAFASTA" $value" " " + str(value) 3 Name of the input file containing the sequences to be clustered and multi-aligned. Sequences can be in any of the following formats: IG/Stanford, GenBank/GB, NBRF, EMBL, Pearson/Fasta, PIR/CODATA. The format of the output sequence files will match the format of this input file. cluster_nameCluster nameString" $value" " " + str(value)2An arbitrary name used to label the cluster.pima_paramsParametersref_seq_nameReference sequence nameStringdefined $sec_struc_seq_filenamesec_struc_seq_filename is not None(defined $value)? " $value" : "" ("", " "+str(value))[value is not None]4[optional; if specified, then sec_struct_seq_filename must also be specified]. Locus name of one of the primary sequences for which the secondary structure is in the file seq_struct_seq_filename.sec_struc_seq_filenamesec_struc_seq_filenameTextdefined $ref_seq_name_ref_seq_name is not None(defined $value)? " $value" : "" ( "" , " " + str(value) )[ value is not None ]5[optional; if specified, then ref_seq_name must also be specified] Name of a file containing secondary structure sequences for one or more of the primary sequences in the set. The secondary structure sequences in this file must be in one of the formats listed above (see sequence_filename, above). The locus name of each sequence must be the locus name of it's corresponding primary sequence with the suffix '.ss' (e.g. 1ldm.ss). An alpha-helix, 3-10 helix and beta-strand must be designated 'h', 'g', and 'e', respectively. All other characters in the secondary structure sequences will be ignored with respect to the the structure-dependent gap penalty. To allow gaps to be placed between the first and the second and the last elements of these structures, the first and last 2 elements of each should be changed to another character designation. In the secondary structure sequence file pdb-dssp.ss provided with this package, these end cap elements are designated 'i', 'f', and 'd', for alpha-helices, 3-10 helices and beta-strands, respectfully.
pima_optionsOptions1score_cutoffCluster score cutoff (-c) Float0.0(defined $value and $value != $vdef)? " -c $value " : "" ( "" , " -c " + str(value) )[ value is not None and value != vdef] Use a cluster score cutoff of number. This is the lowest match score to be used to incorporate a sequence into a cluster. The default value of 0.0 will force all input sequences into 1 cluster, but the final pattern may be completely degenerate.ext_gap_costGap extension penalty (-d) Integer(defined $value)? " -d $value" : "" ( "" , " -d " + str(value) )[ value is not None ]Use a length dependent gap penalty of number. This is the cost of extending a gap. The default value is dependent on the matrix file used.gap_open_costGap opening penalty (-i) Integer(defined $value)? " -i $value" : "" ( "" , " -i " + str(value) )[ value is not None ]Use a length independent gap penalty of number. This is the cost of opening a gap. The default value is dependent on the matrix file used.min_scoreMinimum local score (-l) Integer(defined $value)? " -l $value" : "" ( "" , " -l " + str(value) )[ value is not None ] Use minimum local score of number. This is the lowest score a quadrant can have before an attempt is made to join this local alignment with the local alignment at the previous step. The default value is dependent on the matrix file used.mat_fileMatrix file (-m) Choicepatgen.matpatgen.matclass1.matclass2.matuser(defined $value and $value ne $vdef and $value ne "user")? " -m $value" : "" ( "" , " -m " + str(value) )[ value is not None and value !=vdef and value !="user"]Use matrix file with the name file. The default matrix file is patgen.mat and is provided with this package. The matrix file class1.mat uses the original pima alphabet. The matrix file class2.mat is also provided, which is similar to the matrix file class1.mat but uses the new alphabet.user_mat_fileUser matrix file (-m) Text$mat_file eq "user"mat_file == "user"(defined $value)? " -m $value" : "" ( "" , " -m " + str(value) )[ value is not None ]User matrix file.not_num_extDo not use numerical extensions on each step of the alignment. (-n)Boolean0($value)? " -n" : "" ( "" , " -n" )[ value ]sec_struc_gap_costSecondary structure gap penalty (-t)Integer(defined $value)? " -t $value " : "" ( "" , " -t " + str(value) + " " )[ value is not None ]Use a secondary structure gap penalty of number. This is the cost of a gap at a position matching a secondary structure character. The default value is dependent on the matrix file used and is always 10 times the value of the length independent gap penalty of the matrix file.resultsOutput filesText"*.cluster"*.pattern"*.pima""*.cluster""*.pattern""*.pima"
Programs-5.1.2/infoalign.xml 0000644 0001750 0000156 00000062330 11672346320 014653 0 ustar bneron sis
infoalignEMBOSS6.3.1EMBOSSEuropean Molecular Biology Open Software SuiteRice,P. Longden,I. and Bleasby, A.EMBOSS: The European Molecular Biology Open Software Suite (2000) Rice,P. Longden,I. and Bleasby, A. Trends in Genetics 16, (6) pp276--277http://emboss.sourceforge.net/downloadhttp://emboss.sourceforge.netinfoalignDisplay basic information about a multiple sequence alignmenthttp://bioweb2.pasteur.fr/docs/EMBOSS/infoalign.htmlhttp://emboss.sourceforge.net/docs/themesalignment:multiple:informationinfoaligne_inputInput sectione_sequencesequence optionAlignmentFASTAMSFPAIRMARKX0MARKX1MARKX2MARKX3MARKX10SRSSRSPAIRSCOREUNKNOWNMULTIPLESIMPLEMATCH1,n("", " -sequence=" + str(value))[value is not None]1The sequence alignment to be displayed.e_matrixSimilarity scoring matrix fileChoicemobyle_nullmobyle_nullEBLOSUM30EBLOSUM35EBLOSUM40EBLOSUM45EBLOSUM50EBLOSUM55EBLOSUM60EBLOSUM62EBLOSUM62-12EBLOSUM65EBLOSUM70EBLOSUM75EBLOSUM80EBLOSUM85EBLOSUM90EBLOSUMNEDNAFULLEDNAMATEDNASIMPLEEPAM10EPAM100EPAM110EPAM120EPAM130EPAM140EPAM150EPAM160EPAM170EPAM180EPAM190EPAM20EPAM200EPAM210EPAM220EPAM230EPAM240EPAM250EPAM260EPAM270EPAM280EPAM290EPAM30EPAM300EPAM310EPAM320EPAM330EPAM340EPAM350EPAM360EPAM370EPAM380EPAM390EPAM40EPAM400EPAM410EPAM420EPAM430EPAM440EPAM450EPAM460EPAM470EPAM480EPAM490EPAM50EPAM500EPAM60EPAM70EPAM80EPAM90SSSUB("", " -matrix=" + str(value))[value is not None and value!=vdef]2This is the scoring matrix file used when comparing sequences. By default it is the file 'EBLOSUM62' (for proteins) or the file 'EDNAFULL' (for nucleic sequences). These files are found in the 'data' directory of the EMBOSS installation.e_refseqThe number or the name of the reference sequenceString0("", " -refseq=" + str(value))[value is not None and value!=vdef]3If you give the number in the alignment or the name of a sequence, it will be taken to be the reference sequence. The reference sequence is the one against which all the other sequences are compared. If this is set to 0 then the consensus sequence will be used as the reference sequence. By default the consensus sequence is used as the reference sequence.e_advancedAdvanced sectione_pluralityPlurality check % for consensus (value from 0.0 to 100.0)Float50.0("", " -plurality=" + str(value))[value is not None and value!=vdef]Value greater than or equal to 0.0 is requiredvalue >= 0.0Value less than or equal to 100.0 is requiredvalue <= 100.04Set a cut-off for the % of positive scoring matches below which there is no consensus. The default plurality is taken as 50% of the total weight of all the sequences in the alignment.e_identityRequired % of identities at a position fro consensus (value from 0.0 to 100.0)Float0.0("", " -identity=" + str(value))[value is not None and value!=vdef]Value greater than or equal to 0.0 is requiredvalue >= 0.0Value less than or equal to 100.0 is requiredvalue <= 100.05Provides the facility of setting the required number of identities at a position for it to give a consensus. Therefore, if this is set to 100% only columns of identities contribute to the consensus.e_outputOutput sectione_outfileName of the output file (e_outfile)Filenameoutfile.infoalign("" , " -outfile=" + str(value))[value is not None]6If you enter the name of a file here then this program will write the sequence details into that file.e_outfile_outoutfile_out optionInfoalignReportReporte_outfilee_htmlFormat output as an html tableBoolean0("", " -html")[ bool(value) ]7e_onlyDisplay the specified columnsBoolean0("", " -only")[ bool(value) ]8This is a way of shortening the command line if you only want a few things to be displayed. Instead of specifying:
'-nohead -nousa -noname -noalign -nogaps -nogapcount -nosimcount -noidcount -nodiffcount -noweight'
to get only the sequence length output, you can specify
'-only -seqlength'e_headingDisplay column headingsBooleannot e_only0("", " -heading")[ bool(value) ]9e_usaDisplay the usa of the sequenceBooleannot e_only0("", " -usa")[ bool(value) ]10e_nameDisplay 'name' columnBooleannot e_only0("", " -name")[ bool(value) ]11e_seqlengthDisplay 'seqlength' columnBooleannot e_only0("", " -seqlength")[ bool(value) ]12e_alignlengthDisplay 'alignlength' columnBooleannot e_only0("", " -alignlength")[ bool(value) ]13e_gapsDisplay number of gapsBooleannot e_only0("", " -gaps")[ bool(value) ]14e_gapcountDisplay number of gap positionsBooleannot e_only0("", " -gapcount")[ bool(value) ]15e_idcountDisplay number of identical positionsBooleannot e_only0("", " -idcount")[ bool(value) ]16e_simcountDisplay number of similar positionsBooleannot e_only0("", " -simcount")[ bool(value) ]17e_diffcountDisplay number of different positionsBooleannot e_only0("", " -diffcount")[ bool(value) ]18e_changeDisplay % number of changed positionsBooleannot e_only0("", " -change")[ bool(value) ]19e_weightDisplay 'weight' columnBooleannot e_only0("", " -weight")[ bool(value) ]20e_descriptionDisplay 'description' columnBooleannot e_only0("", " -description")[ bool(value) ]21autoTurn off any promptingString" -auto -stdout"22
Programs-5.1.2/gblocks.xml 0000644 0001750 0000156 00000043735 12265214442 014337 0 ustar bneron sis
gblocks0.91bGblocksSelection of conserved blocks from multiple sequence alignmentJose CastresanaCastresana, J. (2000). Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis. Molecular Biology and Evolution 17, 540-552Talavera, G., and Castresana, J. (2007). Improvement of phylogenies after removing divergent and ambiguously aligned blocks from protein sequence alignments. Systematic Biology 56, 564-577http://bioweb2.pasteur.fr/docs/gblocks/http://molevol.cmima.csic.es/castresana/Gblocks.htmlhttp://molevol.cmima.csic.es/castresana/Gblocks.htmlalignment:multiple:informationgblocksinputInput2infileAlignmentProteinDNAAlignmentFASTA" $value -g"" "+str(value)+" -g"FASTA-formatted alignments are accepted by gblocks.
There is no limit for the number of sequences or positions in the alignment
as long as there is enough memory available for the program.input_typeType of sequence (-t)ChoiceautoautoProteinDNA(defined $value and $value ne $vdef) ? " -t=$value" : ""("", " -t="+str(value))[value is not None and value!=vdef] Automatic: For FASTA-formatted alignments, gblocks assigned the type Protein automatically. In other cases, precise.sel_optionsOptions for selection3MNOSFACPMinimum Number Of Sequences For A Conserved Position(-b1)Integer(defined $value) ? " -b1=$value" : ""("" , " -b1="+str(value))[value is not None]Any integer BIGGER than half the number of sequences
and SMALLER OR EQUAL than the total number of sequences. If you put an integer not in this interval, gblocks will run the program with default value, check the warning in output. By default, = 50% of the number of sequences +1.MNOSFOFPMinimum Number Of Sequences For A Flank Position(-b2)Integer(defined $value) ? " -b2=$value" : ""("" , " -b2="+str(value))[value is not None]Any integer EQUAL OR SMALLER than Minimum Number Of Sequences For A Conserved Position. If you put an integer not in this interval, gblocks will run the program with default value, check the warning in output. By default, = 85% of the number of sequences.MLOABMinimum Length Of A Block(-b4)Integer10(defined $value and $value != $vdef) ? " -b4=$value" : ""("" , " -b4="+str(value))[value is not None and value!=vdef]Any integer equal or bigger than 2 (= 10 by default).gapAllowed Gap Positions (-b5)Choicennha(defined $value and $value ne $vdef) ? " -b5=$value" : ""("", " -b5="+str(value))[value is not None and value!=vdef] None(default value): No gap positions are allowed in the final alignment. With Half: Positions with a gap in less than 50% of the sequences selected in the final alignment (if they're in an appropriate block). All: Positions with gaps are not treated differently from others positions.MNOCNPMaximum Number Of Contiguous Nonconserved Positions(-b3)Integer8(defined $value and $value != $vdef) ? " -b3=$value" : ""("" , " -b3="+str(value))[value is not None and value!=vdef]All segment with contiguous non-conserved positions bigger than this value is rejected (=8 by default).saving_optionsSaving options4sbSelected Blocks (-s)Boolean1($value) ? "" , " -s=n"(" -s=n" , "")[value]Saving or Not the alignment file with the selected blocks.Res_ParamResults And Parameters File(-p)Choiceyytsn(defined $value and $value ne $vdef) ? " -p=$value" : ""("" , " -p="+str(value))[value is not None and value!=vdef]Saving an HTML file (Yes), saving a text file (Text), saving a short text file (Short Text) or not saving any of them (No). With the first two options the original file is shown with the selected blocks underlined and, in the HTML file, with colored conserved positions.The parameters used and the flank positions of the selected blocks are written in these files.PerLineCharacters Per Line In Results And Parameters File (>50)(-v)Integer60(defined $value and $value != $vdef) ? " -v=$value" : ""("" , " -v="+str(value))[value is not None and value!=vdef]Number of characters per line in the alignment shown in the Results And Parameters File.Any integer bigger than 50 is accepted (60 by default).ncNonConserved Blocks (-n)Boolean0($value) ? "-n=y" , ""("" , " -n=y")[value]Saving or Not a the alignment file with the blocks NOT selected (i.e., the complementary of the selected blocks).uaUngapped Alignment (-u)Boolean0($value) ? "-u=y" , ""("" , " -u=y")[value]Saving or Not the alignment file where only gap positions (i.e. positions with at least one gap) have been removed.maskMask File With The Selected Blocks (-k)Boolean0($value) ? "-k=y" , ""("" , " -k=y")[value]Saving or Not a file that can be read by the program SeqPup.In this file, conserved positions as defined by Gblocks are shadowed and selected blocks underlinedpsPostscript File With The Selected Blocks (-d)Boolean0($value) ? "-d=y" , ""("" , " -d=y")[value]Saving or Not a Postscript file that shows schematically the selected blocks. You need a postscript viewer or editor to view this file.Res_Param_HTMLResults and Params (HTML format)Text$Res_Param ne 't' and $Res_Param ne 's' and $Res_Param ne 'n'Res_Param != 't' and Res_Param != 's' and Res_Param != 'n'"*-gb.htm""*-gb.htm"Res_Param_tResults and Params (Text format)Text$Res_Param eq 't'Res_Param == 't'"*-gb.txt"Res_Param_stResults and Params (ShortText format)Text$Res_Param eq 's'Res_Param == 's'"*-gb.txts""*-gb.txts"alignment_resultAlignment file with selected blocksAlignmentFASTA$sbsb"*-gb""*-gb"nc_fileNonConserved Blocks FileAlignmentFASTA$ncnc"*-gbComp""*-gbComp"ua_fileUngapped Alignment FileAlignmentFASTA$uaua"*--""*--"mask_fileMask File With The Selected BlocksText$maskmask"*-gbMask""*-gbMask"ps_filePostscript FileText$psps"*-gbPS""*-gbPS"
Programs-5.1.2/needleall.xml 0000644 0001750 0000156 00000062022 12072525233 014625 0 ustar bneron sis
needleallEMBOSS6.3.1EMBOSSEuropean Molecular Biology Open Software SuiteRice,P. Longden,I. and Bleasby, A.EMBOSS: The European Molecular Biology Open Software Suite (2000) Rice,P. Longden,I. and Bleasby, A. Trends in Genetics 16, (6) pp276--277http://emboss.sourceforge.net/downloadhttp://emboss.sourceforge.netneedleallMany-to-many pairwise alignments of two sequence setshttp://bioweb2.pasteur.fr/docs/EMBOSS/needleall.htmlhttp://emboss.sourceforge.net/docs/themesalignment:pairwise:globalneedlealle_inputInput sectione_asequenceasequence optionSequenceEMBLFASTAGCGGENBANKNBRFCODATARAWSWISSPROTGFF2,n("", " -asequence=" + str(value))[value is not None]1e_bsequencebsequence optionSequenceEMBLFASTAGCGGENBANKNBRFCODATARAWSWISSPROTGFF1,n("", " -bsequence=" + str(value))[value is not None]2e_datafileMatrix fileChoicemobyle_nullmobyle_nullEBLOSUM30EBLOSUM35EBLOSUM40EBLOSUM45EBLOSUM50EBLOSUM55EBLOSUM60EBLOSUM62EBLOSUM62-12EBLOSUM65EBLOSUM70EBLOSUM75EBLOSUM80EBLOSUM85EBLOSUM90EBLOSUMNEDNAFULLEDNAMATEDNASIMPLEEPAM10EPAM100EPAM110EPAM120EPAM130EPAM140EPAM150EPAM160EPAM170EPAM180EPAM190EPAM20EPAM200EPAM210EPAM220EPAM230EPAM240EPAM250EPAM260EPAM270EPAM280EPAM290EPAM30EPAM300EPAM310EPAM320EPAM330EPAM340EPAM350EPAM360EPAM370EPAM380EPAM390EPAM40EPAM400EPAM410EPAM420EPAM430EPAM440EPAM450EPAM460EPAM470EPAM480EPAM490EPAM50EPAM500EPAM60EPAM70EPAM80EPAM90SSSUB("", " -datafile=" + str(value))[value is not None and value!=vdef]3This is the scoring matrix file used when comparing sequences. By default it is the file 'EBLOSUM62' (for proteins) or the file 'EDNAFULL' (for nucleic sequences). These files are found in the 'data' directory of the EMBOSS installation.e_requiredRequired sectione_gapopenGap opening penalty (Floating point number from 1.0 to 100.0)Float("", " -gapopen=" + str(value))[value is not None]Value greater than or equal to 0.0 is requiredvalue >= 0.0Value less than or equal to 100.0 is requiredvalue <= 100.04The gap open penalty is the score taken away when a gap is created. The best value depends on the choice of comparison matrix. The default value assumes you are using the EBLOSUM62 matrix for protein sequences, and the EDNAFULL matrix for nucleotide sequences.e_gapextendGap extension penalty (Floating point number from 0.0 to 10.0)Float("", " -gapextend=" + str(value))[value is not None]Value greater than or equal to 0.0 is requiredvalue >= 0.0Value less than or equal to 10.0 is requiredvalue <= 10.05The gap extension, penalty is added to the standard gap penalty for each base or residue in the gap. This is how long gaps are penalized. Usually you will expect a few long gaps rather than many short gaps, so the gap extension penalty should be lower than the gap penalty. An exception is where one or both sequences are single reads with possible sequencing errors in which case you would expect many single base gaps. You can get this result by setting the gap open penalty to zero (or very low) and using the gap extension penalty to control gap scoring.e_additionalAdditional sectione_endweightApply end gap penalties.Boolean0("", " -endweight")[ bool(value) ]6e_endopenEnd gap opening penalty (Floating point number from 1.0 to 100.0)Float("", " -endopen=" + str(value))[value is not None]Value greater than or equal to 0.0 is requiredvalue >= 0.0Value less than or equal to 100.0 is requiredvalue <= 100.07The end gap open penalty is the score taken away when an end gap is created. The best value depends on the choice of comparison matrix. The default value assumes you are using the EBLOSUM62 matrix for protein sequences, and the EDNAFULL matrix for nucleotide sequences.e_endextendEnd gap extension penalty (Floating point number from 0.0 to 10.0)Float("", " -endextend=" + str(value))[value is not None]Value greater than or equal to 0.0 is requiredvalue >= 0.0Value less than or equal to 10.0 is requiredvalue <= 10.08The end gap extension, penalty is added to the end gap penalty for each base or residue in the end gap.e_minscoreMinimum alignment score (Floating point number from -10.0 to 100.0)Float0("", " -minscore=" + str(value))[value is not None and value!=vdef]Value greater than or equal to -10.0 is requiredvalue >= -10.0Value less than or equal to 100.0 is requiredvalue <= 100.09Minimum alignment score to report an alignment.e_outputOutput sectione_briefBrief identity and similarityBoolean1(" -nobrief", "")[ bool(value) ]10Brief identity and similaritye_outfileName of the output alignment fileFilenameneedleall.align("" , " -outfile=" + str(value))[value is not None]11e_aformat_outfileChoose the alignment output formatChoiceSCOREFASTAMSFPAIRMARKX0MARKX1MARKX2MARKX3MARKX10SRSSRSPAIRSCOREUNKNOWNMULTIPLESIMPLEMATCH("", " -aformat=" + str(value))[value is not None and value!=vdef]11e_outfile_outoutfile_out optionAlignmente_aformat_outfile in ['FASTA', 'MSF']e_outfilee_outfile_out2outfile_out2 optionTexte_aformat_outfile in ['PAIR', 'MARKX0', 'MARKX1', 'MARKX2', 'MARKX3', 'MARKX10', 'SRS', 'SRSPAIR', 'SCORE', 'UNKNOWN', 'MULTIPLE', 'SIMPLE', 'MATCH']e_outfilee_errorfileerrorfile optionFilenameneedleall.e_errorfile("" , " -errorfile=" + str(value))[value is not None]12Error file to be written toe_errorfile_outerrorfile_out optionNeedleallErrorReporte_errorfileautoTurn off any promptingString" -auto -stdout"13
Programs-5.1.2/toppred.xml 0000644 0001750 0000156 00000031226 12265214442 014360 0 ustar bneron sis
toppred1.10TopPredTopology prediction of membrane proteinsHeijne, Wallin, Claros, Deveaud, Schuerervon Heijne, G. (1992) Membrane Protein Structure Prediction: Hydrophobicity Analysis and the 'Positive Inside' Rule. J.Mol.Biol. 225, 487-494.Claros, M.G., and von Heijne, G. (1994) TopPred II: An Improved Software For Membrane Protein Structure Predictions. CABIOS 10, 685-686.Deveaud and Schuerer (Institut Pasteur) new implementation of the original toppred program, based on G. von Heijne algorithm.http://bioweb2.pasteur.fr/docs/toppred/toppred.pdfftp://ftp.pasteur.fr/pub/gensoft/projects/toppred/sequence:protein:2D_structurestructure:2D_structuretoppredCommandString"toppred""toppred"0querySequenceProteinSequenceFASTA1,n" $value"" "+ str( value )10graph_outputProduce hydrophobicity graph image (-g)Boolean1topo_outputProduce image of each topology (-t)Boolean0($value) ? "" : " -t none"(" -t none","")[ value ]7controlControl optionsscaleHydrophobicity scale (-H)ChoiceGES-scaleKD-scaleGVH-scaleGES-scale(defined $value and $value ne $vdef) ? "-H $value" : ""( "" , " -H " + str( value ) )[ value is not None and value != vdef ]1organismOrganism: eukaryot (default is prokaryot) (-e)Boolean0($value) ? " -e" : ""( "" , " -e" )[ value ]1certainCertain cutoff (-c)Float1.0(defined $value and $value != $vdef) ? " -c $value" : ""( "" , " -c " + str(value) ) [ value is not None and value!= vdef ]Certain cutoff must be greater than putative cutoff$certain > $putativecertain > putative2putativePutative cutoff (-p)Float0.6(defined $value and $value != $vdef) ? "-p $value" : ""( "" , " -p " + str(value) )[ value is not None and value != vdef ]Putative cutoff must be not be greater than certain cutoff$certain > $putativecertain > putative2coreCore window size: (-n)Integer10(defined $value and $value != $vdef) ? " -n $value" : ""( "" , " -n " + str(value) )[ value is not None and value != vdef ]2triangleWedge window size: (-q)Integer5(defined $value and $value != $vdef) ? " -q $value" : ""( "" , " -q " + str(value) )[ value is not None and value != vdef ]2loop_lengthCritical loop length (-s)Integer60(defined $value and $value != $vdef) ? " -s $value" : ""( "" , " -s " + str(value) )[ value is not None and value != vdef ]2Segment_distanceCritical transmembrane spacer (-d)Integer2(defined $value and $value != $vdef) ? " -d $value" : ""( "" , " -d " + str( value ) )[ value is not None and value!= vdef ]2output_optionsOutput optionsoutformatOutput format (-O)Choicenewnewhtmlold(defined $value and $value ne $vdef) ? " -O $value" : ""( "" , " -O " + str(value) )[ value is not None and value != vdef ]5profile_formatHydrophobicity Profile file format (-g)Choice$graph_output graph_output pngpngpsppm($graph_output) ? " -g $value" : " -g none"( " -g none" , " -g " + str(value) )[ graph_output ]7graphicfilesGraphic output filesPictureBinary0,n$graph_output graph_output *.$profile_format'*.' + profile_formathydrophobicity_filesHydrophobicity output filesText1,n*.hydro*'*.hydro*'html_fileOutput file in html formatToppredHtmlReportReportoutformat eq 'html'outformat == 'html'*.html'*.html'
Programs-5.1.2/newseq.xml 0000644 0001750 0000156 00000016450 12072525233 014206 0 ustar bneron sis
newseqEMBOSS6.3.1EMBOSSEuropean Molecular Biology Open Software SuiteRice,P. Longden,I. and Bleasby, A.EMBOSS: The European Molecular Biology Open Software Suite (2000) Rice,P. Longden,I. and Bleasby, A. Trends in Genetics 16, (6) pp276--277http://emboss.sourceforge.net/downloadhttp://emboss.sourceforge.netnewseqCreate a sequence file from a typed-in sequencehttp://bioweb2.pasteur.fr/docs/EMBOSS/newseq.htmlhttp://emboss.sourceforge.net/docs/themessequence:editnewseqe_inputInput sectione_nameName of the sequenceString("", " -name=" + str(value))[value is not None]1The name of of the sequence should be a single word that you will use to identify the sequence. It should have no (or few) punctuation characters in it.e_descriptionDescription of the sequenceString("", " -description=" + str(value))[value is not None]2Enter any description of the sequence that you require.e_typeType of sequenceChoiceNNP("", " -type=" + str(value))[value is not None and value!=vdef]3e_sequenceEnter the sequenceString("", " -sequence=" + str(value))[value is not None]4The sequence itself.
Because of the limitation of the operating system, you will only be able to type in a short sequence of (typically) 250 characters, or so.
The keyboard will beep at you when you have reached this limit and you will not be able to press the RETURN/ENTER key until you have deleted a few characters.e_outputOutput sectione_outseqName of the output sequence file (e_outseq)Filenamenewseq.e_outseq("" , " -outseq=" + str(value))[value is not None]5e_osformat_outseqChoose the sequence output formatChoiceFASTAEMBLFASTAGCGGENBANKNBRFCODATARAWSWISSPROTGFF("", " -osformat=" + str(value))[value is not None and value!=vdef]6e_outseq_outoutseq_out optionSequencee_outseqautoTurn off any promptingString" -auto -stdout"7
Programs-5.1.2/isochore.xml 0000644 0001750 0000156 00000022774 12072525233 014525 0 ustar bneron sis
isochoreEMBOSS6.3.1EMBOSSEuropean Molecular Biology Open Software SuiteRice,P. Longden,I. and Bleasby, A.EMBOSS: The European Molecular Biology Open Software Suite (2000) Rice,P. Longden,I. and Bleasby, A. Trends in Genetics 16, (6) pp276--277http://emboss.sourceforge.net/downloadhttp://emboss.sourceforge.netisochorePlots isochores in DNA sequenceshttp://bioweb2.pasteur.fr/docs/EMBOSS/isochore.htmlhttp://emboss.sourceforge.net/docs/themessequence:nucleic:compositionisochoree_inputInput sectione_sequencesequence optionDNASequenceEMBLFASTAGCGGENBANKNBRFCODATARAWSWISSPROTGFF1,1("", " -sequence=" + str(value))[value is not None]1e_additionalAdditional sectione_windowWindow size (value greater than or equal to 1)Integer1000("", " -window=" + str(value))[value is not None and value!=vdef]Value greater than or equal to 1 is requiredvalue >= 12e_shiftShift increment (value greater than or equal to 1)Integer100("", " -shift=" + str(value))[value is not None and value!=vdef]Value greater than or equal to 1 is requiredvalue >= 13e_outputOutput sectione_outfileName of the output file (e_outfile)Filenameoutfile.iso("" , " -outfile=" + str(value))[value is not None]4e_outfile_outoutfile_out optionIsochoreReportReporte_outfilee_graphChoose the e_graph output formatChoicepngpnggifcpspsmetadata(" -graph=" + str(vdef), " -graph=" + str(value))[value is not None and value!=vdef]5xy_goutfileName of the output graphFilenameisochore_xygraph("" , " -goutfile=" + str(value))[value is not None]6xy_outgraph_pngGraph filePictureBinarye_graph == "png""*.png"xy_outgraph_gifGraph filePictureBinarye_graph == "gif""*.gif"xy_outgraph_psGraph filePostScriptBinarye_graph == "ps" or e_graph == "cps""*.ps"xy_outgraph_metaGraph filePictureBinarye_graph == "meta""*.meta"xy_outgraph_dataGraph fileTexte_graph == "data""*.dat"autoTurn off any promptingString" -auto -stdout"7
Programs-5.1.2/silent.xml 0000644 0001750 0000156 00000021477 12072525233 014207 0 ustar bneron sis
silentEMBOSS6.3.1EMBOSSEuropean Molecular Biology Open Software SuiteRice,P. Longden,I. and Bleasby, A.EMBOSS: The European Molecular Biology Open Software Suite (2000) Rice,P. Longden,I. and Bleasby, A. Trends in Genetics 16, (6) pp276--277http://emboss.sourceforge.net/downloadhttp://emboss.sourceforge.netsilentFind restriction sites to insert (mutate) with no translation changehttp://bioweb2.pasteur.fr/docs/EMBOSS/silent.htmlhttp://emboss.sourceforge.net/docs/themessequence:nucleic:restrictionsilente_inputInput sectione_sequencesequence optionDNASequenceEMBLFASTAGCGGENBANKNBRFCODATARAWSWISSPROTGFF1,1("", " -sequence=" + str(value))[value is not None]1e_requiredRequired sectione_enzymesComma separated enzyme listStringall("", " -enzymes=" + str(value))[value is not None and value!=vdef]2e_outputOutput sectione_sshowDisplay untranslated sequenceBoolean0("", " -sshow")[ bool(value) ]3e_tshowDisplay translated sequenceBoolean0("", " -tshow")[ bool(value) ]4e_allmutDisplay all mutationsBoolean0("", " -allmut")[ bool(value) ]5e_outfileName of the report fileFilenamesilent.report("" , " -outfile=" + str(value))[value is not None]6e_rformat_outfileChoose the report output formatChoiceTABLEDASGFFDBMOTIFDIFFSEQEMBLEXCELFEATTABLEGENBANKGFFLISTFILEMOTIFNAMETABLECODATAREGIONSSEQTABLESIMPLESRSSWISSTABLETAGSEQ("", " -rformat=" + str(value))[value is not None and value!=vdef]7e_outfile_outoutfile_out optionTexte_rformat_outfile in ['DASGFF', 'DBMOTIF', 'DIFFSEQ', 'EMBL', 'EXCEL', 'FEATTABLE', 'GENBANK', 'GFF', 'LISTFILE', 'MOTIF', 'NAMETABLE', 'CODATA', 'REGIONS', 'SEQTABLE', 'SIMPLE', 'SRS', 'SWISS', 'TABLE', 'TAGSEQ']e_outfileautoTurn off any promptingString" -auto -stdout"8
Programs-5.1.2/backtranseq.xml 0000644 0001750 0000156 00000106545 12072525233 015207 0 ustar bneron sis
backtranseqEMBOSS6.3.1EMBOSSEuropean Molecular Biology Open Software SuiteRice,P. Longden,I. and Bleasby, A.EMBOSS: The European Molecular Biology Open Software Suite (2000) Rice,P. Longden,I. and Bleasby, A. Trends in Genetics 16, (6) pp276--277http://emboss.sourceforge.net/downloadhttp://emboss.sourceforge.netbacktranseqBack-translate a protein sequence to a nucleotide sequencehttp://bioweb2.pasteur.fr/docs/EMBOSS/backtranseq.htmlhttp://emboss.sourceforge.net/docs/themessequence:nucleic:translationsequence:protein:compositionbacktranseqe_inputInput sectione_sequencesequence optionProteinSequenceEMBLFASTAGCGGENBANKNBRFCODATARAWSWISSPROTGFF1,n("", " -sequence=" + str(value))[value is not None]1e_cfilecfile optionChoiceEhuman.cutEacc.cutEacica.cutEadenovirus5.cutEadenovirus7.cutEagrtu.cutEaidlav.cutEanasp.cutEani.cutEani_h.cutEanidmit.cutEarath.cutEasn.cutEath.cutEatu.cutEavi.cutEazovi.cutEbacme.cutEbacst.cutEbacsu.cutEbacsu_high.cutEbja.cutEbly.cutEbme.cutEbmo.cutEbna.cutEbommo.cutEbov.cutEbovin.cutEbovsp.cutEbpphx.cutEbraja.cutEbrana.cutEbrare.cutEbst.cutEbsu.cutEbsu_h.cutEcac.cutEcaeel.cutEcal.cutEcanal.cutEcanfa.cutEcaucr.cutEccr.cutEcel.cutEchi.cutEchick.cutEchicken.cutEchisp.cutEchk.cutEchlre.cutEchltr.cutEchmp.cutEchnt.cutEchos.cutEchzm.cutEchzmrubp.cutEcloab.cutEcpx.cutEcre.cutEcrigr.cutEcrisp.cutEctr.cutEcyapa.cutEdayhoff.cutEddi.cutEddi_h.cutEdicdi.cutEdicdi_high.cutEdog.cutEdro.cutEdro_h.cutEdrome.cutEdrome_high.cutEdrosophila.cutEeca.cutEeco.cutEeco_h.cutEecoli.cutEecoli_high.cutEemeni.cutEemeni_high.cutEemeni_mit.cutEerwct.cutEf1.cutEfish.cutEfmdvpolyp.cutEhaein.cutEhalma.cutEhalsa.cutEham.cutEhha.cutEhin.cutEhma.cutEhorvu.cutEhum.cutEhuman.cutEkla.cutEklepn.cutEklula.cutEkpn.cutElacdl.cutElla.cutElyces.cutEmac.cutEmacfa.cutEmaize.cutEmaize_chl.cutEmam_h.cutEmammal_high.cutEmanse.cutEmarpo_chl.cutEmedsa.cutEmetth.cutEmixlg.cutEmouse.cutEmsa.cutEmse.cutEmta.cutEmtu.cutEmus.cutEmussp.cutEmva.cutEmyctu.cutEmze.cutEmzecp.cutEncr.cutEneigo.cutEneu.cutEneucr.cutEngo.cutEoncmy.cutEoncsp.cutEorysa.cutEorysa_chl.cutEpae.cutEpea.cutEpet.cutEpethy.cutEpfa.cutEphavu.cutEphix174.cutEphv.cutEphy.cutEpig.cutEplafa.cutEpolyomaa2.cutEpombe.cutEpombecai.cutEpot.cutEppu.cutEprovu.cutEpse.cutEpseae.cutEpsepu.cutEpsesm.cutEpsy.cutEpvu.cutErab.cutErabbit.cutErabit.cutErabsp.cutErat.cutEratsp.cutErca.cutErhile.cutErhime.cutErhm.cutErhoca.cutErhosh.cutEric.cutErle.cutErme.cutErsp.cutEsalsa.cutEsalsp.cutEsalty.cutEsau.cutEschma.cutEschpo.cutEschpo_cai.cutEschpo_high.cutEsco.cutEserma.cutEsgi.cutEsheep.cutEshp.cutEshpsp.cutEsli.cutEslm.cutEsma.cutEsmi.cutEsmu.cutEsoltu.cutEsoy.cutEsoybn.cutEspi.cutEspiol.cutEspn.cutEspo.cutEspo_h.cutEspu.cutEsta.cutEstaau.cutEstrco.cutEstrmu.cutEstrpn.cutEstrpu.cutEsty.cutEsus.cutEsv40.cutEsyhsp.cutEsynco.cutEsyncy.cutEsynsp.cutEtbr.cutEtcr.cutEter.cutEtetsp.cutEtetth.cutEtheth.cutEtob.cutEtobac.cutEtobac_chl.cutEtobcp.cutEtom.cutEtrb.cutEtrybr.cutEtrycr.cutEvco.cutEvibch.cutEwheat.cutEwht.cutExel.cutExenla.cutExenopus.cutEyeast.cutEyeast_cai.cutEyeast_high.cutEyeast_mit.cutEyeastcai.cutEyen.cutEyeren.cutEyerpe.cutEysc.cutEysc_h.cutEyscmt.cutEysp.cutEzebrafish.cutEzma.cut("", " -cfile=" + str(value))[value is not None and value!=vdef]2e_outputOutput sectione_outfileName of the output file (e_outfile)DNAFilenamebacktranseq.e_outfile("" , " -outfile=" + str(value))[value is not None]3e_osformat_outfileChoose the sequence output formatDNAChoiceFASTAEMBLFASTAGCGGENBANKNBRFCODATARAWSWISSPROTGFF("", " -osformat=" + str(value))[value is not None and value!=vdef]4e_outfile_outoutfile_out optionDNASequencee_outfileautoTurn off any promptingString" -auto -stdout"5
Programs-5.1.2/helixturnhelix.xml 0000644 0001750 0000156 00000024514 12072525233 015760 0 ustar bneron sis
helixturnhelixEMBOSS6.3.1EMBOSSEuropean Molecular Biology Open Software SuiteRice,P. Longden,I. and Bleasby, A.EMBOSS: The European Molecular Biology Open Software Suite (2000) Rice,P. Longden,I. and Bleasby, A. Trends in Genetics 16, (6) pp276--277http://emboss.sourceforge.net/downloadhttp://emboss.sourceforge.nethelixturnhelixIdentify nucleic acid-binding motifs in protein sequenceshttp://bioweb2.pasteur.fr/docs/EMBOSS/helixturnhelix.htmlhttp://emboss.sourceforge.net/docs/themessequence:protein:2D_structuresequence:protein:motifsstructure:2D_structurehelixturnhelixe_inputInput sectione_sequencesequence optionProteinSequenceEMBLFASTAGCGGENBANKNBRFCODATARAWSWISSPROTGFF1,n("", " -sequence=" + str(value))[value is not None]1e_additionalAdditional sectione_meanMean value (value from 1. to 10000.)Float238.71("", " -mean=" + str(value))[value is not None and value!=vdef]Value greater than or equal to 1. is requiredvalue >= 1.Value less than or equal to 10000. is requiredvalue <= 10000.2e_sdStandard deviation value (value from 1. to 10000.)Float293.61("", " -sd=" + str(value))[value is not None and value!=vdef]Value greater than or equal to 1. is requiredvalue >= 1.Value less than or equal to 10000. is requiredvalue <= 10000.3e_minsdMinimum sd (value from 0. to 100.)Float2.5("", " -minsd=" + str(value))[value is not None and value!=vdef]Value greater than or equal to 0. is requiredvalue >= 0.Value less than or equal to 100. is requiredvalue <= 100.4e_eightysevenUse the old (1987) weight dataBoolean0("", " -eightyseven")[ bool(value) ]5e_outputOutput sectione_outfileName of the report fileFilenamereport.hth("" , " -outfile=" + str(value))[value is not None]6e_rformat_outfileChoose the report output formatChoiceMOTIFDASGFFDBMOTIFDIFFSEQEMBLEXCELFEATTABLEGENBANKGFFLISTFILEMOTIFNAMETABLECODATAREGIONSSEQTABLESIMPLESRSSWISSTABLETAGSEQ("", " -rformat=" + str(value))[value is not None and value!=vdef]7e_outfile_outoutfile_out optionTexte_rformat_outfile in ['DASGFF', 'DBMOTIF', 'DIFFSEQ', 'EMBL', 'EXCEL', 'FEATTABLE', 'GENBANK', 'GFF', 'LISTFILE', 'MOTIF', 'NAMETABLE', 'CODATA', 'REGIONS', 'SEQTABLE', 'SIMPLE', 'SRS', 'SWISS', 'TABLE', 'TAGSEQ']e_outfileautoTurn off any promptingString" -auto -stdout"8
Programs-5.1.2/charge.xml 0000644 0001750 0000156 00000024034 12072525233 014132 0 ustar bneron sis
chargeEMBOSS6.3.1EMBOSSEuropean Molecular Biology Open Software SuiteRice,P. Longden,I. and Bleasby, A.EMBOSS: The European Molecular Biology Open Software Suite (2000) Rice,P. Longden,I. and Bleasby, A. Trends in Genetics 16, (6) pp276--277http://emboss.sourceforge.net/downloadhttp://emboss.sourceforge.netchargeDraw a protein charge plothttp://bioweb2.pasteur.fr/docs/EMBOSS/charge.htmlhttp://emboss.sourceforge.net/docs/themessequence:protein:compositionchargee_inputInput sectione_seqallseqall optionProteinSequenceEMBLFASTAGCGGENBANKNBRFCODATARAWSWISSPROTGFF1,n("", " -seqall=" + str(value))[value is not None]1e_aadataAmino acids properties and molecular weight data fileProteinAminoAcidPropertiesAbstractText("", " -aadata=" + str(value))[value is not None ]2e_additionalAdditional sectione_windowWindow length (value greater than or equal to 1)Integer5("", " -window=" + str(value))[value is not None and value!=vdef]Value greater than or equal to 1 is requiredvalue >= 13e_outputOutput sectione_plotProduce graphicBoolean0("", " -plot")[ bool(value) ]4e_graphChoose the e_graph output formatChoicee_plotpngpnggifcpspsmetadata(" -graph=" + str(vdef), " -graph=" + str(value))[value is not None and value!=vdef]5xy_goutfileName of the output graphFilenamee_plotcharge_xygraph("" , " -goutfile=" + str(value))[value is not None]6xy_outgraph_pngGraph filePictureBinarye_plot and e_graph == "png""*.png"xy_outgraph_gifGraph filePictureBinarye_plot and e_graph == "gif""*.gif"xy_outgraph_psGraph filePostScriptBinarye_plot and e_graph == "ps" or e_graph == "cps""*.ps"xy_outgraph_metaGraph filePictureBinarye_plot and e_graph == "meta""*.meta"xy_outgraph_dataGraph fileTexte_plot and e_graph == "data""*.dat"e_outfileName of the output file (e_outfile)Filenamenot e_plotcharge.e_outfile("" , " -outfile=" + str(value))[value is not None]7e_outfile_outoutfile_out optionChargeReportReporte_outfileautoTurn off any promptingString" -auto -stdout"8
Programs-5.1.2/scan_region.xml 0000644 0001750 0000156 00000060735 11767572177 015223 0 ustar bneron sis
scan_regionscan_regionScan genomic regions in a query-file against a DB-file which contains
chromosome locations for various genomics featuresgenetics:detectionscan_region.plcnvfileCNV calls file (cnv)CnvAbstractText(defined $value) ? " $value " : ""( "" , " " + str(value) )[ value is not None]A file containing CNV calls, that could be generated by the
test operation of detect_cnv program.1reffileReference genes for CNV calls generated using hg18 (Mar 2006, NCBI build 36) human genome assemblyChoicenullnullhg18_refGene.txtUCSCknownGene.txt(defined $value) ? " $value " : ""( "" , " " + str(value) )[ value is not None ]2referenceFlags specifying type of databasesChoicenullnull--refgene--refcds--refexon--knowngene(defined $value and $value ne $vdef) ? " $value " : ""( "" , " " + str(value) )[ value is not None and value !=vdef]3--refgene: specify that the database file is in refGene format from UCSC genome
browser.--refcds: specify that the database file is in refGene format from UCSC genome
browser, but user is only interested in the overlap of coding
region (first exon to last exon).--refexon: specify that the database file is in refGene format from UCSC genome
browser, but user is only interested in the overlap of query
with exons.--knowngene: specify that the database file is in knownGene format from UCSC
genome browser.dbfile-specificDatabase-specific argumentsname2Use name2 annotation in refGene file in outputBoolean$reference eq '--refgene'reference == '--refgene'0($value) ? "--name2 " : ""( "" , " --name2 " )[ value]3This argument is used in conjunction with the --refgene argument,
to specify that the alternative gene symbol in the
"name2" field in the refGene file be printed in the output.reflinkSpecify a cross-reference file for the RefGene track in UCSC genome browserBoolean$reference ne '--knowngene' reference != '--knowngene'0($value) ? "--reflink hg18_refLink.txt " : ""( "" , " --reflink hg18_refLink.txt " )[ value]3Specify a cross-reference file for the RefGene track in UCSC
genome browser, so that in the output, the gene identifier
(gene name or refseq id) are replaced by the gene symbol specified
in the link file. (If not found in the reflink file, the
gene identifiers are still used).kgxrefSpecify a cross-reference file for the knownGene track in UCSC genome browserBoolean$reference eq '--knowngene'reference == '--knowngene'0($value) ? "--kgxref UCSCkgXref.txt " : ""( "" , " --kgxref UCSCkgXref.txt " )[ value]3Specify a cross-reference file for the knownGene track in UCSC
genome browser, so that in the output, the gene identifier
(gene name or refseq id) are replaced by the gene symbol specified
in the kgxref file. (If not found in the kgxref file, the
gene identifiers are still used).query-db-matchCriteria for defining query-db matchcondense_queryCondense and eliminate overlapping regions in queryBoolean0($value) ? "--condense_query " : ""( "" , " --condense_query " )[ value]3Condense overlapped regions in the query file into non-over-lapped
regions. When this argument is set, the annotation for
each query (the strings after the chromosome location in each
line of the query file) will not in the output.score_thresholdScore threshold for database in UCSC annotation fileFloat(defined $value) ? " --score_threshold $value " : ""( "" , " --score_threshold " + str(value) )[ value is not None]3Specify the score threshold in the database file to include in the
search for overlaps. This argument is file format dependent.normscore_thresholdNormalized score threshold for database in UCSC annotation fileFloat(defined $value) ? " --normscore_threshold $value " : ""( "" , " --normscore_threshold " + str(value) )[ value is not None]3Specify the normalized score threshold in the database file to
include in the search for overlaps. This argument is file format dependent.expansion_queryExpansion of query to find matchexpandleftExpand left side of query regions (overwrite --expandmax)Integer($reference eq '--knowngene' or $reference eq '--refgene') and (not $expandmax)(reference == '--knowngene' or reference == '--refgene') and (not expandmax)(defined $value) ? " --expandleft $value " : ""( "" , " --expandleft " + str(value) )[ value is not None]3Expand the query region on the left side (5 megabases in forward strand,
3 megabases in reverse strand) to find overlap (used in conjunction with
--refgene or --knowngene argument).expandrightExpand right side of query regions (overwrite --expandmax)Integer($reference eq '--knowngene' or $reference eq '--refgene') and (not $expandmax)(reference == '--knowngene' or reference == '--refgene') and (not expandmax)(defined $value) ? " --expandright $value " : ""( "" , " --expandright " + str(value) )[ value is not None]3Expand the query region on the right side (3 megabases in forward
strand, 5 megabases in reverse strand) to find overlap (used in conjunction
with --refgene or --knowngene argument).expandmaxSize of maximum expansion for query region to find overlapInteger$reference eq '--knowngene' or $reference eq '--refgene'reference == '--knowngene' or reference == '--refgene'(defined $value) ? " --expandmax $value " : ""( "" , " --expandmax " + str(value) )[ value is not None]3Maximum expansion size of the query region on both side to find
at least one overlap (used in junction with --refgene or
--knowngene argument). After query expansion, only the closet
gene will be printed; other genes, even if overlapping with the
query after expansion, will not be printed.expanddbExpand definition of gene/cds/exon at both sidesInteger(defined $value) ? " --expanddb $value " : ""( "" , " --expanddb " + str(value) )[ value is not None]3Expand the chromosome region specified in the database-file to find
overlap with the query regions.output_optionInput/output optionsoverlapPrint overlapped portion of region onlyBoolean0($value) ? "--overlap " : ""( "" , " --overlap " )[ value]3Instead of printing the query region, only print the overlapped
portion of the query region and template region.dbregionPrint database region (default is to print query region)Boolean0($value) ? "--dbregion " : ""( "" , " --dbregion " )[ value]3Print the region in database file, rather than query file, when an
overlapped hit is found.appendAppend extra information from annotation file to outputBoolean0($value) ? "--append " : ""( "" , " --append " )[ value]3Append the score and normscore for the overlapped template
region to the output for database files downloaded as UCSC tables.queryinfoForce to print query info when Print database region is usedBoolean$dbregiondbregion0($value) ? "--queryinfo " : ""( "" , " --queryinfo " )[ value]3output_fileOutput fileCnvAbstractText"scan_region.out""scan_region.out"
Programs-5.1.2/rnaduplex.xml 0000644 0001750 0000156 00000015545 12265777220 014723 0 ustar bneron sis
rnaduplexRNAduplexCompute the structure upon hybridization of two RNA strandsIvo HofackerRNAduplex reads two RNA sequences from file and
computes optimal and suboptimal secondary structures for their hybridization. The calculation is simplified by allowing only inter-molecular base pairs.sequence:nucleic:2D_structurestructure:2D_structureRNAduplexseqRNA Sequence FileDNASequenceFASTA" < $value" " < "+ str(value) 1000controlControl options2suboptimalSuboptimal structures (-e)Integer0(defined $value and $value!=$vdef)? " -e $value" : ""( "" , " -e " + str(value))[ value is not None and value != vdef ]Compute suboptimal structures with energy with range kcal/mol of the optimum. Default is calculation of mfe structure only.temperatureRescale energy parameters to a temperature of temp C. (-T)Integer37(defined $value and $value != $vdef)? " -T $value" : ""( "" , " -T " + str(value) )[ value is not None and value != vdef]tetraloopsDo not include special stabilizing energies for certain tetraloops (-4)Boolean0($value)? " -4" : ""( "" , " -4" )[ value ]inputInput parameters2noGUDo not allow GU pairs (-noGU)Boolean0($value)? " -noGU" : ""( "" , " -noGU" )[ value ]noCloseGUDo not allow GU pairs at the end of helices (-noCloseGU)Boolean0($value)? " -noCloseGU" : ""( "" , " -noCloseGU" )[ value ]parameterEnergy parameter file (-P)EnergyParameterFileAbstractText(defined $value)? " -P $value" : ""( "" , " -P " + str(value) )[ value is not None ]Read energy parameters from paramfile, instead of using the default parameter set. A sample parameterfile should accompany your distribution. See the RNAlib documentation for details on the file format.readseqString"readseq -f=19 -a $seq > $seq.tmp && (cp $seq $seq.orig && mv $seq.tmp $seq) ; ""readseq -f=19 -a "+ str(seq) + " > "+ str(seq) +".tmp && (cp "+ str(seq) +" "+ str(seq) +".orig && mv "+ str(seq) +".tmp "+ str(seq) +") ; "-10psfilesPostscript filePostScriptBinary"*.ps""*.ps"
Programs-5.1.2/msa.xml 0000644 0001750 0000156 00000022355 11767572177 013510 0 ustar bneron sis
msa2.1MSAMultiple sequence alignmentS. F. Altschulhttp://www.ncbi.nlm.nih.gov/CBBresearch/Schaffer/msa.htmlftp://fastlink.nih.gov/pub/msa/msa.tar.Zalignment:multiplemsaseqsSequences FileSequenceFASTA" $value"" "+str(value)2This is a file containing the sequences to be aligned.controlControl parameters1optimalTurns off the optimal multiple alignment (-m)Boolean0($value)? " -m":""("" , " -m")[ value ]Turns off the optimal multiple alignment segment of the program.This allows the user to see the heuristic alignment and other data produced by the program before the it attempts to produce an optimal multiple alignment.forcedresUser file to force the alignment of certain residues (-f)ForceAlignmentPatternAbstractText(defined $value)? " -f $value" : "" ( "" , " -f " + str(value) )[ value is not None ]Allows the user to force the alignment of certain residues. The file referred to must have one or more lines of the following format:seqs.| "S" precedes block start | "L" precedes block lengthThe example would force positions 22 to 31 of sequence 2 to be aligned with positions 21 to 30 of sequence 3 and positions 25 to 34 of sequence 5; it would else force position 35 of sequence 2 to be aligned with position 36 of sequence 3 and position 41 of sequence 5. Needless to say, all positions forced into alignment must be mutually consistent.
2 3 5 S 22 21 25 L 10 S 35 36 41 L 1CostCost parameters1endgapCharges terminal gaps the same as internal gaps (-g)Boolean0 ($value)? " -g":""("" , " -g")[ value ]As a default, no charge is made for the existence of a terminal gap.unweightCost of a multiple alignment (-b)Boolean0 ($value)? " -b":""("" , " -b")[ value ]The cost of a multiple alignment is taken to be the unweighted sum of all the pairwise alignments. In the absence of this flag, the program estimates an evolutionary tree and uses it to assign weights to each pairwise alignment using either rationale-1 or rational-2 as described in Altschul et al., J. Molec. Biol. 208 (1989).maxscoreMaximum score of an optimal multiple alignment (-d)Integer (defined $value)? " -d$value" : ""( "" , " -d" + str(value) )[ value is not None ]Specifies the maximum score of an optimal multiple alignment. Default is calculated from the scores of the optimal pairwise alignments, the weights, and the epsilons.epsilonsUser specified epsilons for each pairwise alignment (-e)EpsilonFileAbstractText(defined $value)? " -e $value" : "" ( "" , " -e " + str(value) )[ value is not None ]As a default, the program calculates an heuristic multiple alignment and uses it to set epsilons for each pairwise alignment. Frequently the "optimal multiple alignment" will be found to have observed epsilons exceeding those supplied or calculated. When this is the case, it is advisable to rerun the program using suitably augmented epsilons. The file named here should have integers separated by spaces or newlines or both, with one integer for each pair of sequences in the order 1-2, 1-3, ... , 1-N, 2-3, ... , (N-1)-N.costsUser costs file (-c)CostsAbstractText(defined $value)? " -c $value" : "" ( "" , " -c " + str(value) )[ value is not None ]Allows the user to specify the cost for a gap, as well as the cost for aligning any pair of letters or a letter with a null. The default is PAM-250 costs for protein sequences, using the one-letter code. The format of this file is an integer, followed by all possible pairs of aligned symbols followed by their cost. For example, the file might begin as follows: This would specify a cost of 0 for aligning a null symbol with another null symbol, a cost of 1 for aligning an A with a null symbol, etc., and an additional cost of 3 for the existence of a gap. The program assumes the costs are symmetric, so that there is no need to have a line for C A as well as for A C. All costs must be non-negative integers.
3
- - 0
- A 1
A C 2
outputOutput parameters1quietSuppress verbose output (-o)Boolean0($value)? " -o":""("" , " -o")[ value ]
Programs-5.1.2/clustalO-sequence.xml 0000644 0001750 0000156 00000070745 12104230615 016300 0 ustar bneron sis
clustalO-sequenceClustal-Omega: Multiple alignmentAdd new sequences to an existing alignment. Use this interface to add new sequences to an existing alignment.The profile is converted into a HMM and the un-aligned sequences will
be multiply aligned (using the HMM background information) to form
a profile; this constructed profile is aligned with the input
profile; the columns in each profile (the original one and the one
created from the un-aligned sequences) will be kept fixed and the
alignment of the two profiles will be written out.
The un/aligned sequences must contain at least two
sequences.alignment:multipleclustaloinputData Inputsequences_inputUnaligned set of sequencesProteinSequenceFASTASWISSPROTCODATANBRF2,n" --infile=$value"" --infile=" + str( value )alignment_inputProfile (Aligned sequences)ProteinAlignmentFASTACLUSTALSTOCKHOLMMSF1" --profile1=$value"" --profile1=" + str( value )seqtypetype of sequencesChoiceautoautoProteinRNADNA(defined $value and $value neq $vdef)? " --seqtype=$value" : ""("", " --seqtype="+str(value))[value is not None and value != vdef]Since version 1.1.0 the Clustal-Omega alignment engine can process
DNA/RNA. Clustal-Omega tries to guess the sequence type (protein,
DNA/RNA), but this can be over-ruled with this flag.
dealignDealign input sequences$alignment_inputbool( alignment_input )Boolean0(defined $value and $value) " --dealign " : ""( "" , " --dealign ")[ value is not None and value !=vdef ] When the sequences are aligned (all sequences
have the same length and at least one sequence has at least one
gap), then the alignment is turned into a HMM, the sequences are
de-aligned and the now un-aligned sequences are aligned using the
HMM as an External Profile for External Profile Alignment (EPA).
If no EPA is desired use turn on this option. Clustal-Omega reads the file of aligned sequences.
It de-aligns the sequences and then re-aligns them.
No HMM is produced in the process, no pseudo-count information is transferred.
Consequently, the output must be the same as for unaligned output.clusteringClustering
In order to produce a multiple alignment Clustal-Omega requires a
guide tree which defines the order in which sequences/profiles are
aligned. A guide tree in turn is constructed, based on a distance
matrix. Conventionally, this distance matrix is comprised of all the
pair-wise distances of the sequences. The distance measure
Clustal-Omega uses for pair-wise distances of un-aligned sequences is
the k-tuple measure [4], which was also implemented in Clustal 1.83
and ClustalW2
[5,6]. If the sequences inputted via -i are aligned
Clustal-Omega uses the Kimura-corrected pairwise aligned identities
[7]. The computational effort (time/memory) to calculate and store a
full distance matrix grows quadratically with the number of sequences.
Clustal-Omega can improve this scalability to N*log(N) by employing a
fast clustering algorithm called mBed [2]; this option is
automatically invoked (default). If a full distance matrix evaluation
is desired, then the --full flag has to
be set. The mBed mode
calculates a reduced set of pair-wise distances. These distances are
used in a k-means algorithm, that clusters at most 100 sequences. For
each cluster a full distance matrix is calculated. No full distance
matrix (of all input sequences) is calculated in mBed mode. If there
are less than 100 sequences in the input, then in effect a full
distance matrix is calculated in mBed mode, however, no distance
matrix can be outputted (see below).
Clustal-Omega uses Muscle's [8] fast UPGMA implementation to construct
its guide trees from the distance matrix. By default, the distance
matrix is used internally to construct the guide tree and is then
discarded. By specifying --distmat-out the internal distance matrix
can be written to file. This is only possible in --full mode. The
guide trees by default are used internally to guide the multiple
alignment and are then discarded. By specifying the --guidetree-out
option these
internal guide trees can be written out to
file. Conversely, the distance calculation and/or guide tree building
stage can be skipped, by reading in a pre-calculated distance matrix
and/or pre-calculated guide tree. These options are invoked by
specifying the --distmat-in and/or --guidetree-in flags,
respectively. However, distance matrix reading is disabled in the
current version. By default, distance matrix and guide tree files are
not over-written, if a file with the specified name already
exists. In
this case Clustal-Omega aborts during the command-line processing
stage. In mBed mode a full distance matrix cannot
be outputted, distance matrix output is only possible in --full mode.
mBed or --full distance mode do not affect the ability to write out
guide-trees.
Guide trees can be iterated to refine the alignment (see section
ITERATION). Clustal-Omega takes the alignment, that was produced
initially and constructs a new distance matrix from this alignment.
The distance measure used at this stage is the Kimura distance [7]. By
default, Clustal-Omega constructs a reduced distance matrix at this
stage using the mBed algorithm, which will then be used to create an
improved (iterated) new guide tree. To turn off mBed-like clustering
at this
stage the --full-iter flag has to be set. While Kimura
distances in general are much faster to calculate than k-tuple
distances, time and memory requirements still scale quadratically with
the number of sequences and --full-iter clustering should only be
considered for smaller cases ( << 10,000 sequences).
[2] Blackshields G, Sievers F, Shi W, Wilm A, Higgins DG. Sequence
embedding for fast construction of guide trees for multiple
sequence alignment. Algorithms Mol Biol. 2010 May 14;5:21.
[4] Wilbur and Lipman, 1983; PMID 6572363[5] Thompson JD, Higgins DG, Gibson TJ. (1994). CLUSTAL W: improving
the sensitivity of progressive multiple sequence alignment through
sequence weighting, position-specific gap penalties and weight
matrix choice. Nucleic Acids Res., 22, 4673-4680.
[6] Larkin MA, Blackshields G, Brown NP, Chenna R, McGettigan PA,
McWilliam H, Valentin F, Wallace IM, Wilm A, Lopez R, Thompson JD,
Gibson TJ, Higgins DG. (2007). Clustal W and Clustal X version
2.0. Bioinformatics, 23, 2947-2948.
[7] Kimura M (1980). "A simple method for estimating evolutionary
rates of base substitutions through comparative studies of
nucleotide sequences". Journal of Molecular Evolution 16: 111–120.
distmat_outPairwise distance matrix output fileFilename(defined $value and $value)? " --distmat-out=$value ":""( "" , " --distmat-out="+str(value))[ value is not None ]the full option must be set$fullfullguidetree_inGuide tree input file (--guidetree-in)TreeNEWICK(defined $value )? " --guidetree-in= $value" : ""( "" , " --guidetree-in="+str(value))[ value is not None ]guidetree_outGuide tree output file (--guidetree-out)Filename(defined $value and $value)? " --guidetree-out=$value ":""( "" , " --guidetree-out="+str(value))[ value is not None ]fullUse full distance matrix for guide-tree calculation (slow; mBed is default) (--full)Boolean0(defined $full and $ full)? " --full ": ""( "" , " --full ")[ value is not None and value ]full_iterUse full distance matrix for guide-tree calculation during iteration (mBed is default) (--full-iter)Boolean0(defined $full and $ full)? " --full-iter ": ""( "" , " --full-iter ")[ value is not None and value ]output_formatAlignment Outputoutput_formatalignment output formatChoicefastafastaclustalmsfphylipstockholmvienna(defined $value and $value ne $vdef)? " --outfmt=$value" : ""( "" , " --outfmt=" + value )[ value is not None and value != vdef ]iterationIterationBy default, Clustal-Omega calculates (or reads in) a guide tree and
performs a multiple alignment in the order specified by this guide
tree. This alignment is then outputted. Clustal-Omega can 'iterate'
its guide tree. The hope is that the (Kimura) distances, that can be
derived from the initial alignment, will give rise to a better guide
tree, and by extension, to a better alignment.A similar rationale applies to HMM-iteration. MSAs in general are very
'vulnerable' at their early stages. Sequences that are aligned at an
early stage remain fixed for the rest of the MSA. Another way of
putting this is: 'once a gap, always a gap'. This behaviour can be
mitigated by HMM iteration. An initial alignment is created and turned
into a HMM. This HMM can help in a new round of MSA to 'anticipate'
where residues should align. This is using the HMM as an External
Profile and carrying out iterative EPA. In practice, individual
sequences and profiles are aligned to the External HMM, derived after
the initial alignment. Pseudo-count information is then transferred to
the (internal) HMM, corresponding to the individual
sequence/profile. The now somewhat 'softened' sequences/profiles are
then in turn aligned in the order specified by the guide
tree. Pseudo-count transfer is reduced with the size of the
profile. Individual sequences attain the greatest
pseudo-count
transfer, larger profiles less so. Pseudo-count transfer to profiles
larger than, say, 10 is negligible. The effect of HMM iteration is
more pronounced in larger test sets (that is, with more sequences).Both, HMM- and guide tree-iteration come at a cost of increasing the
run-time. One round of guide tree iteration adds on (roughly) the time
it took to construct the initial alignment. If, for example, the
initial alignment took 1min, then it will take (roughly) 2min to
iterate the guide tree once, 3min to iterate the guide tree twice, and
so on. HMM-iteration is more costly, as each round of iteration adds
three times the time required for the alignment stage. For example, if
the initial alignment took 1min, then each additional round of HMM
iteration will add on 3min; so 4 iterations will take 13min
(=1min+4*3min). The factor of 3 stems from the fact that at every
stage both intermediate profiles have to be aligned with the
background HMM, and finally the (softened) HMMs have to be aligned as
well. All times are quoted for single processors.By default, guide tree iteration and HMM-iteration are coupled. This
means, at each iteration step both, guide tree and HMM, are
re-calculated. This is invoked by setting the --iter flag. For
example, if --iter=1, then first an initial alignment is produced
(without external HMM background information and using k-tuple
distances to calculate the guide tree). This initial alignment is then
used to re-calculate a new guide tree (using Kimura distances) and to
create a HMM. The new
guide tree and the HMM are then used to produce
a new MSA.Iteration of guide tree and HMM can be de-coupled. This means that the
number of guide tree iterations and HMM iterations can be
different. This can be done by combining the --iter flag with the
--max-guidetree-iterations and/or the --max-hmm-iterations flag. The
number of guide tree iterations is the minimum of --iter and
--max-guidetree-iterations, while the number of HMM iterations is the
minimum of --iter and --max-hmm-iterations. If, for example, HMM
iteration should be
performed 5 times but guide tree iteration should
be performed only 3 times, then one should set --iter=5 and
--max-guidetree-iterations=3. All three flags can be specified at the
same time (however, this makes no sense). It is not sufficient just to
specify --max-guidetree-iterations and --max-hmm-iterations but not
--iter. If any iteration is desired --iter has to be set.iterationsNumber of (combined guide-tree/HMM) iterations (--iter)Integer(defined $value)? " --iter=$value ": ""( "" , " --iter="+str(value) )[ value is not None ]if iterations= 2. Clustal-Omega reads the input file, creates a UPGMA guide tree
built from k-tuple distances, and performs an initial alignment. This
initial alignment is converted into a HMM and a new guide tree is
built from the Kimura distances of the initial alignment. The
un-aligned sequences are then aligned (for the second time but this
time) using pseudo-count information from the HMM created after the
initial alignment (and using the new guide tree). This second
alignment is then again converted into a HMM and a new guide tree is
constructed. The un-aligned sequences are then aligned (for a third
time), again using pseudo-count information of the HMM from the
previous step and the most recent guide tree. The final alignment is
written to screen.max_guidetree_iterationsMaximum number guidetree iterations (--max-guidetree-iterations)Integer(defined $value)? " --max-guidetree-iterations=$value ": ""( "" , " --max-guidetree-iterations="+str(value) )[ value is not None ]If iterations= 5 and the "Maximum number guidetree iterations" is set to 1.
Clustal-Omega reads the input file, creates a UPGMA guide tree
built from k-tuple distances, and performs an initial alignment. This
initial alignment is converted into a HMM and a new guide tree is
built from the Kimura distances of the initial alignment. The
un-aligned sequences are then aligned (for the second time but this
time) using pseudo-count information from the HMM created after the
initial alignment (and using the new guide tree). For the last 4
iterations the guide tree is left unchanged and only HMM iteration is
performed. This means that intermediate alignments are converted to
HMMs, and these intermediate HMMs are used to guide the MSA during
subsequent iteration stages.max_hmm_iterationsMaximum number of HMM iterations (--max-hmm-iterations)Integer(defined $value)? " --max-hmm-iterations=$value ": ""( "" , " --max-hmm-iterations="+str(value) )[ value is not None ]miscellaneousMiscellaneousautoSet options automatically (might overwrite some of your options) (--auto)Boolean0(defined $value and $value)? " --auto ": ""( "" , " --auto ")[value is not None and value]Users may feel unsure which options are appropriate in certain
situations even though using ClustalO without any special options
should give you the desired results. The --auto flag tries to
alleviate this problem and selects accuracy/speed flags according to
the number of sequences. For all cases will use mBed and thereby
possibly overwrite the --full option. For more than 1,000 sequences
the iteration is turned off as the effect of iteration is more
noticeable for 'larger'
problems. Otherwise iterations are set to 1 if
not already set to a higher value by the user. Expert users may want
to avoid this flag and exercise more fine tuned control by selecting
the appropriate options manually.verbosity100String" -v --force --log=clustalO_log"" -v --force --log=clustalO_log"alignment_outputMultiple Sequence AlignmentProteinAlignmentFASTACLUSTALMSFPHILIPISTOCKHOLMFASTA"clustalO-sequence.out""clustalO-sequence.out"guidetree_outfileGuide tree output filedefined $guidetree_outguidetree_out is not NoneTreeNEWICK$guidetree_outguidetree_outdistmat_outfilePairwise distance matrix output filedefined $distmat_outdistmat_out is not NoneDistanceMatrixAbstractText$distmat_outdistmat_outlogfileClustal omega log fileClustalOReportReport"clustalO_log""clustalO_log"
Programs-5.1.2/fuzzpro.xml 0000644 0001750 0000156 00000021676 12072525233 014431 0 ustar bneron sis
fuzzproEMBOSS6.3.1EMBOSSEuropean Molecular Biology Open Software SuiteRice,P. Longden,I. and Bleasby, A.EMBOSS: The European Molecular Biology Open Software Suite (2000) Rice,P. Longden,I. and Bleasby, A. Trends in Genetics 16, (6) pp276--277http://emboss.sourceforge.net/downloadhttp://emboss.sourceforge.netfuzzproSearch for patterns in protein sequenceshttp://bioweb2.pasteur.fr/docs/EMBOSS/fuzzpro.htmlhttp://emboss.sourceforge.net/docs/themessequence:protein:motifsfuzzproe_inputInput sectione_sequencesequence optionProteinSequenceEMBLFASTAGCGGENBANKNBRFCODATARAWSWISSPROTGFF1,n("", " -sequence=" + str(value))[value is not None]1e_patternSearch patternProteinPatternAbstractText("", " -pattern=@" + str(value))[value is not None]2The standard IUPAC one-letter codes for the amino acids are used.
The symbol 'x' is used for a position where any amino acid is accepted.
Ambiguities are indicated by listing the acceptable amino acids for a given position, between square parentheses '[ ]'. For example: [ALT] stands for Ala or Leu or Thr.
Ambiguities are also indicated by listing between a pair of curly brackets '{ }' the amino acids that are not accepted at a given position. For example: {AM} stands for any amino acid except Ala and Met.
Each element in a pattern is separated from its neighbor by a '-'. (Optional in fuzzpro).
Repetition of an element of the pattern can be indicated by following that element with a numerical value or a numerical range between parenthesis. Examples: x(3) corresponds to x-x-x, x(2,4) corresponds to x-x or x-x-x or x-x-x-x.
When a pattern is restricted to either the N- or C-terminal of a sequence, that pattern either starts with a '<' symbol or respectively ends with a '>' symbol.
A period ends the pattern. (Optional in fuzzpro).
For example, [DE](2)HS{P}X(2)PX(2,4)Ce_pmismatchSearch patternInteger0("", " -pmismatch=" + str(value))[value is not None and value!=vdef]3e_outputOutput sectione_outfileName of the report fileFilenamefuzzpro.report("" , " -outfile=" + str(value))[value is not None]4e_rformat_outfileChoose the report output formatChoiceSEQTABLEDASGFFDBMOTIFDIFFSEQEMBLEXCELFEATTABLEGENBANKGFFLISTFILEMOTIFNAMETABLECODATAREGIONSSEQTABLESIMPLESRSSWISSTABLETAGSEQ("", " -rformat=" + str(value))[value is not None and value!=vdef]5e_outfile_outoutfile_out optionTexte_rformat_outfile in ['DASGFF', 'DBMOTIF', 'DIFFSEQ', 'EMBL', 'EXCEL', 'FEATTABLE', 'GENBANK', 'GFF', 'LISTFILE', 'MOTIF', 'NAMETABLE', 'CODATA', 'REGIONS', 'SEQTABLE', 'SIMPLE', 'SRS', 'SWISS', 'TABLE', 'TAGSEQ']e_outfileautoTurn off any promptingString" -auto -stdout"6
Programs-5.1.2/patmatmotifs.xml 0000644 0001750 0000156 00000020265 12072525233 015413 0 ustar bneron sis
patmatmotifsEMBOSS6.3.1EMBOSSEuropean Molecular Biology Open Software SuiteRice,P. Longden,I. and Bleasby, A.EMBOSS: The European Molecular Biology Open Software Suite (2000) Rice,P. Longden,I. and Bleasby, A. Trends in Genetics 16, (6) pp276--277http://emboss.sourceforge.net/downloadhttp://emboss.sourceforge.netpatmatmotifsScan a protein sequence with motifs from the PROSITE databasehttp://bioweb2.pasteur.fr/docs/EMBOSS/patmatmotifs.htmlhttp://emboss.sourceforge.net/docs/themessequence:protein:motifspatmatmotifse_inputInput sectione_sequencesequence optionProteinSequenceEMBLFASTAGCGGENBANKNBRFCODATARAWSWISSPROTGFF1,1("", " -sequence=" + str(value))[value is not None]1e_additionalAdditional sectione_fullProvide full documentation for matching patternsBoolean0("", " -full")[ bool(value) ]2e_pruneIgnore simple patternsBoolean1(" -noprune", "")[ bool(value) ]3Ignore simple patterns. If this is true then these simple post-translational modification sites are not reported: myristyl, asn_glycosylation, camp_phospho_site, pkc_phospho_site, ck2_phospho_site, and tyr_phospho_site.e_outputOutput sectione_outfileName of the report fileFilenamepatmatmotifs.report("" , " -outfile=" + str(value))[value is not None]4e_rformat_outfileChoose the report output formatChoiceDBMOTIFDASGFFDBMOTIFDIFFSEQEMBLEXCELFEATTABLEGENBANKGFFLISTFILEMOTIFNAMETABLECODATAREGIONSSEQTABLESIMPLESRSSWISSTABLETAGSEQ("", " -rformat=" + str(value))[value is not None and value!=vdef]5e_outfile_outoutfile_out optionTexte_rformat_outfile in ['DASGFF', 'DBMOTIF', 'DIFFSEQ', 'EMBL', 'EXCEL', 'FEATTABLE', 'GENBANK', 'GFF', 'LISTFILE', 'MOTIF', 'NAMETABLE', 'CODATA', 'REGIONS', 'SEQTABLE', 'SIMPLE', 'SRS', 'SWISS', 'TABLE', 'TAGSEQ']e_outfileautoTurn off any promptingString" -auto -stdout"6
Programs-5.1.2/showseq.xml 0000644 0001750 0000156 00000125413 12072525233 014375 0 ustar bneron sis
showseqEMBOSS6.3.1EMBOSSEuropean Molecular Biology Open Software SuiteRice,P. Longden,I. and Bleasby, A.EMBOSS: The European Molecular Biology Open Software Suite (2000) Rice,P. Longden,I. and Bleasby, A. Trends in Genetics 16, (6) pp276--277http://emboss.sourceforge.net/downloadhttp://emboss.sourceforge.netshowseqDisplays sequences with features in pretty formathttp://bioweb2.pasteur.fr/docs/EMBOSS/showseq.htmlhttp://emboss.sourceforge.net/docs/themesdisplay:nucleic:restrictiondisplay:nucleic:translationshowseqe_inputInput sectione_sequencesequence optionDNASequenceEMBLFASTAGCGGENBANKNBRFCODATARAWSWISSPROTGFF1,n("", " -sequence=" + str(value))[value is not None]1e_mfileRestriction enzyme methylation data file (optional)RestrictionEnzymeMethylationDataAbstractText("", " -mfile=" + str(value))[value is not None ]2e_requiredRequired sectione_formatThings to displayChoice2012345678("", " -format=" + str(value))[value is not None and value!=vdef]3e_thingsSpecify your own things to display (value from 1 to 100)Stringe_format=="0"B,N,T,S,A,F("", " -things=" + str(value))[value is not None and value!=vdef]4Specify a list of one or more code characters in the order in which you wish things to be displayed one above the other down the page.S: SequenceB: Blank line1: Frame1 translation2: Frame2 translation3: Frame3 translation-1: Compframe1 translation-2: Compframe2 translation-3: Compframe3 translationT: Ticks lineN: Number ticks lineC: Complement sequenceF: FeaturesR: Restriction enzyme cut sites in forward sense-R: Restriction enzyme cut sites in reverse senseA: AnnotationFor example if you wish to see things displayed in the order: sequence, complement sequence, ticks line, frame 1 translation, blank line; then you should enter 'S,C,T,1,B'.e_additionalAdditional sectione_translateRegions to translate (eg: 4-57,78-94)String("", " -translate=" + str(value))[value is not None]5Regions to translate (if translating).
If this is left blank the complete sequence is translated.
A set of regions is specified by a set of pairs of positions.
The positions are integers.
They are separated by any non-digit, non-alpha character.
Examples of region specifications are:
24-45, 56-78
1:45, 67=99;765..888e_revtranslateRegions to translate in reverse direction (eg: 78-94,4-57)String("", " -revtranslate=" + str(value))[value is not None]6Regions to translate (if translating).
If this is left blank the complete sequence is translated.
A set of regions is specified by a set of pairs of positions.
The positions are integers.
They are separated by any non-digit, non-alpha character.
Examples of region specifications are:
78-56, 45-24,
888..765, 99=67; 45:1e_uppercaseRegions to put in uppercase (eg: 4-57,78-94)String("", " -uppercase=" + str(value))[value is not None]7Regions to put in uppercase.
If this is left blank, then the sequence case is left alone.
A set of regions is specified by a set of pairs of positions.
The positions are integers.
They are separated by any non-digit, non-alpha character.
Examples of region specifications are:
24-45, 56-78
1:45, 67=99;765..888
1,5,8,10,23,45,57,99e_highlightRegions to colour in html (eg: 4-57 red 78-94 green)String("", " -highlight=" + str(value))[value is not None]8Regions to colour if formatting for HTML.
If this is left blank, then the sequence is left alone.
A set of regions is specified by a set of pairs of positions.
The positions are integers.
They are followed by any valid HTML font colour.
Examples of region specifications are:
24-45 blue 56-78 orange
1-100 green 120-156 red
A file of ranges to colour (one range per line) can be specified as '@filename'.e_annotationRegions to mark (eg: 4-57 promoter region 78-94 first exon)String("", " -annotation=" + str(value))[value is not None]9Regions to annotate by marking.
If this is left blank, then no annotation is added.
A set of regions is specified by a set of pairs of positions followed by optional text.
The positions are integers.
They are followed by any text (but not digits when on the command-line).
Examples of region specifications are:
24-45 new domain 56-78 match to Mouse
1-100 First part 120-156 oligo
A file of ranges to annotate (one range per line) can be specified as '@filename'.e_enzymesComma separated restriction enzyme listStringall("", " -enzymes=" + str(value))[value is not None and value!=vdef]10The name 'all' reads in all enzyme names from the REBASE database. You can specify enzymes by giving their names with commas between then, such as: 'HincII,hinfI,ppiI,hindiii'.
The case of the names is not important. You can specify a file of enzyme names to read in by giving the name of the file holding the enzyme names with a '@' character in front of it, for example, '@enz.list'.
Blank lines and lines starting with a hash character or '!' are ignored and all other lines are concatenated together with a comma character ',' and then treated as the list of enzymes to search for.
An example of a file of enzyme names is:
! my enzymes
HincII, ppiII
! other enzymes
hindiii
HinfI
PpiIe_tableGenetic codesChoice00123456910111213141516212223("", " -table=" + str(value))[value is not None and value!=vdef]11e_featuresectionFeature display optionse_sourcematchSource of feature to displayString("", " -sourcematch=" + str(value))[value is not None]12By default any feature source in the feature table is shown. You can set this to match any feature source you wish to show.
The source name is usually either the name of the program that detected the feature or it is the feature table (eg: EMBL) that the feature came from.
The source may be wildcarded by using '*'.
If you wish to show more than one source, separate their names with the character '|', eg:
gene* | emble_typematchType of feature to displayString("", " -typematch=" + str(value))[value is not None]13By default any feature type in the feature table is shown. You can set this to match any feature type you wish to show.
See http://www.ebi.ac.uk/embl/WebFeat/ for a list of the EMBL feature types and see Appendix A of the Swissprot user manual in http://www.expasy.org/sprot/userman.html for a list of the Swissprot feature types.
The type may be wildcarded by using '*'.
If you wish to show more than one type, separate their names with the character '|', eg:
*UTR | introne_sensematchSense of feature to display (value from -1 to 1)Integer0("", " -sensematch=" + str(value))[value is not None and value!=vdef]Value greater than or equal to -1 is requiredvalue >= -1Value less than or equal to 1 is requiredvalue <= 114By default any feature type in the feature table is shown. You can set this to match any feature sense you wish to show. 0 - any sense, 1 - forward sense, -1 - reverse sensee_minscoreMinimum score of feature to displayFloat0.0("", " -minscore=" + str(value))[value is not None and value!=vdef]15Minimum score of feature to display (see also maxscore)e_maxscoreMaximum score of feature to displayFloat0.0("", " -maxscore=" + str(value))[value is not None and value!=vdef]16Maximum score of feature to display.
If both minscore and maxscore are zero (the default), then any score is ignorede_tagmatchTag of feature to displayString("", " -tagmatch=" + str(value))[value is not None]17Tags are the types of extra values that a feature may have. For example in the EMBL feature table, a 'CDS' type of feature may have the tags '/codon', '/codon_start', '/db_xref', '/EC_number', '/evidence', '/exception', '/function', '/gene', '/label', '/map', '/note', '/number', '/partial', '/product', '/protein_id', '/pseudo', '/standard_name', '/translation', '/transl_except', '/transl_table', or '/usedin'. Some of these tags also have values, for example '/gene' can have the value of the gene name.
By default any feature tag in the feature table is shown. You can set this to match any feature tag you wish to show.
The tag may be wildcarded by using '*'.
If you wish to show more than one tag, separate their names with the character '|', eg:
gene | labele_valuematchValue of feature tags to displayString("", " -valuematch=" + str(value))[value is not None]18Tag values are the values associated with a feature tag. Tags are the types of extra values that a feature may have. For example in the EMBL feature table, a 'CDS' type of feature may have the tags '/codon', '/codon_start', '/db_xref', '/EC_number', '/evidence', '/exception', '/function', '/gene', '/label', '/map', '/note', '/number', '/partial', '/product', '/protein_id', '/pseudo', '/standard_name', '/translation', '/transl_except', '/transl_table', or '/usedin'. Only some of these tags can have values, for example '/gene' can have the value of the gene name. By default any feature tag value in the feature table is shown. You can set this to match any feature tag value you wish to show.
The tag value may be wildcarded by using '*'.
If you wish to show more than one tag value, separate their names with the character '|', eg:
pax* | 10e_stricttagsOnly display the matching tagsBoolean0("", " -stricttags")[ bool(value) ]19By default if any tag/value pair in a feature matches the specified tag and value, then all the tags/value pairs of that feature will be displayed. If this is set to be true, then only those tag/value pairs in a feature that match the specified tag and value will be displayed.e_advancedAdvanced sectione_remapsectionRestriction map optionse_flatreformatDisplay re sites in flat formatBoolean0("", " -flatreformat")[ bool(value) ]20This changes the output format to one where the recognition site is indicated by a row of '===' characters and the cut site is pointed to by a '>' character in the forward sense, or a '<' in the reverse sense strand.e_mincutsMinimum cuts per re (value from 1 to 1000)Integer1("", " -mincuts=" + str(value))[value is not None and value!=vdef]Value greater than or equal to 1 is requiredvalue >= 1Value less than or equal to 1000 is requiredvalue <= 100021This sets the minimum number of cuts for any restriction enzyme that will be considered. Any enzymes that cut fewer times than this will be ignored.e_maxcutsMaximum cuts per reInteger2000000000("", " -maxcuts=" + str(value))[value is not None and value!=vdef]22This sets the maximum number of cuts for any restriction enzyme that will be considered. Any enzymes that cut more times than this will be ignored.e_sitelenMinimum recognition site length (value from 2 to 20)Integer4("", " -sitelen=" + str(value))[value is not None and value!=vdef]Value greater than or equal to 2 is requiredvalue >= 2Value less than or equal to 20 is requiredvalue <= 2023This sets the minimum length of the restriction enzyme recognition site. Any enzymes with sites shorter than this will be ignored.e_singleForce single re site only cutsBoolean0("", " -single")[ bool(value) ]24If this is set then this forces the values of the mincuts and maxcuts qualifiers to both be 1. Any other value you may have set them to will be ignored.e_bluntAllow blunt end re cuttersBoolean1(" -noblunt", "")[ bool(value) ]25This allows those enzymes which cut at the same position on the forward and reverse strands to be considered.e_stickyAllow sticky end re cuttersBoolean1(" -nosticky", "")[ bool(value) ]26This allows those enzymes which cut at different positions on the forward and reverse strands, leaving an overhang, to be considered.e_ambiguityAllow ambiguous re matchesBoolean1(" -noambiguity", "")[ bool(value) ]27This allows those enzymes which have one or more 'N' ambiguity codes in their pattern to be considerede_plasmidAllow circular dnaBoolean0("", " -plasmid")[ bool(value) ]28If this is set then this allows searches for restriction enzyme recognition site and cut positions that span the end of the sequence to be considered.e_methylationUse methylation dataBoolean0("", " -methylation")[ bool(value) ]29If this is set then RE recognition sites will not match methylated bases.e_commercialOnly use restriction enzymes with suppliersBoolean1(" -nocommercial", "")[ bool(value) ]30If this is set, then only those enzymes with a commercial supplier will be searched for. This qualifier is ignored if you have specified an explicit list of enzymes to search for, rather than searching through 'all' the enzymes in the REBASE database. It is assumed that, if you are asking for an explicit enzyme, then you probably know where to get it from and so all enzymes names that you have asked to be searched for, and which cut, will be reported whether or not they have a commercial supplier.e_limitLimits re hits to one isoschizomerBoolean1(" -nolimit", "")[ bool(value) ]31This limits the reporting of enzymes to just one enzyme from each group of isoschizomers. The enzyme chosen to represent an isoschizomer group is the prototype indicated in the data file 'embossre.equ', which is created by the program 'rebaseextract'. If you prefer different prototypes to be used, make a copy of embossre.equ in your home directory and edit it. If this value is set to be false then all of the input enzymes will be reported. You might like to set this to false if you are supplying an explicit set of enzymes rather than searching 'all' of them.e_orfminsizeMinimum size of orfs (value greater than or equal to 0)Integer0("", " -orfminsize=" + str(value))[value is not None and value!=vdef]Value greater than or equal to 0 is requiredvalue >= 032This sets the minimum size of Open Reading Frames (ORFs) to display in the translations. All other translation regions are masked by changing the amino acids to '-' characters.e_threeletterDisplay protein sequences in three-letter codeBoolean0("", " -threeletter")[ bool(value) ]33e_numberNumber the sequencesBoolean0("", " -number")[ bool(value) ]34e_widthWidth of sequence to display (value greater than or equal to 1)Integer60("", " -width=" + str(value))[value is not None and value!=vdef]Value greater than or equal to 1 is requiredvalue >= 135e_lengthLine length of page (0 for indefinite) (value greater than or equal to 0)Integer0("", " -length=" + str(value))[value is not None and value!=vdef]Value greater than or equal to 0 is requiredvalue >= 036e_marginMargin around sequence for numbering (value greater than or equal to 0)Integer10("", " -margin=" + str(value))[value is not None and value!=vdef]Value greater than or equal to 0 is requiredvalue >= 037e_nameShow sequence idBoolean1(" -noname", "")[ bool(value) ]38Set this to be false if you do not wish to display the ID name of the sequencee_descriptionShow descriptionBoolean1(" -nodescription", "")[ bool(value) ]39Set this to be false if you do not wish to display the description of the sequencee_offsetOffset to start numbering the sequence fromInteger1("", " -offset=" + str(value))[value is not None and value!=vdef]40e_htmlUse html formattingBoolean0("", " -html")[ bool(value) ]41e_outputOutput sectione_outfileName of the output file (e_outfile)Filenameshowseq.e_outfile("" , " -outfile=" + str(value))[value is not None]42e_outfile_outoutfile_out optionShowseqReportReporte_outfileautoTurn off any promptingString" -auto -stdout"43
Programs-5.1.2/wise2.xml 0000644 0001750 0000156 00000104021 11767572177 013750 0 ustar bneron sis
wise22.2.0WISE2Comparisons of protein/DNA sequencesE. Birneyhttp://www.ebi.ac.uk/Tools/Wise2/doc_wise2.htmlftp://ftp.ebi.ac.uk/pub/software/unix/wise2/alignment:pairwisewise2Wise programChoiceNullNullgenewiseestwise"$value"str(value)1protein_fileProtein fileproteinProtein sequence FileProteinSequenceFASTAnot $hmmerhmmer is None" $value"" " +str(value)You must give a protein sequence file in fasta formatnot ($hmmer)not (hmmer)2You must give a protein sequence file in fasta format.hmmeror Protein HMM FileProteinHmmTextProfileAbstractText$protein is not defined protein is None" $value"" " + str(value)You must give an HMMER filenot ($protein)not (protein)2You must give an HMMER file.hmmer_commandHMM command (-hmmer)Stringdefined $hmmerhmmer is not None" -hmmer"" -hmmer"4dnaDNA sequence FileDNASequenceFASTA" $value" " " + str(value) 3quietSilent mode (-silent -quiet)String" -silent -quiet" " -silent -quiet" 100dna_optionsDNA sequence Options5dna_startStart position in dna (-u)Integer(defined $value) ? " -u $value" : ""( "" , " -u " + str(value) )[ value is not None ]dna_endEnd position in dna (-v)Integer(defined $value) ? " -v $value" : ""( "" , " -v " + str(value) )[ value is not None ]strandStrand comparisonChoice-tfor-tfor-trev-both(defined $value and $value ne $vdef) ? " $value" : ""( "" , " " + str(value) )[ value is not None and value != vdef]tabsReport positions as absolute to truncated/reverse sequence (-tabs)Boolean0($value) ? " -tabs" : ""( "" , " -tabs" )[ value ]protein_optionsProtein comparison Optionsnot $hmmerhmmer is None6protein_startStart position in protein (-s)Integer(defined $value) ? " -s $value" : ""( "" , " -s " + str(value) )[ value is not None ]protein_endEnd position in protein (-t)Integer(defined $value) ? " -t $value" : ""( "" , " -t " + str(value) )[ value is not None ]gapGap penalty (-g)Integer12(defined $value and $value != $vdef) ? " -g $value" : ""( "" , " -g " + str(value) )[ value is not None and value != vdef]extGap extension penalty (-e)Integer2(defined $value and $value != $vdef) ? " -e $value" : ""( "" , " -e " + str(value) )[ value is not None and value != vdef]matrixComparison matrix (-m)ChoiceBLOSUM62.blaBLOSUM30.blaBLOSUM45.blaBLOSUM62.blaBLOSUM80.blagon120.blagon160.blagon200.blagon250.blagon350.blaidenity.bla(defined $value and $value ne $vdef) ? " -m $value" : ""( "" , " -m" + str(value) )[ value is not None and value != vdef]gene_model_optionsModel Options7initType of match (-init)Choicedefaultdefaultgloballocalwingendbias(defined $value and $value ne $vdef) ? " -init $value" : ""( "" , " -init " + str(value) )[ value is not None and value != vdef]subsSubstitution error rate (-subs)Float1e-5(defined $value and $value != $vdef) ? " -subs $value" : ""( "" , " -subs " + str(value) )[ value is not None and value != vdef]indelInsertion/deletion error rate (-indel)Float1e-5(defined $value and $value != $vdef) ? " -indel $value" : ""( "" , " -indel " + str(value) )[ value is not None and value != vdef]nullRandom Model as synchronous or flat (-null)Choicesynsynflat(defined $value and $value ne $vdef) ? " -null $value" : ""( "" , " -null " + str(value) )[ value is not None and value != vdef]allnProbability of matching a NNN codon (-alln)Float1.0(defined $value and $value != $vdef) ? " -alln $value" : ""( "" , " -alln " + str(value) )[ value is not None and value != vdef]wise2_model_optGenewise special option$wise2 eq "genewise"wise2 == "genewise"geneGene parameter file (-gene)Choicehuman.gfhuman.gfpb.gfpombe.gfworm.gf(defined $value and $value ne $vdef) ? " -gene $value" : ""( "" , " -gene " + str(value) )[ value is not None and value != vdef]cfreqUsing codon bias or not (-cfreq)?Choiceflatmodelflat(defined $value and $value ne $vdef) ? " -cfreq $value" : ""( "" , " -cfreq " + str(value) )[ value is not None and value != vdef]spliceUsing splice model or GT/AG (-splice)?Choicemodelmodelflat(defined $value and $value ne $vdef) ? " -splice $value" : ""( "" , " -splice " + str(value) )[ value is not None and value != vdef]intronUse tied model for introns (-intron)Choicetiedmodeltied(defined $value and $value ne $vdef) ? " -intron $value" : ""( "" , " -intron " + str(value) )[ value is not None and value != vdef]insertProtein insert model (-insert)Choiceflatmodelflat(defined $value and $value ne $vdef) ? " -insert $value" : ""( "" , " -insert " + str(value) )[ value is not None and value != vdef]output_optionsOutput Options9prettyShow pretty ascii output (-pretty)Boolean1($value) ? " -pretty" : ""( "" , " -pretty" )[ value ]paraShow parameters (-para)Boolean1($value) ? " -para" : ""( "" , " -para" )[ value ]sumShow summary output (-sum)Boolean0($value) ? " -sum" : ""( "" , " -sum" )[ value ]pepShow protein translation, splicing frameshifts (-pep)Boolean0($value) ? " -pep" : ""( "" , " -pep" )[ value ]albShow logical AlnBlock alignment (-alb)Boolean0($value) ? " -alb" : ""( "" , " -alb" )[ value ]palShow raw matrix alignment (-pal)Boolean0($value) ? " -pal" : ""( "" , " -pal" )[ value ]blockLength of main block in pretty output (-block)Integer50(defined $value and $value != $vdef) ? " -block $value" : ""( "" , " -block " + str(value) )[ value is not None and value != vdef]divideDivide string for multiple outputs (-divide)String(defined $value) ? " -divide \"$value\"" : ""( "" , " -divide " + str(value) )[ value is not None ]wise2_out_optGenewise special option$wise2 eq "genewise"wise2 == "genewise"pseudoMark genes with frameshifts as pseudogenes (-pseudo)Boolean0($value) ? " -pseudo" : ""( "" , " -pseudo" )[ value ]genesShow gene structure (-genes)Boolean0($value) ? " -genes" : ""( "" , " -genes" )[ value ]genesfShow gene structure with supporting evidence (-genesf)Boolean0($value) ? " -genesf" : ""( "" , " -genesf" )[ value ]emblShow EMBL feature format with CDS key (-embl)Boolean0($value) ? " -embl" : ""( "" , " -embl" )[ value ]dianaShow EMBL feature format with misc_feature key for diana (-diana)Boolean0($value) ? " -diana" : ""( "" , " -diana" )[ value ]cdnaShow cDNA (-cdna)Boolean0($value) ? " -cdna" : ""( "" , " -cdna" )[ value ]transShow protein translation, breaking at frameshifts (-trans)Boolean0($value) ? " -trans" : ""( "" , " -trans" )[ value ]aceAce file gene structure (-ace)Boolean0($value) ? " -ace" : ""( "" , " -ace" )[ value ]gffGene Feature Format file (-gff)Boolean0($value) ? " -gff" : ""( "" , " -gff" )[ value ]generRaw gene structure (-gener)Boolean0($value) ? " -gener" : ""( "" , " -gener" )[ value ]New_gene_optionsNew gene model statistics for genewise$wise2 eq "genewise"wise2 == "genewise"10splice_max_collarMaximum Bits value for a splice site (-splice_max_collar)Float5.0(defined $value and $value != $vdef) ? " -splice_max_collar $value" : ""( "" , " -splice_max_collar " + str(value) )[ value is not None and value != vdef]splice_min_collarMinimum Bits value for a splice site (-splice_min_collar)Float-5.0(defined $value and $value != $vdef) ? " -splice_min_collar $value" : ""( "" , " -splice_min_collar " + str(value) )[ value is not None and value != vdef]splice_score_offsetScore offset for splice sites (-splice_score_offset)Float4.5(defined $value and $value != $vdef) ? " -splice_score_offset $value" : ""( "" , " -splice_score_offset " + str(value) )[ value is not None and value != vdef]standard_optionsStandard Options11erroroffstdNo warning messages (-erroroffstd)Boolean0($value) ? " -erroroffstd" : ""( "" , " -erroroffstd" )[ value ]
Programs-5.1.2/mobyle_xml_types.txt 0000644 0001750 0000156 00000016543 11135113762 016320 0 ustar bneron sis python class | xml class | description
AbstractText | AceAssembly | file in ace format (cap3)
AbstractText | AncestorsFile | file to specify ancestors state in phylip (mix)
AbstractText | BambeTree |
AbstractText | BananaOutput |
AbstractText | Blast2taxonomyHtmlOutput |
AbstractText | BlastHtmlOutput |
AbstractText | BlastTextOutput |
AbstractText | BlastXmlOutput |
AbstractText | BoxshadeHtmlOutput |
AbstractText | BoxshadeRtfOutput |
AbstractText | BoxshadeXfigOutput |
AbstractText | BtwistedOutput |
AbstractText | CaiOutput |
AbstractText | ChargeOutput |
AbstractText | ChecktransOutput |
AbstractText | ChipsOutput |
AbstractText | ClippingParametersFile |
AbstractText | CodcmpOutput |
AbstractText | CoderetOutput |
AbstractText | CodonWAnalysisFile |
AbstractText | CofoldSequence |
AbstractText | ComAlignAlignment |
AbstractText | CompseqOutput |
AbstractText | ConsensusAlphabet |
AbstractText | CostsFile |
AbstractText | CpgplotOutput |
AbstractText | CpgreportOutput |
AbstractText | CuspOutput |
AbstractText | DCAlignment |
AbstractText | DsspOutput |
AbstractText | ELPOutput |
AbstractText | EmbossDistanceMatrix |
AbstractText | EmowseOutput |
AbstractText | EnergyParameterFile |
AbstractText | EntryFullText |
AbstractText | EnzymeData |
AbstractText | EpestfindOutput |
AbstractText | EpsilonFile |
AbstractText | Est2genomeOutput |
AbstractText | FastaHtmlOutput |
AbstractText | FastaTextOutput |
AbstractText | Feature |
AbstractText | FindkmOutput |
AbstractText | ForceAlignFile |
AbstractText | FreakOutput |
AbstractText | GeeceeOutput |
AbstractText | GffCustomFile |
AbstractText | GruppiOutput |
AbstractText | HmmDirichletPrior |
AbstractText | HmmNullModelFile |
AbstractText | HmmPAM |
AbstractText | HmmTextProfile |
AbstractText | HmomentOutput |
AbstractText | IepOutput |
AbstractText | InfoalignOutput |
AbstractText | InvertedOutput |
AbstractText | IsochoreOutput |
AbstractText | LindnaMappingCommands |
AbstractText | MFoldFoldingConstraints |
AbstractText | MegamergerOutput |
AbstractText | MfoldDetailHtmlOutput |
AbstractText | MfoldHtmlOutput |
AbstractText | MfoldRnamlOutput |
AbstractText | MixtureFile |
AbstractText | Molwt |
AbstractText | MrepsXmlOutput |
AbstractText | MuscleAlignment |
AbstractText | MuscleHtmlOutput |
AbstractText | NewcpgreportOutput |
AbstractText | NewcpgseekOutput |
AbstractText | OddcompOutput |
AbstractText | PalindromeOutput |
AbstractText | Pattern |
AbstractText | Pdb |
AbstractText | PepcoilOutput |
AbstractText | PepinfoOutput |
AbstractText | PepstatsOutput |
AbstractText | PeptideMolweights |
AbstractText | PhylipCategoriesRates |
AbstractText | PhylipDiscreteCharMatrix |
AbstractText | PhylipDistanceMatrix |
AbstractText | PhylipWeight |
AbstractText | PrettyseqOutput |
AbstractText | Primer3Mishybridizing |
AbstractText | Primer3Mispriming |
AbstractText | Primer3Output |
AbstractText | PrimerPairs |
AbstractText | PrimoOligo |
AbstractText | PrimoRegion |
AbstractText | PrimoRepeats |
AbstractText | ProfileOrMatrix |
AbstractText | ProfitOutput |
AbstractText | ProphecyOutput |
AbstractText | ProsePattern |
AbstractText | PrositePattern |
AbstractText | PrositeProfile |
AbstractText | ProteinCodes |
AbstractText | PsortHtmlOutput |
AbstractText | QuicktandemOutput |
AbstractText | RNAStructure |
AbstractText | RedataOutput |
AbstractText | RemapOutput |
AbstractText | RestoverOutput |
AbstractText | RestrictionFile |
AbstractText | RnaFoldSequence |
AbstractText | RnadistanceAlignment |
AbstractText | ScanPattern |
AbstractText | SequenceOverlap |
AbstractText | ShowalignOutput |
AbstractText | ShowfeatOutput |
AbstractText | ShoworfOutput |
AbstractText | ShowseqOutput |
AbstractText | SigPattern |
AbstractText | SixpackOutput |
AbstractText | StrideOutputFile |
AbstractText | StssearchOutput |
AbstractText | SupermatcherError |
AbstractText | SycoOutput |
AbstractText | SymbolFile |
AbstractText | TRnaScanFirstPassResult |
AbstractText | TacgHtmlOutput |
AbstractText | TacgTextOutput |
AbstractText | TandemOutput |
AbstractText | TaxonomyOutput |
AbstractText | TextsearchOutput |
AbstractText | TfscanOutput |
AbstractText | TipdateAlignment |
AbstractText | ToppredHtmlOutput |
AbstractText | USA_list |
AbstractText | Vector |
AbstractText | VectorstripOutput |
AbstractText | WobbleOutput |
AbstractText | WordcountOutput |
AbstractText | WordfinderError |
AbstractText | XpoundOutput |
AbstractText | XreportOutput |
Binary | AbiTraceFile |
Binary | HmmBinProfile |
Binary | PDF |
Binary | Picture |
Binary | PostScript |
Text | 3DStructure |
//
Programs-5.1.2/profit.xml 0000644 0001750 0000156 00000010474 12072525233 014207 0 ustar bneron sis
profitEMBOSS6.3.1EMBOSSEuropean Molecular Biology Open Software SuiteRice,P. Longden,I. and Bleasby, A.EMBOSS: The European Molecular Biology Open Software Suite (2000) Rice,P. Longden,I. and Bleasby, A. Trends in Genetics 16, (6) pp276--277http://emboss.sourceforge.net/downloadhttp://emboss.sourceforge.netprofitScan one or more sequences with a simple frequency matrixhttp://bioweb2.pasteur.fr/docs/EMBOSS/profit.htmlhttp://emboss.sourceforge.net/docs/themessequence:nucleic:profilessequence:protein:profilesprofite_inputInput sectione_infileProfile or weight matrix fileProfileOrMatrixAbstractText("", " -infile=" + str(value))[value is not None]1e_sequencesequence optionSequenceEMBLFASTAGCGGENBANKNBRFCODATARAWSWISSPROTGFF1,n("", " -sequence=" + str(value))[value is not None]2e_outputOutput sectione_outfileName of the output file (e_outfile)Filenameprofit.e_outfile("" , " -outfile=" + str(value))[value is not None]3e_outfile_outoutfile_out optionProfitReportReporte_outfileautoTurn off any promptingString" -auto -stdout"4
Programs-5.1.2/cif.xml 0000644 0001750 0000156 00000037632 11767572177 013475 0 ustar bneron sis
cif0.2.2CIFCut DNA regions in frameE. Quevillon, B. Boedahttp://bioweb2.pasteur.fr/docs/cif/cif.htmlhttp://bioweb2.pasteur.fr/docs/cif/compatible_cohesive_ends.txthttp://bioweb2.pasteur.fr/docs/cif/paillasse_liste.txtcif (for Cut In Frame) is a tool that works with DNA sequences. It is used to digest
your sequences with a pool of restriction enzymes and to search which enzymes cut your
sequence keeping your reading frame after ligation without any frame shift produced
due to the digestion.This helps users to work with a gene of interest to localize potential region(s) that
could be removed from the final protein to check if regions have an impact or not on
the final gene product. It can also allow to identify vital region(s) for the gene.sequence:enzyme:analysissequence:nucleic:restrictiondisplay:nucleic:restrictioncifinputInput sectionsequenceSequenceDNASequenceFASTA1,n" -i $value "( "" , " -i " + str( value ) + " " )[value is not None]1optionsOptions2enztypeType of enzymes (-T)Choice00bluntcohesiveklenow($value) ?"-T $value ": ""( "" , " -T " + str( value ) )[value != vdef]You can choose between:- blunt : Use blunt cutters- cohesive : Use cohesive cutters- klenow : Use only 5' enzymes for Klenow fill-in.[Default all three]strandCohesive enzyme strand (-S)Choice0053($value) ?"-S $value ": ""( "" , " -S " + str( value ) )[value != vdef]Cohesive enzyme strand to use: - 5' - 3' - both : 5' and 3' (Default value).(No effect if you choose to use blunt enzymes)digestionmod Digestion mode (-D)Choice00doublesimple($value) ?"-D $value ": ""( "" , " -D " + str( value ) )[value != vdef]Digestion mode:- double: Report couple of enzymes that digest sequence- simple: Report enzyme name that cut more than one time- both: simple + double (Default value).LengthMinimum length of recognition site (-L)Integer6($value) ?"-L $value ": ""( "" , " -L " + str( value ) )[value is not None and value!=vdef]Use enzymes with minimum length for DNA recognition site.By default, 6.variantUse enzymes with variant recognition site (-V)Boolean0($value) ?"-V": ""( "" , " -V " )[value]Some cohesive enzymes have variant recognition site like 'GDGCH^C' for Bsp1286I, where: D = not C (A or G or T) H = not G (A or C or T)This option, when set, use of these type of enzymes..By default, this option is off.exoticReport digestions in frame without ends compatibilities (-X)Boolean0$variant == 1variant == 1($value) ?"-X": ""( "" , " -X " )[value]Some cohesive enzymes have variant recognition site like 'GDGCH^C' for Bsp1286I, where: D = not C (A or G or T) H = not G (A or C or T)Thus, using thoses enzymes may produce a cut in frame but the produced ends may not be
compatible together regarding DNA sequence.Requires the '[-V | --variant]' option to work. By default, this option is off.compatList of compatible cohesive ends (-C)Text($value) ?"-C $value ": ""( "" , " -C " + str( value ) )[value is not None]File with list of compatible cohesive ends.The default list used is given in the program help pages (compatible_cohesive_ends.txt).If you want to give your own list, the format must be as follow:"Enzyme_name:compatEnz1_name,compatEnz2_name,..."
Acc65I:BanI,BsiWI,BsrGI
AccI:AciI,AclI,BsaHI,HinP1I,HpaII,NarI,ClaI,BstBI,TaqI
enzlistEnzyme list to work with (-E)Text($value) ?"-E $value ": ""( "" , " -E " + str( value ) )[value is not None]By default, the program works with a list of enzymes commonly used in laboratory
given in the program help pages (paillasse_liste.txt).If you want to give your own list, the format is one enzyme per line.
AatII
Acc651
...
outputoptOutput parameters3stopShow stop codon (-P)Boolean0$enztype == 0 or $enztype eq 'blunt'enztype == 0 or enztype == 'blunt'($value) ?"-P": ""( "" , " -P " )[value]Sometimes, blunt digestion, after linkage, can produce new codon around the cutting site
that leads to stop codon.This option displays such digestions with a tag 'stopCodon' in the output line results.NOTE: This option only works if 'blunt' type is set. By default this option is off, thus
if such case happened no results are reported for enzymes digestion.cut_posShow cut positions (-N)Boolean0($value) ?"-N": ""( "" , " -N " )[value] Enzymes may cut your sequence more than once.This option
reports the number of time enzyme(s) cut your sequence. [Default off]mod_aaShow new generated amino acid (-A)Boolean0$enztype ne 'cohesive'enztype != 'cohesive'($value) ?"-A": ""( "" , " -A " )[value]Experimental option. [Default off]This option allows, for 'blunt' or 'klenow' analysis to
show, in such case, the Amino acid that have been changed due to the ligation between
the 2 parts of the DNA after the digestion.It will be shown as OldAA>NewAA (e.g.: G>N).outputstyleOutput style format (-F)Choicetexttextgffimage($value) ?"-F $value ": ""( "" , " -F " + str( value ) )[value != vdef]Choose the output type that you prefer.- Text output (Default value)- GFF3 output- Image (png): Creates image (png).imgOutput imagePictureBinary1,n$outputstyle eq 'image'outputstyle == 'image'*.png"*.png"
Programs-5.1.2/stretcher.xml 0000644 0001750 0000156 00000044257 12072525233 014715 0 ustar bneron sis
stretcherEMBOSS6.3.1EMBOSSEuropean Molecular Biology Open Software SuiteRice,P. Longden,I. and Bleasby, A.EMBOSS: The European Molecular Biology Open Software Suite (2000) Rice,P. Longden,I. and Bleasby, A. Trends in Genetics 16, (6) pp276--277http://emboss.sourceforge.net/downloadhttp://emboss.sourceforge.netstretcherNeedleman-Wunsch rapid global alignment of two sequenceshttp://bioweb2.pasteur.fr/docs/EMBOSS/stretcher.htmlhttp://emboss.sourceforge.net/docs/themesalignment:pairwise:globalstretchere_inputInput sectione_asequenceasequence optionSequenceEMBLFASTAGCGGENBANKNBRFCODATARAWSWISSPROTGFF1,1("", " -asequence=" + str(value))[value is not None]1e_bsequencebsequence optionSequenceEMBLFASTAGCGGENBANKNBRFCODATARAWSWISSPROTGFF1,1("", " -bsequence=" + str(value))[value is not None]2e_datafileMatrix fileChoicemobyle_nullmobyle_nullEBLOSUM30EBLOSUM35EBLOSUM40EBLOSUM45EBLOSUM50EBLOSUM55EBLOSUM60EBLOSUM62EBLOSUM62-12EBLOSUM65EBLOSUM70EBLOSUM75EBLOSUM80EBLOSUM85EBLOSUM90EBLOSUMNEDNAFULLEDNAMATEDNASIMPLEEPAM10EPAM100EPAM110EPAM120EPAM130EPAM140EPAM150EPAM160EPAM170EPAM180EPAM190EPAM20EPAM200EPAM210EPAM220EPAM230EPAM240EPAM250EPAM260EPAM270EPAM280EPAM290EPAM30EPAM300EPAM310EPAM320EPAM330EPAM340EPAM350EPAM360EPAM370EPAM380EPAM390EPAM40EPAM400EPAM410EPAM420EPAM430EPAM440EPAM450EPAM460EPAM470EPAM480EPAM490EPAM50EPAM500EPAM60EPAM70EPAM80EPAM90SSSUB("", " -datafile=" + str(value))[value is not None and value!=vdef]3This is the scoring matrix file used when comparing sequences. By default it is the file 'EBLOSUM62' (for proteins) or the file 'EDNAFULL' (for nucleic sequences). These files are found in the 'data' directory of the EMBOSS installation.e_additionalAdditional sectione_gapopenGap penalty (Positive integer)Integer("", " -gapopen=" + str(value))[value is not None]Value greater than or equal to 0 is requiredvalue >= 0412 for protein, 16 for nucleice_gapextendGap length penalty (Positive integer)Integer("", " -gapextend=" + str(value))[value is not None]Value greater than or equal to 0 is requiredvalue >= 052 for protein, 4 for nucleice_outputOutput sectione_outfileName of the output alignment fileFilenamestretcher.align("" , " -outfile=" + str(value))[value is not None]6e_aformat_outfileChoose the alignment output formatChoiceMARKX0FASTAMSFPAIRMARKX0MARKX1MARKX2MARKX3MARKX10SRSSRSPAIRSCOREUNKNOWNMULTIPLESIMPLEMATCH("", " -aformat=" + str(value))[value is not None and value!=vdef]6e_outfile_outoutfile_out optionAlignmente_aformat_outfile in ['FASTA', 'MSF']e_outfilee_outfile_out2outfile_out2 optionTexte_aformat_outfile in ['PAIR', 'MARKX0', 'MARKX1', 'MARKX2', 'MARKX3', 'MARKX10', 'SRS', 'SRSPAIR', 'SCORE', 'UNKNOWN', 'MULTIPLE', 'SIMPLE', 'MATCH']e_outfileautoTurn off any promptingString" -auto -stdout"7
Programs-5.1.2/psort.xml 0000644 0001750 0000156 00000004757 12265214442 014063 0 ustar bneron sis
psort981201PSORTPredicts protein subcellular localization sites from their amino acid sequenceNakai, K. and Horton, P.A program for detecting the sorting signals of
proteins and predicting their subcellular localization, trends
Biochem. Sci., in press, 1999.http://bioweb2.pasteur.fr/docs/psort/index.htmlhttp://psort.hgc.jp/sequence:protein:compositionpsortseqfileProtein sequence fileProteinSequenceFASTA" $value"" "+str(value)2VerboseVerbose mode (-w)Boolean1($value) ? " -w" : ""("", " -w")[ value ]1htmlfileHtml output filePsortHtmlReportReport" >psort.html"" >psort.html"100"psort.html""psort.html"
Programs-5.1.2/kronaextract.xml 0000644 0001750 0000156 00000007712 12306354570 015416 0 ustar bneron sis kronaextract1.1kronaextractkronaextract extract, from xml file obtained by rankoptimizer program, list of reads and blast offset for a given taxonomic name.C. Maufraishttp://sourceforge.net/p/krona/home/krona/Ondov BD, Bergman NH, and Phillippy AM. Interactive metagenomic visualization in a Web browser. BMC Bioinformatics. 2011 Sep 30; 12(1):385.database:search:displaykronaextractinputFileRankoptimizer xml output fileKronaXMLReportReportXML file with krona Specification.' -i ' + str(value)10OptionsrankoptimizerOptionstaxoNameTaxonomic name needed to extract list of informations (-n).String('', ' -n ' + str(value) )[value is not None]20splitOutSplit output file in two files with the given prefix name.Boolean0The output file is split in two files: one contain reads names and the other contain corresponding taxoptimizer's line offset.('', ' -s rankoptimizer' )[value]30offsettfileOffset numbersOffsetReportReportsplitOut"*.offset"queryoutfileQueries nameQueryNameReportReportsplitOut"*.seq"OutputFileOutput fileReport"kronaextract.out"not splitOut90' -o kronaextract.out'
Programs-5.1.2/blast2.xml 0000644 0001750 0000156 00000127342 12270435360 014077 0 ustar bneron sis
blast2BLAST2NCBI BLAST, with gapsAltschul, Stephen F., Thomas L. Madden, Alejandro A. Schaeffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), Gapped BLAST and PSI-BLAST: a new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402.http://www.ncbi.nlm.nih.gov/books/bv.fcgi?rid=handbook.chapter.ch16http://www.ncbi.nlm.nih.gov/Education/BLASTinfo/tut1.htmldatabase:search:homologyblast_initBlast initiationString"blastall""blastall"1blast2Blast program (-p)Choicenullnullblastnblastxtblastxblastptblastn" -p $value"" -p "+ str(value)2- Blastp compares an amino acid query sequence against a protein sequence database;- Blastn compares a nucleotide query sequence against a nucleotide sequence database;- Blastx compares the six-frame conceptual translation products of a nucleotide query sequence (both strands) against a protein sequence database;- tBlastn compares a protein query sequence against a nucleotide sequence database dynamically translated in all six reading frames (both strands).- tBlastx compares the six-frame translations of a nucleotide query sequence against the six-frame translations of a nucleotide sequence database.- psitBlastn compares a protein query sequence against a nucleotide sequence database dynamically translated in all six reading frames (both strands) using a position specific matrix created by PSI-BLAST.dbDatabase3protein_dbProtein db (-d)Choice$blast2 =~ /^blast[px]$/blast2 in [ "blastx" , "blastp" ]null" -d $value"" -d "+ str(value)Choose a protein db for blastp or blastx.Please note that Swissprot usage by and for commercial entities requires a license agreement.nucleotid_dbNucleotid db (-d)Choice$blast2 =~ /^(blastn|tblast[nx]|psitblastn)$/blast2 in [ "blastn" , "tblastx", "tblastn" , "psitblastn" ]null" -d $value"" -d "+ str(value)Choose a nucleotide db for blastn, tblastn or tblastxqueryQuery Sequence4query_seqQuery (-i)SequenceFASTA1,n" -i $query"" -i "+ str(query_seq)Read (first, query) sequence or set from filestart_regionStart of required region in query sequence (-L)IntegerLocation on query sequenceend_regionEnd of required region in query sequence (-L)Integerdefined $start_regionstart_region is not None(defined $value) ? " -L \"$start_region $value\"" : " -L \"$start_region\""(' -L "%s"' % (str(start_region)), ' -L "%s %s"' % (str(start_region), str(value)))[value is not None]Location on query sequenceconcatNumber of concatenated queries (blastn or tblastn) (-B)Integer$blast2 =~ /^t?blastn$/blast2 in [ "blastn" , "tblastn" ](defined $value) ? " -B $value" : ""("" , " -B "+str(value))[value is not None]scoring_optScoring options5open_a_gapCost to open a gap (-G)Float(defined $value) ? " -G $value" : ""("" , " -G "+str(value))[value is not None]-1 invokes default behavior: non-affine
if greedy, 5 if using dynamic programmingextend_a_gapCost to extend a gap (-E)Float(defined $value) ? " -E $value" : ""("" , " -E "+str(value))[value is not None]Default: 2 for blastn; 1 for blastp, blastx and tblastnLimited values for gap existence and extension are supported for these programs. Some supported and suggested values are:Existence Extension10 -- 110 -- 211 -- 18 -- 29 -- 2scoring_blastProtein penalty (not for blastn)$blast2 ne "blastn"blast2 != "blastn"matrixSimilarity matrix (-M)ChoiceBLOSUM62BLOSUM62BLOSUM45BLOSUM80PAM30PAM70(defined $value and $value ne $vdef) ? " -M $value" : ""("" , " -M "+str(value))[value is not None and value != vdef]scoring_blastnBlastn penalty$blast2 eq "blastn"blast2 == "blastn"mismatchPenalty for a nucleotide mismatch (-q)Float-3(defined $value and $value != $vdef) ? " -q $value" : ""("" , " -q "+str(value))[value is not None and value != vdef]matchReward for a nucleotide match (-r)Float1(defined $value and $value != $vdef) ? " -r $value" : ""("" , " -r "+str(value))[value is not None and value != vdef]frameshiftFrame shift penalty (-w)Float(defined $value) ? " -w $value" : ""("", " -w "+str(value))[value is not None]filter_optFiltering and masking options6BLAST 2.0 uses the dust low-complexity filter for blastn and seg for the other programs.If one uses '-F T' then normal filtering by seg or dust (for blastn) occurs (likewise '-F F' means no filtering whatsoever).filterFilter or Masking query sequence (DUST with blastn, SEG with others) (-F)Boolean1($value) ? "" : " -F F"(" -F F" , "")[value]other_filtersFiltering options (Filter must be true)Choice$filter and not $other_masking filter and not other_masking nullnull""""v1" -F C"" -F C"v2" -F \"C;S\""" -F \"C;S\""v3" -F D"" -F D"A coiled-coiled filter, based on the work of Lupas et al. (Science, vol 252, pp. 1162-4 (1991)) written by John Kuzio (Wilson et al., J Gen Virol, vol. 76, pp. 2923-32 (1995))other_maskingMasking options (Filter must be true)Choice$filter and not $other_filters filter and not other_filters nullnull""""v1" -F \"m S\""" -F \"m S\""v2" -F \"m D\""" -F \"m D\""v3" -F \"m C\""" -F \"m C\""v4" -F m"" -F m"For Lower-case masking the lower case filtering must be select.($value eq 'null' or $value eq 'v1' or $value eq 'v2' or $value eq 'v3']) or ($value eq 'v4' and $lower_case)value in ['null', 'v1', 'v2', 'v3'] or (value == 'v4' and lower_case)A coiled-coiled filter, based on the work of Lupas et al. (Science, vol 252, pp. 1162-4 (1991)) written by John Kuzio (Wilson et al., J Gen Virol, vol. 76, pp. 2923-32 (1995)).It is possible to specify that the masking should only be done during the process of building the initial words .If the -U option (to mask any lower-case sequence in the input FASTA file) is used and one does not wish any other filtering, but does wish to mask when building the lookup tables then one should specify: -F 'm'lower_caseUse lower case filtering (-U)Boolean0($value) ? " -U T" : ""("", " -U T")[value]This option specifies that any lower-case letters in the input FASTA file should be masked.selectivity_optSelectivity options7The programs blastn and blastp offer fully gapped alignments. blastx and tblastn have 'in-frame' gapped alignments and use sum statistics to link alignments from different frames. tblastx provides only ungapped alignments.ExpectExpected value (-e)Float10(defined $value and $value != $vdef) ? " -e $value" : ""("" , " -e "+str(value))[value is not None and value != vdef]
The statistical significance threshold for reporting matches against database sequences; the default value is 10, such that 10 matches are expected to be found merely by chance, according to the stochastic model of Karlin and Altschul (1990). If the statistical significance ascribed to a match is greater than the EXPECT threshold, the match will not be reported. Lower EXPECT thresholds are more stringent, leading to fewer chance matches being reported. Fractional values are
acceptable.
word_sizeWord Size (-W)Integer(defined $value) ? " -W $value" : ""("" , " -W "+str(value))[value is not None]Use words of size N.Zero invokes default behaviorDefault values:- 11 for blastn- 3 for othersdist_hitsMultiple Hits window size (-A)Integer(defined $value) ? " -A $value" : ""("" , " -A "+str(value))[value is not None]Generally defaults to 0 (for single-hit extensions), but defaults to 40 when using discontiguous
templates.extend_hitThreshold for extending hits (-f)Float(defined $value)? " -f $value" : ""("" , " -f " + str(value))[ value is not None ]Blast seeks first short word pairs whose aligned score reaches at least this value.Default values:- 0 for blastn- 11 for blastp- 12 for blastx- 13 for tblastn and tblastxdropoff_extentX dropoff value for gapped alignment (-X)Float(defined $value) ? " -X $value" : ""("" , " -X "+str(value))[value is not None]This is the value that control the path graph region explored by Blast during a gapped extension (Xg in the NAR paper) (default for blastp is 15).Default values:- 30 for blastn- 0 for tblastx- 15 for othersdropoff_extent_ungappedX dropoff value for ungapped extention (-y)Float(defined $value and $value != $vdef ) ? " -y $value" : ""("" , " -y "+str(value))[value is not None and value != vdef]- 0.0: default behavior:- 20 for blastn- 7 for othersdropoff_finalX dropoff value for final gapped alignment (-Z)Float(defined $value) ? " -Z $value" : ""("" , " -Z "+str(value))[value is not None]Default values:- 100 for blastn- 0 for tblastx- 25 for otherseff_lenEffective length of the search space (-Y)Integer(defined $value) ? " -Y $value" : ""("" , " -Y "+str(value))[value is not None]Use zero for the real sizekeep_hitsNumber of best hits from a region to keep (-K)Integer(defined $value) ? " -K $value" : ""("" , " -K "+str(value))[value is not None]If this option is used, a value of 100 is recommended.gapped_aligPerform or not gapped alignment (not available with tblastx) (-g)Boolean$blast2 ne "tblastx"blast2 != "tblastx"1($value) ? "" : " -g F "(" -g F " , "")[value]modeSingle-hit or multiple-hit mode (-P)Choice$blast2 ne "blastn"blast2 != "blastn"001($value ne "0") ? " -P $value" : ""("" , " -P "+str(value))[value != "0"]translation_optTranslation options8gc_queryGenetic code used for query translation (-Q)Choice$blast2 =~ /^t?blastx$/blast2 in [ "blastx" , "tblastx" ]11234569101112131415(defined $value and $value ne $vdef) ? " -Q $value" : ""("" , " -Q "+str(value))[value is not None and value != vdef]gc_dbGenetic code used for database translation (-D)Choice$blast2 =~ /^tblast[nx]$/blast2 in [ "tblastn", "tblastx" ]11234569101112131415($value ne $vdef) ? " -D $value" : ""("" , " -D "+str(value))[value != vdef]strandQuery strands to search against database (-S)Choice$blast2 =~ /^(blastn|t?blastx)$/blast2 in [ "blastn" ,"blastx" , "tblastx" ]3123(defined $value and $value ne $vdef) ? " -S $value" : ""("" , " -S "+str(value))[value is not None and value != vdef]affichageReport options9DescriptionsNumber of one-line descriptions to show (-v)Integer500(defined $value and $value != $vdef) ? " -v $value" : ""("" , " -v "+str(value))[value is not None and value != vdef]Maximum number of database sequences for which one-line descriptions will be reported.AlignmentsNumber of database sequences to show alignments (-b)Integer250(defined $value and $value != $vdef) ? " -b $value" : ""("" , " -b "+str(value))[value is not None and value != vdef]Maximum number of database sequences for which high-scoring segment pairs will be reported (-b).view_alignmentsAlignment view options (-m)Choice0012345678(defined $value and $value ne $vdef) ? " -m $value" : ""("" , " -m "+str(value))[value is not None and value != vdef]txtoutputText output fileString$view_alignments ne "7"view_alignments != "7"" -o blast2.txt"" -o blast2.txt"10xmloutputXml output fileString$view_alignments eq "7"view_alignments == "7"" -o blast2.xml"" -o blast2.xml"10htmloutputHtml outputBoolean$view_alignments !~ /^[78]$/view_alignments not in [ "7" , "8" ]1($value) ? " && html4blast -g -o blast2.html blast2.txt" : ""("" , " && html4blast -g -o blast2.html blast2.txt")[value]11txtfileBlast text reportBlastTextReportReport$view_alignments ne "7"view_alignments != "7""blast2.txt""blast2.txt"xmlfileBlast xml reportBlastXmlReportReport$view_alignments eq "7"view_alignments == "7""blast2.xml""blast2.xml"htmlfileBlast html reportBlastHtmlReportReport$view_alignments !~ /^[78]$/view_alignments not in [ "7" , "8" ]"blast2.html""blast2.html"imgfilePictureBinary$view_alignments !~ /^[78]$/view_alignments not in ["7", "8"]"*.png""*.gif""*.png""*.gif"
Programs-5.1.2/hmmconvert.xml 0000644 0001750 0000156 00000016107 11444656032 015071 0 ustar bneron sis
hmmconvertHMMCONVERTConvert profile HMM file to a HMMER formathmmconvert reads an HMM file from oldhmmfile in any HMMER format, and writes it to a new file newhmmfile in a new format. oldhmmfile and newhmmfile must be different files; you can't reliably overwrite the old file. By default, the new HMM file is written in HMMER 2 ASCII format.hmm:buildinghmmconvertoldhmmfileOld HMM ascii fileHmmProfileAbstractTextnot $oldhmmfile oldhmmfile is not None" $oldhmmfile"" " + str(oldhmmfile)Do not enter ascii and bin files at the same timenot defined $oldbinfileoldbinfile is None2oldbinfileOld HMM binary fileHmmProfileBinBinarynot $oldhmmfile oldbinfile is not None" $oldbinfile"" " + str(oldbinfile)Do not enter ascii and bin files at the same timenot defined $oldhmmfileoldhmmfile is None2advancedAdvanced options1new_formatNew formatChoice-a-a-b-2($value)? " $value":""" "+str(value)1outfmtChoose output legacy 3.x file formats by nameChoice$new_format ne '-2'new_format != '-2'3/b3/b3/a($value ne $vdef)? " --outfmt $value":""("", " --outfmt "+str(value))[value !=vdef]Output in a HMMER3 ASCII text format other then the most current one. Valid
choices for the value are '3/b' or '3/a'. The current format is '3/b', and this is the
default. There is a slightly different format '3/a' that was used in some alpha test
code.1result_fileHmm profileHmmProfileAbstractText$new_format eq '-a' or $new_format eq '-2'new_format == '-a' or new_format == '-2'(defined $oldhmmfile)? "> $oldhmmfile.convert": "> $oldbinfile.convert"("> " + str(oldbinfile) + ".convert", "> " + str(oldhmmfile) + ".convert")[oldhmmfile is not None]3(defined $oldhmmfile)? "$oldhmmfile.convert": "$oldbinfile.convert"(str(oldbinfile) + ".convert" , str(oldhmmfile) + ".convert")[oldhmmfile is not None]result_bin_fileHmm profile (binary)HmmProfileBinBinary$new_format eq '-b'new_format == '-b'(defined $oldhmmfile)? "> $oldhmmfile.bin": "> $oldbinfile.bin"("> " + str(oldbinfile) + ".bin", "> " + str(oldhmmfile) + ".bin")[oldhmmfile is not None]3(defined $oldhmmfile)? "$oldhmmfile.bin":"oldbinfile.bin"(str(oldbinfile) + ".bin" , str(oldhmmfile) + ".bin")[oldhmmfile is not None]
Programs-5.1.2/backtranambig.xml 0000644 0001750 0000156 00000021014 12072525233 015461 0 ustar bneron sis
backtranambigEMBOSS6.3.1EMBOSSEuropean Molecular Biology Open Software SuiteRice,P. Longden,I. and Bleasby, A.EMBOSS: The European Molecular Biology Open Software Suite (2000) Rice,P. Longden,I. and Bleasby, A. Trends in Genetics 16, (6) pp276--277http://emboss.sourceforge.net/downloadhttp://emboss.sourceforge.netbacktranambigBack-translate a protein sequence to ambiguous nucleotide sequencehttp://bioweb2.pasteur.fr/docs/EMBOSS/backtranambig.htmlhttp://emboss.sourceforge.net/docs/themessequence:nucleic:translationsequence:protein:compositionbacktranambige_inputInput sectione_sequencesequence optionProteinSequenceEMBLFASTAGCGGENBANKNBRFCODATARAWSWISSPROTGFF1,n("", " -sequence=" + str(value))[value is not None]1e_additionalAdditional sectione_tableGenetic codesChoice00123456910111213141516212223("", " -table=" + str(value))[value is not None and value!=vdef]2e_outputOutput sectione_outfileName of the output file (e_outfile)DNAFilenamebacktranambig.e_outfile("" , " -outfile=" + str(value))[value is not None]3e_osformat_outfileChoose the sequence output formatDNAChoiceFASTAEMBLFASTAGCGGENBANKNBRFCODATARAWSWISSPROTGFF("", " -osformat=" + str(value))[value is not None and value!=vdef]4e_outfile_outoutfile_out optionDNASequencee_outfileautoTurn off any promptingString" -auto -stdout"5
Programs-5.1.2/hmmscan.xml 0000644 0001750 0000156 00000101415 11767572177 014351 0 ustar bneron sis
hmmscanHMMSCANSearch sequence(s) against pfam a profile HMM databasehmmscan reads sequence(s) from seqfile and compares it against all the HMMs in pfam
database looking for significantly similar sequence matches.
The output consists of three sections: a ranked list of the best scoring HMMs,
a list of the best scoring domains in order of their occurrence in the sequence,
and alignments for all the best scoring domains. A sequence score may be higher than a
domain score for the same sequence if there is more than one domain in the sequence;
the sequence score takes into account all the domains. All sequences scoring above the
-E and -T cutoffs are shown in the first list, then every domain found in this list is shown
in the second list of domain hits. If desired, E-value and score thresholds may also be applied
to the domain list using the --domE and --domT options.
hmm:database:searchdatabase:search:hmmhmmscanseqfileSequence fileSequenceFASTA" $value"" "+str(value)3HMMDBHMM databaseChoicePfam-A.hmmPfam-A.hmmPfam-B.hmm" $value"" "+str(value)2thresholds_reportOptions for reporting thresholds1E_value_cutoffE_value cutoff (-E)Floatnot defined $Bit_cutoff and $model_specific ne '--cut_ga' and $model_specific ne '--cut_nc'Bit_cutoff is None and model_specific != '--cut_ga' and model_specific != '--cut_nc'10.0(defined $value and $value != $vdef) ? " -E $value" : ""( "" , " -E " + str(value) )[ value is not None and value != vdef]1In the per-target output, report target profiles with an E-value of <= value. The
default is 10.0, meaning that on average, about 10 false positives will be reported
per query, so you can see the top of the 'noise' and decide for yourself if it's really
noise.Bit_cutoffBit score cutoff (-T)Float$E_value_cutoff == 10.0 and $model_specific ne '--cut_ga' and $model_specific ne '--cut_nc'E_value_cutoff == 10.0 and model_specific != '--cut_ga' and model_specific != '--cut_nc'(defined $value)? " -T $value" : ""( "" , " -T " + str(value) )[ value is not None ]1Instead of thresholding per-profile output on E-value, instead report target profiles
with a bit score of >= value.domEE-value cutoff for the per-domain ranked hit list (--domE)Floatnot defined $domT and $model_specific ne '--cut_ga' and $model_specific ne '--cut_nc'domT is None and model_specific != '--cut_ga' and model_specific != '--cut_nc'10.0(defined $value and $value != $vdef) ? " --domE $value" : ""( "" , " --domE " + str(value) )[ value is not None and value !=vdef ]In the per-domain output, for target profiles that have already satisfied the perprofile
reporting threshold, report individual domains with a conditional E-value
of <= value. The default is 10.0. A 'conditional' E-value means the expected
number of additional false positive domains in the smaller search space of those
comparisons that already satisfied the per-profile reporting threshold (and thus
must have at least one homologous domain already).domTBit score cutoff for the per-domain ranked hit list (--domT)Float$domE == 10.0 and $model_specific ne '--cut_ga' and $model_specific ne '--cut_nc'domE == 10.0 and model_specific != '--cut_ga' and model_specific != '--cut_nc'(defined $value) ? " --domT $value" : ""( "" , " --domT " + str(value) )[ value is not None ]Instead of thresholding per-domain output on E-value, instead report domains with
a bit score of >= value.thresholds_inclusionOptions controlling inclusion (significance) thresholds.1'Inclusion' thresholds are stricter than reporting thresholds. Inclusion thresholds
control which hits are considered to be reliable enough to be included in an output alignment or a
subsequent search round. In hmmscan, which does not have any alignment output nor any iterative
search steps, inclusion thresholds have little effect. They only affect what domains get
marked as significant ('!') or questionable ('?') in domain output.incEInclude sequences lower than this E-value threshold (--incE)Floatnot defined $incT and $model_specific ne '--cut_ga' incT is None and model_specific != '--cut_ga' 0.01(defined $value and value != vdef) ? " --incE $value" : ""( "" , " --incE " + str(value) )[ value is not None and value != vdef]Use an E-value of <= value as the per-target inclusion threshold. The default is
0.01, meaning that on average, about 1 false positive would be expected in every
100 searches with different query sequences.incdomEInclude domains lower than this E-value threshold (--incdomE)Floatdefined $incdomT and not defined model_specificincdomT is not None and model_specific is None0.01(defined $value and value != vdef) ? " --incdomE $value" : ""( "" , " --incdomE " + str(value) )[ value is not None and value != vdef]Use a conditional E-value of <= value as the per-domain inclusion threshold, in
targets that have already satisfied the overall per-target inclusion threshold. The
default is 0.01.incTInclude sequences upper than this score threshold (--incT)Float$incE == 0.01 and $model_specific ne '--cut_ga'incE == 0.01 and model_specific != '--cut_ga'(defined $value) ? " --incT $value" : ""( "" , " --incT " + str(value) )[ value is not None ]Instead of using E-values for setting the inclusion threshold, instead use a bit score
of >= the value as the per-target inclusion threshold. It would be unusual to use bit
score thresholds with hmmscan, because you don't expect a single score threshold
to work for different profiles; different profiles have slightly different expected score
distributions.incdomTInclude domans upper than this score threshold (--incdomT)Float$incdomE == 0.01 and not defined $model_specific incdomE == 0.01 and model_specific is None(defined $value) ? " --incdomT $value" : ""( "" , " --incdomT " + str(value) )[ value is not None ]Instead of using E-values, instead use a bit score of >= value as the per-domain
inclusion threshold. As with --incT above, it would be unusual to use a single bit
score threshold in hmmscan.model_specificOptions for model-specific thresholdingChoicenot defined $Bit_cutoff and not $E_value_cutoff == 10.0 and not defined $incdomT and $incdomE == 0.01not Bit_cutoff and E_value_cutoff == 10.0 and incdomT is None and incdomE == 0.01nullnull--cut_ga--cut_nc--cut_tc(defined $value and $value ne $vdef) ? " $value" : ""( "" , " " + str(value) )[ value is not None and value != vdef]Curated profile databases may define specific bit score thresholds for each profile, superseding any thresholding
based on statistical significance alone. To use these options, the profile must contain the appropriate
(GA, TC, and/or NC) optional score threshold annotation; this is picked up by hmmbuild from Stockholm
format alignment files. Each thresholding option has two scores: the per-sequence threshold x1 value and the
per-domain threshold x2 value. These act as if -T x1 --incT x1 --domT x2 --incdomT x2 has been
applied specifically using each model's curated thresholds.cut ga: Use the GA (gathering) bit scores in the model to set per-sequence (GA1) and
per-domain (GA2) reporting and inclusion thresholds. GA thresholds are generally
considered to be the reliable curated thresholds defining family membership; for
example, in Pfam, these thresholds define what gets included in Pfam Full alignments
based on searches with Pfam Seed models.cut_nc: Use the NC (noise cutoff) bit score thresholds in the model to set per-sequence
(NC1) and per-domain (NC2) reporting and inclusion thresholds. NC thresholds
are generally considered to be the score of the highest-scoring known false positive.cut_tc: Use the NC (trusted cutoff) bit score thresholds in the model to set per-sequence
(TC1) and per-domain (TC2) reporting and inclusion thresholds. TC thresholds are
generally considered to be the score of the lowest-scoring known true positive that
is above all known false positives.accelerationOptions controlling acceleration heuristics1HMMER3 searches are accelerated in a three-step filter pipeline: the MSV filter, the Viterbi filter, and
the Forward filter. The first filter is the fastest and most approximate; the last is the full Forward scoring
algorithm. There is also a 'bias filter' step between MSV and Viterbi. Targets that pass all the steps
in the acceleration pipeline are then subjected to 'postprocessing' -- domain identification and scoring
using the Forward/Backward algorithm. Changing filter thresholds only removes or includes targets from
consideration; changing filter thresholds does not alter bit scores, E-values, or alignments, all of which are
determined solely in 'postprocessing'.maxTurn all heuristic filters off (less speed, more power) (--max)Boolean0($value) ? " --max" : ""( "" , " --max " )[ value ]Turn off all filters, including the bias filter, and run full Forward/Backward postprocessing
on every target. This increases sensitivity somewhat, at a large cost in speed.F1Stage 1 (MSV) thresholdFloatnot maxnot max0.02(defined $value and $value != $vdef ) ? " --F1 $value" : ""( "" , " --F1 " + str(value) )[ value is not None and value != vdef]Set the P-value threshold for the MSV filter step. The default is 0.02, meaning that
roughly 2% of the highest scoring nonhomologous targets are expected to pass the filter.F2Stage 1 (Vit) thresholdFloatnot maxnot max0.001(defined $value and $value != $vdef ) ? " --F2 $value" : ""( "" , " --F2 " + str(value) )[ value is not None and value != vdef]Set the P-value threshold for the Viterbi filter step. The default is 0.001.F3Stage 1 (Fwd) thresholdFloatnot maxnot max0.00001(defined $value and $value != $vdef ) ? " --F3 $value" : ""( "" , " --F3 " + str(value) )[ value is not None and value != vdef]Set the P-value threshold for the Forward filter step. The default is 1e-5.nobiasTurn off composition bias filter (--nobias)Booleannot maxnot max0($value) ? " --nobias" : ""( "" , " --nobias " )[ value ]Turn off the bias filter. This increases sensitivity somewhat, but can come at a
high cost in speed, especially if the query has biased residue composition (such
as a repetitive sequence region, or if it is a membrane protein with large regions
of hydrophobicity). Without the bias filter, too many sequences may pass the filter
with biased queries, leading to slower than expected performance as the computationally
intensive Forward/Backward algorithms shoulder an abnormally heavy
load.expertOther expert options1nonull2Turn off biased composition score corrections (--nonull2)Boolean0($value) ? " --nonull2" : ""( "" , " --nonull2 " )[ value ]Turn off the 'null2' score corrections for biased composition.E_value_calculationControl of E_value calculation (-Z)Integer(defined $value) ? " -Z $value" : ""( "" , " -Z " + str(value) )[ value is not None ]1Assert that the total number of targets in your searches is the value, for the purposes of
per-sequence E-value calculations, rather than the actual number of targets seen.domZSet Z score of significant sequences, for domain E-value calculation (--domZ)Float(defined $value) ? " --domZ $value" : ""( "" , " --domZ " + str(value) )[ value is not None ]Assert that the total number of targets in your searches is the value, for the purposes
of per-domain conditional E-value calculations, rather than the number of targets
that passed the reporting thresholds.seedSet RNG seed number (--seed)Integer42(defined $value and $value != $vdef) ? " --seed $value " : ""( "" , " --seed " + str(value) )[ value is not None and value !=vdef ]Set the random number seed to value. Some steps in postprocessing require Monte
Carlo simulation. The default is to use a fixed seed (42), so that results are exactly
reproducible. Any other positive integer will give different (but also reproducible)
results. A choice of 0 uses a 'randomly chosen' seed.Enter a value >= 00 <= $value0 <= valuecontrolOutputOptions controlling outputoutfile_nameName of the sequence(s) file (-o)Filename(defined $value ) ? " -o $value" : ""( " " , " -o " + str(value) )[ value is not None ]1output_file_nameOutput fileTextdefined $outfile_nameoutfile_name is not None$outfile_namestr(outfile_name)perseqfile_nameFile name of parseable table of per-sequence hits (--tblout)Filename(defined $value) ? " --tblout $value" : ""( "" , " --tblout " + str(value) )[ value is not None ]Save a simple tabular (space-delimited) file summarizing the 'per-target' output,
with one data line per homologous target model found1output_perseqfile_nameOutput parseable table of per-sequence hitsText$perseqfile_nameperseqfile_name$perseqfile_namestr(perseqfile_name)perdomfile_nameFile name of parseable table of per-domain hits (--domtblout)Filename(defined $value) ? " --domtblout $value" : ""( "" , " --domtblout " + str(value) )[ value is not None ]1Save a simple tabular (space-delimited) file summarizing the 'per-domain' output,
with one data line per homologous domain detected in a query sequence for each
homologous model.accPrefer accessions over names in outputBoolean0($value) ? " --acc " : ""( "" , " --acc " )[ value ]Use accessions instead of names in the main output, where available for profiles
and/or sequencesnoaliDon't output alignments, so output is smallerBoolean0($value) ? " --noali " : ""( "" , " --noali " )[ value ]Omit the alignment section from the main output. This can greatly reduce the
output volume.notextwUnlimit ASCII text output line width (--notextw)Booleantextw == 120textw == 1200($value) ? " --notextw " : ""( "" , " --notextw " )[ value ]Unlimit the length of each line in the main output. The default is a limit of 120
characters per line, which helps in displaying the output cleanly on terminals and
in editors, but can truncate target profile description lines.textwSet max width of ASCII text output lines (--textw)Integer120(defined $value and $value != $vdef) ? " --textw $value " : ""( "" , " --textw " + str(value) )[ value is not None and value !=vdef ]Set the main output's line length limit to value> characters per line. The default is
120.output_perdomfile_nameOutput parseable table of per-domain hitsText$perdomfile_nameperdomfile_name$perdomfile_namestr(perdomfile_name)
Programs-5.1.2/dssp.xml 0000644 0001750 0000156 00000014127 12265225132 013653 0 ustar bneron sis
dssp2000DSSPDefinition of secondary structure of proteins given a set of 3D coordinatesW.Kabsch, C. SanderKabsch,W. and Sander,C. (1983) Biopolymers 22, 2577-2637.http://swift.cmbi.ru.nl/gv/dssp/ftp://ftp.cmbi.ru.nl/pub/molbio/software/sequence:protein:2D_structurestructure:2D_structuredssppdbfilePDB FileAbstractText_3DStructurePDB(defined $value) ? " $value" : " -- "( " -- " , " " + str(value) )[ value is not None ]You must enter either the PDB data or the PDB idnot defined $pdbid and defined $pdbfilepdbfile is not None and pdbid is None10pdbidor you can instead enter a PDB id.String(defined $value) ? "cat pdb$value.ent | " : ""( "" , "cat pdb" + str( value ).lower() + ".ent | " )[ value is not None ]You must enter either the PDB data or the PDB iddefined $pdbid and not defined $pdbfilepdbid is not None and pdbfile is None-1outputOutput parameterssurfaceDisables the calculation of accessible surface (-na)Boolean0($value) ? " -na " : ""( "" , " -na " )[ value ]1classicClassic (pre-July 1995) format (-c)Boolean0($value) ? " -c " : ""( "" , " -c " )[ value ]1disulfideAdds information about disulfide bonds to output file (-ssa)Boolean0($value) ? " -ssa " : ""( "" , " -ssa " )[ value ]1sidechains2XRenames residues with incomplete sidechains to 'X' (-x)Boolean0($value) ? " -x " : ""( "" , " -x " )[ value ]1altLocKeeps an additional AltLoc indicator at the line ends (-alt2)Boolean0($value) ? " -alt2 " : ""( "" , " -alt2 " )[ value ]1outfileStandard outputDsspReportReport"dssp.out""dssp.out"
Programs-5.1.2/hmmalign.xml 0000644 0001750 0000156 00000026610 11767572177 014522 0 ustar bneron sis
hmmalignHMMALIGNAlign sequences to a profile HMMhmm:alignment:multiplealignment:multiple:hmmhmmcmdString"hmmalign""hmmalign"0infileInput filefileInput sequences file3Input sequence file may be in any unaligned or aligned file format. If it is
in a multiple alignment format (e.g. Stockholm, MSF, ClustalW), the existing alignment is
ignored (i.e., the
sequences are read as if they were unaligned).seqfileSequences FileSequenceFASTAEMBLGenbankUniprot2,n" $seqfile" " " + str(seqfile) hmmfileProfile HMM fileHmmProfileAbstractText" $value"" "+str(value)2expert_optionsExpert Options1biotypeAssert sequences and hmm files are both in same typeChoicenullnull--amino--dna--rna(defined $value and $value ne $vdef)? " $value" : ""("", " " + str(value))[ value is not None and value != vdef]The alphabet type (amino, DNA, or RNA) is autodetected by default, by looking at the composition of the
msafile. Autodetection is normally quite reliable, but occasionally alphabet type may be ambiguous and
autodetection can fail (for instance, on tiny toy alignments of just a few residues). To avoid this, or to
increase robustness in automated analysis pipelines, you may specify the alphabet type of msafile with
these options.Protein: Specify that all sequences in seqfile are proteins. By default, alphabet type is
autodetected from looking at the residue composition.DNA: Specify that all sequences in seqfile are DNAs.RNA: Specify that all sequences in seqfile are RNAs.common_optionsCommon Options1allcolInclude all consensus columns in ali, even if all gapsBoolean0($value) ? " --allcol : ""( "" , " --allcol " )[ value ]Include columns in the output alignment for every match (consensus) state in
the hmmfile, even if it means having all-gap columns. This is useful in analysis
pipelines that need to be able to maintain a predetermined profile HMM architecture
(with an unchanging number of consensus columns) through an hmmalign
step.mapaliInclude alignment fileAlignmentSTOCKHOLM(defined $value) ? " --mapali $value" : ""( "" , " --mapali " + str(value) )[ value is not None ]Merge the existing alignment in file into the result, where the file is exactly the
same alignment that was used to build the model in hmmfile. This is done using
a 'map' of alignment columns to consensus profile positions that is stored in the
hmmfile. The multiple alignment in the file will be exactly reproduced in its consensus
columns (as defined by the profile), but the displayed alignment in insert columns
may be altered, because insertions relative to a profile are considered by convention
to be unaligned data.output_optionsOutput Options1trimTrim terminal tails of nonaligned residues from alignmentBoolean0($value) ? " --trim" : ""( "", " --trim" )[ value ]Trim nonhomologous residues (assigned to N and C states in the optimal alignments)
from the resulting multiple alignment output.outfile_nameOutfile name (-o)Filename(defined $value)? " -o $value" : ""("", " -o " + str(value))[value is not None]outputfileAlignment fileAlignmentdefined($outfile_name)outfile_name is not None"$outfile_name"str(outfile_name)outfileAlignment fileAlignmentnot defined($outfile_name)outfile_name is None"hmmalign.out""hmmalign.out"outputformatOutput formatChoiceSTOCKHOLMSTOCKHOLMPfamA2MPSIBLAST(defined $value and $value ne $vdef) ? " --outformat $value" : ""( "" , " --outformat " + str( value) )[ value is not None and value != vdef ]Specify that the msafile is in the selected format. Currently the accepted multiple
alignment sequence file formats only include Stockholm and SELEX. Default is to autodetect
the format of the file.
Programs-5.1.2/iep.xml 0000644 0001750 0000156 00000033343 12072525233 013461 0 ustar bneron sis
iepEMBOSS6.3.1EMBOSSEuropean Molecular Biology Open Software SuiteRice,P. Longden,I. and Bleasby, A.EMBOSS: The European Molecular Biology Open Software Suite (2000) Rice,P. Longden,I. and Bleasby, A. Trends in Genetics 16, (6) pp276--277http://emboss.sourceforge.net/downloadhttp://emboss.sourceforge.netiepCalculate the isoelectric point of proteinshttp://bioweb2.pasteur.fr/docs/EMBOSS/iep.htmlhttp://emboss.sourceforge.net/docs/themessequence:protein:compositioniepe_inputInput sectione_sequencesequence optionProteinSequenceEMBLFASTAGCGGENBANKNBRFCODATARAWSWISSPROTGFF1,n("", " -sequence=" + str(value))[value is not None]1e_additionalAdditional sectione_aminoNumber of n-termini (value greater than or equal to 0)Integer1("", " -amino=" + str(value))[value is not None and value!=vdef]Value greater than or equal to 0 is requiredvalue >= 02e_carboxylNumber of c-termini (value greater than or equal to 0)Integer1("", " -carboxyl=" + str(value))[value is not None and value!=vdef]Value greater than or equal to 0 is requiredvalue >= 03e_terminiInclude charge at n and c terminusBoolean1(" -notermini", "")[ bool(value) ]4e_lysinemodifiedNumber of modified lysines (value greater than or equal to 0)Integer0("", " -lysinemodified=" + str(value))[value is not None and value!=vdef]Value greater than or equal to 0 is requiredvalue >= 05e_disulphidesNumber of disulphide bridges (value greater than or equal to 0)Integer0("", " -disulphides=" + str(value))[value is not None and value!=vdef]Value greater than or equal to 0 is requiredvalue >= 06e_advancedAdvanced sectione_stepStep value for ph (value from .01 to 1.)Float.5("", " -step=" + str(value))[value is not None and value!=vdef]Value greater than or equal to .01 is requiredvalue >= .01Value less than or equal to 1. is requiredvalue <= 1.7e_outputOutput sectione_plotPlot charge vs phBoolean0("", " -plot")[ bool(value) ]8e_reportWrite results to a fileBoolean1(" -noreport", "")[ bool(value) ]9e_graphChoose the e_graph output formatChoicee_plotpngpnggifcpspsmetadata(" -graph=" + str(vdef), " -graph=" + str(value))[value is not None and value!=vdef]10xy_goutfileName of the output graphFilenamee_plotiep_xygraph("" , " -goutfile=" + str(value))[value is not None]11xy_outgraph_pngGraph filePictureBinarye_plot and e_graph == "png""*.png"xy_outgraph_gifGraph filePictureBinarye_plot and e_graph == "gif""*.gif"xy_outgraph_psGraph filePostScriptBinarye_plot and e_graph == "ps" or e_graph == "cps""*.ps"xy_outgraph_metaGraph filePictureBinarye_plot and e_graph == "meta""*.meta"xy_outgraph_dataGraph fileTexte_plot and e_graph == "data""*.dat"e_outfileName of the output file (e_outfile)Filenamee_reportiep.e_outfile("" , " -outfile=" + str(value))[value is not None]12e_outfile_outoutfile_out optionIepReportReporte_outfileautoTurn off any promptingString" -auto -stdout"13
Programs-5.1.2/primersearch.xml 0000644 0001750 0000156 00000011746 12072525233 015373 0 ustar bneron sis
primersearchEMBOSS6.3.1EMBOSSEuropean Molecular Biology Open Software SuiteRice,P. Longden,I. and Bleasby, A.EMBOSS: The European Molecular Biology Open Software Suite (2000) Rice,P. Longden,I. and Bleasby, A. Trends in Genetics 16, (6) pp276--277http://emboss.sourceforge.net/downloadhttp://emboss.sourceforge.netprimersearchSearch DNA sequences for matches with primer pairshttp://bioweb2.pasteur.fr/docs/EMBOSS/primersearch.htmlhttp://emboss.sourceforge.net/docs/themessequence:nucleic:primersprimersearche_inputInput sectione_seqallseqall optionDNASequenceEMBLFASTAGCGGENBANKNBRFCODATARAWSWISSPROTGFF1,n("", " -seqall=" + str(value))[value is not None]1e_infilePrimer pairs filePrimerPairsAbstractText("", " -infile=" + str(value))[value is not None]2e_requiredRequired sectione_mismatchpercentAllowed percent mismatchInteger0("", " -mismatchpercent=" + str(value))[value is not None and value!=vdef]3e_outputOutput sectione_outfileName of the output file (e_outfile)Filenameprimersearch.e_outfile("" , " -outfile=" + str(value))[value is not None]4e_outfile_outoutfile_out optionPrimer3ReportReporte_outfileautoTurn off any promptingString" -auto -stdout"5
Programs-5.1.2/seq-gen.xml 0000644 0001750 0000156 00000067716 12265214442 014257 0 ustar bneron sis
seq-gen1.3.2SeqGenSequence-GeneratorA. Rambaut, N. C. GrasslyRambaut, A. and Grassly, N. C. (1996) Seq-Gen: An application for the Monte Carlo simulation of DNA sequence evolution along phylogenetic trees. Comput. Appl. Biosci.http://bioweb2.pasteur.fr/docs/seq-gen/index.htmlSeq-Gen is a program that will simulate the evolution of nucleotide or amino acid sequences along a phylogeny, using common models of the substitution process.http://tree.bio.ed.ac.uk/software/seqgen/http://tree.bio.ed.ac.uk/download.html?name=seqgen&version=v1.3.2&id=41&num=1phylogeny:likelihoodseqgenString"seq-gen""seq-gen"0intreeInput tree fileTreeNEWICKNEXUS"< $value""< " + str(value)0inputInput parameters1LengthSequence length (-l)Integer1000(defined $value and $value != $vdef)? " -l $value":""("" , " -l " + str(value))[ value is not None and value != vdef]This option allows the user to set the length in nucleotides that each simulated sequence should be.datasetsNumber of simulated datasets per tree (-n)Integer1(defined $value and $value != $vdef)? " -n $value":""("" , " -n " + str(value))[ value is not None and value != vdef]This option specifies how many separate datasets should be simulated for each tree in the tree file.partition_numbNumber of partitions for each dataset (-p)Integer1(defined $value and $value != $vdef)? " -p $value":""("" , " -p " + str(value))[ value is not None and value != vdef]Number of partion specifies how many partitions of each data set should be simulated. each partition must have its own tree and number specifying how many sites are in partition. Multiple sets of trees are being inputed with varying numbers of partitions, then this should specify the maximum number of partitions that will be requiredscale_branchScale branch lengths (number greater > 0) (-s)Floatnot defined $scale_treescale_tree is None1.0(defined $value and $value != $vdef)? " -s $value":""("" , " -s " + str(value))[ value is not None and value != vdef]Value greater than 0 is required$value > 0value > 0This option allows the user to set a value with which to scale the branch lengths in order to make them equal the expected number of substitutions per site for each branch. Basically Seq-Gen multiplies each branch length by this value.For example if you give an value of 0.5 then each branch length would be halved before using it to simulate the sequences.scale_treeTotal tree scale (a decimal number greater > 0) (-d)Float$scale_branch != 1.0scale_branch != 1.0 (defined $value)? " -d $value":""("" , " -d " + str(value))[ value is not None ]Value greater than 0 is required$value > 0value > 0This option allows the user to set a value which is the desired length of each tree in units of substitutions per site. The term 'tree length' here is the distance from the root to any one of the tips in units of mean number of substitutions per site. This option can only be used when the input trees are rooted and ultrametric (no difference in rate amongst the lineages). This has the effect of making all the trees in the input file of the same length before simulating data. The option multiplies each branch length by a value equal to SCALE divided by the actual length of the tree.input_seqAncestral Sequence number (-k)Integer(defined $value)? " -k $value":""("" , " -k " + str(value))[ value is not None ]This option allows the user to use a supplied sequence as the ancestral sequence at the root (otherwise a random sequence is used). The value is an integer number greater than zero which refers to one of the sequences supplied as input with the tree.Method: The user can supply a sequence alignment as input, as well as the trees. This should be in relaxed PHYLIP format. The trees can then be placed in this file at the end, after a line stating how many trees there are. The file may look like this: 4 50Taxon1 ATCTTTGTAGTCATCGCCGTATTAGCATTCTTAGATCTAATaxon2 ATCCTAGTAGTCGCTTGCGCACTAGCCTTCCGAAATCTAGTaxon3 ACTTCTGTGTTTACTGAGCTACTAGCTTCCCTAAATCTAGTaxon4 ATTCCTATATTCGCTAATTTCTTAGCTTTCCTGAATCTGG1(((Taxon1:0.2,Taxon2:0.2):0.1,Taxon3:0.3):0.1,Taxon4:0.4);Note that the labels in the alignment do not have to match those in the tree (the ones in the tree will be used for output) there doesn't even have to be the same number of taxa in the alignment as in the trees. The sequence length supplied by the alignment will be used to obtain the simulated sequence length (unless the l option is set). The k option also refers to one of the sequences to specify the ancestral sequence. (see Appendix A)substitutionSubstitution model options1modelModel of substitution (-m)ChoiceJTTJTTWAGPAMBLOSUMMTREVGENERALF84HKYGTR(defined $value )? " -m $value" : "" ( "" , " -m " + str(value) )[ value is not None ]shapeShape of the gamma distribution to use with gamma rate heterogeneity (-a)Float(defined $value)? " -a $value":""("" , " -a " + str(value))[ value is not None ]1Using this option the user may specify a shape for the gamma rate heterogeneity. The default is no site-specific rate heterogeneity. Enter a decimal number.categoriesNumber of categories for the discrete gamma rate heterogeneity model (-g)Integer(defined $value)? " -g $value":""("" , " -g " + str(value))[ value is not None ]Enter an integer number between 2 and 32$value >= 2 and $value <= 32value >= 2 and value <= 32 1Using this option the user may specify the number of categories for the discrete gamma rate heterogeneity model. The default is no site-specific rate heterogeneity (or the continuous model if only the -a option is specified. Enter an integer number between 2 and 32invar_siteProportion of sites that should be invariable (-i)Float0.0(defined $value and $value != $vdef)? " -i $value":""("" , " -i " + str(value))[ value is not None and value != vdef]Enter a real number between 0.0 and 1.0 $value >= 0.0 and $value <= 1.0value >= 0.0 and value <= 1.01Specify the proportion of sites that should be invariable. These sites will be chosen randomly with this expected frequency. The default is no invariable sites. Invariable sites are sites thar cannot change as opposed to sites which don't exhibit any changes due to chance (and perhaps a low rate). Enter a real number between 0.0 and 1.0nucleotide_optNucleotid model specific options1rateRates for codon position heterogeneity (-c)Using this option the user may specify the relative rates for each codon position. This allows codon-specific rate heterogeneity to be simulated. The default is no site-specific rate heterogeneity.You can only have codon rates when using nucleotide models of substitution.rate1First positionFloat""""Enter a decimal numberYou can only have codon rates when using nucleotide models of substitution.rate2Second position Float""""Enter a decimal numberrate3Third position (enter a decimal number)Float""""Enter a decimal numberrateAllRatesFloatdefined $rate1 and defined $rate2 and defined $rate3rate1 is not None and rate2 is not None and rate3 is not None" -c $rate1 $rate2 $rate3"" -c %f %f %f " %(rate1,rate2,rate3)transratioTransition transversion ratio (TS/TV) for HKY or F84 model (-t)Float$model eq 'HKY' or $model eq 'F84'model =='HKY' or model == 'F84'(defined $value)? " -t $value":""("" , " -t " + str(value))[ value is not None ]This option allows the user to set a value for the transition transversion ratio (TS/TV). This is only valid when either the HKY or F84 model has been selected.matrix6 values for the general reversable model's rate matrix (ACTG x ACTG) separated by one space (-r)String1.0,1.0,1.0,1.0,1.0,1.0(defined $value and $value ne $vdef)? " -r $value":""("" , " -r " + str(value))[ value is not None and value != vdef]1This option allows the user to set 6 values for the general reversable model's rate matrix. This is only valid when either the REV model has been selected.The values are six decimal numbers for the rates of transition from A to C, A to G, A to T, C to G, C to T and G to T respectively, separated by spaces or commas. The matrix is symmetrical so the reverse transitions equal the ones set (e.g. C to A equals A to C) and therefore only six values need be set. These values will be scaled such that the last value (G to T) is 1.0 and the others are set relative to this.frequenciesRelative frequencies of nucleotides (-f)1This option is used to specify the relative frequencies of the four nucleotides. By default, Seq-Gen will assume these to be equal. If the given values don't sum to 1.0 then they will be scaled so that they do.You must give the frequencies for the 4 nucleotidesfreqAFrequencies of the A nucleotideFloat""""freqCFrequencies of the C nucleotide Float""""freqGFrequencies of the G nucleotideFloat""""freqTFrequencies of the T nucleotideFloat""""freqAllFrequenciesStringdefined $freqA and defined $freqC and defined $freqG and defined $freqTfreqA is not None and freqC is not None and freqG is not None and freqT is not None" -f $freqA,$freqC,$freqG,$freqT"" -f " + str(freqA) + "," + str(freqC) + "," + str(freqG) + "," + str(freqT)miscellaneous_optMiscellaneous options1random_seedRandom number seed (-z)Integer(defined $value)? "-z $value":""("" , "-z " + str(value))[ value is not None ]This option allows to specify a seed for the random number generator. Using the same seed (with the same input) will result in identical simulated datasets. This is useful because you can recreate a set of simulations, you must use exactly the same model optionsoutputOutput parameters1output_formatOutput file format (-o)Choicepprn(defined $value and $value ne $vdef)? " -o$value":""("" , " -o" + str(value))[ value is not None and value != vdef]quietNon verbose output (-q)Boolean0($value)? " -q":""("" , " -q")[ value ]1write_ancestWrite the ancestral sequences (-wa)Boolean0($value)? " -wa":""("" , " -wa")[ value ]This option allows to obtain the sequences for each of the internal nodes in the tree. The sequences are written out along with the sequences for the tips of the tree in relaxed PHYLIP format.write_sitesWrite the sites rates (-wr)Boolean0($value)? " -wr":""("" , " -wr")[ value ]This option allows to obtain the relative
rate of substitution for each sites as used in each
simulation. This will go to stderr and will be produced
for each replicate simulation.outfileOutput alignment fileAlignmentPHYLIPIRPHYLIPNEXUS"seqgen.out""seqgen.out"
Programs-5.1.2/clustalw-sequence.xml 0000644 0001750 0000156 00000110762 12073003734 016347 0 ustar bneron sis
clustalw-sequenceClustalw: Sequence to Profile alignmentsSequentially add profile2 sequences to profile1 alignmentalignment:multipleclustalw -sequencesprofileProfile Alignments parameters2By PROFILE ALIGNMENT, we mean alignment using existing alignments. Profile alignments allow you to store alignments of your favorite sequences and add new sequences to them in small bunches at a time. (e.g. an alignment output file from CLUSTAL W). One or both sets of input sequences may include secondary structure assignments or gap penalty masks to guide the alignment.Merge 2 alignments by profile alignmentprofile1Profile 1AlignmentCLUSTAL(defined $value) ? " -profile1=$value" : ""( "" , " -profile1=" + str( value ) )[value is not None]profile2Profile 2SequenceFASTA1,n(defined $value) ? " -profile2=$value" : ""( "" , " -profile2=" + str( value ) )[value is not None]general_settingsGeneral settings3typeseqProtein or DNA (-type)Choiceautoautoproteindna(defined $value) ? " -type=$value" : ""("", " -type="+str(value))[value is not None]quicktreeToggle Slow/Fast pairwise alignments (-quicktree)Choiceslowslowfast($value eq "fast") ? " -quicktree" : ""( "" , " -quicktree")[ value == "fast"]slow: by dynamic programming (slow but accurate)fast: method of Wilbur and Lipman (extremely fast but approximate)fastpwFast Pairwise Alignments parameters$quicktree eq "fast"quicktree == "fast"2These similarity scores are calculated from fast, approximate, global alignments, which are controlled by 4 parameters. 2 techniques are used to make these alignments very fast: 1) only exactly matching fragments (k-tuples) are considered; 2) only the 'best' diagonals (the ones with most k-tuple matches) are used.ktupleWord size (-ktuple)Integer1(defined $value and $value != $vdef) ? " -ktuple=$value" : ""( "" , " -ktuple=" + str( value ) )[value is not None and value != vdef ]2K-TUPLE SIZE: This is the size of exactly matching fragment that is used. INCREASE for speed (max= 2 for proteins; 4 for DNA), DECREASE for sensitivity. For longer sequences (e.g. >1000 residues) you may need to increase the default.topdiagsNumber of best diagonals (-topdiags)Integer5(defined $value and $value != $vdef) ? " -topdiags=$value" : ""( "" , " -topdiags=" + str( value ))[value is not None and value != vdef ]2The number of k-tuple matches on each diagonal (in an imaginary dot-matrix plot) is calculated. Only the best ones (with most matches) are used in the alignment. This parameter specifies how many. Decrease for speed; increase for sensitivity.windowWindow around best diags (-window)Integer5(defined $value and $value != $vdef) ? " -window=$value" : ""( "" , " -window=" + str( value ) )[ value is not None and value != vdef ]2WINDOW SIZE: This is the number of diagonals around each of the 'best' diagonals that will be used. Decrease for speed; increase for sensitivitypairgapGap penalty (-pairgap)Float3(defined $value and $value != $vdef) ? " -pairgap=$value" : ""( "" , " -pairgap=" + str( value ))[ value is not None and value != vdef ]2This is a penalty for each gap in the fast alignments. It has little affect on the speed or sensitivity except for extreme values.scorePercent or absolute score ? (-score)Choicepercentpercentabsolute(defined $value and $value ne $vdef) ? " -score=$value" : ""( "" , " -score=" +str( value ) )[value is not None and value !=vdef]2slowpwSlow Pairwise Alignments parameters$quicktree eq "slow"quicktree == "slow"2These parameters do not have any affect on the speed of the alignments. They are used to give initial alignments which are then rescored to give percent identity scores. These % scores are the ones which are displayed on the screen. The scores are converted to distances for the trees.pwgapopenGap opening penalty (-pwgapopen)Float10.00(defined $value and $value != $vdef) ? " -pwgapopen=$value" : ""( "" , " -pwgapopen=" + str( value ) )[ value is not None and value != vdef ]pwgapextGap extension penalty (-pwgapext)Float0.10(defined $value and $value != $vdef) ? " -pwgapext=$value" : ""( "" , " -pwgapext=" + str( value ) )[ value is not None and value != vdef ]slowpw_protProtein parameters$typeseq eq "protein"typeseq == "protein"pwmatrixProtein weight matrix (-pwmatrix)Choicegonnetblosumgonnetpamid(defined $value and $value ne $vdef) ? " -pwmatrix=$value" : ""( "" , " -pwmatrix=" + str(value) )[value is not None and value != vdef ]The scoring table which describes the similarity of each amino acid to each other. For DNA, an identity matrix is used.BLOSUM (Henikoff). These matrices appear to be the best available for carrying out data base similarity (homology searches). The matrices used are: Blosum80, 62, 40 and 30.The Gonnet Pam 250 matrix has been reported as the best single matrix for alignment, if you only choose one matrix. Our experience with profile database searches is that the Gonnet series is unambiguously superior to the Blosum series at high divergence. However, we did not get the series to perform systematically better than the Blosum series in Clustal W (communication of the authors).PAM (Dayhoff). These have been extremely widely used since the late '70s. We use the PAM 120, 160, 250 and 350 matrices.slowpw_dnaDNA parameters$typeseq eq "dna"typeseq == "dna"pwdnamatrixDNA weight matrix (-pwdnamatrix)Choiceiubiubclustalw(defined $value and $value ne $vdef) ? " -pwdnamatrix=$value" : ""( "" , " -pwdnamatrix=" + str(value) )[ value is not None and value != vdef ]For DNA, a single matrix (not a series) is used. Two hard-coded matrices are available:1) IUB. This is the default scoring matrix used by BESTFIT for the comparison of nucleic acid sequences. X's and N's are treated as matches to any IUB ambiguity symbol. All matches score 1.9; all mismatches for IUB symbols score 0.2) CLUSTALW(1.6). The previous system used by ClustalW, in which matches score 1.0 and mismatches score 0. All matches for IUB symbols also score 0.structureStructure Alignments parameters2These options, when doing a profile alignment, allow you to set 2D structure parameters. If a solved structure is available, it can be used to guide the alignment by raising gap penalties within secondary structure elements, so that gaps will preferentially be inserted into unstructured surface loops. Alternatively, a user-specified gap penalty mask can be supplied directly.A gap penalty mask is a series of numbers between 1 and 9, one per position in the alignment. Each number specifies how much the gap opening penalty is to be raised at that position (raised by multiplying the basic gap opening penalty by the number) i.e. a mask figure of 1 at a position means no change in gap opening penalty; a figure of 4 means that the gap opening penalty is four times greater at that position, making gaps 4 times harder to open.Gap penalty masks is to be supplied with the input sequences. The masks work by raising gap penalties in specified regions (typically secondary structure elements) so that gaps are preferentially opened in the less well conserved regions (typically surface loops).CLUSTAL W can read the masks from SWISS-PROT, CLUSTAL or GDE format input files. For many 3-D protein structures, secondary structure information is recorded in the feature tables of SWISS-PROT database entries. You should always check that the assignments are correct - some are quite inaccurate. CLUSTAL W looks for SWISS-PROT HELIX and STRAND assignments e.g.FT HELIX 100 115FT HELIX 100 115The structure and penalty masks can also be read from CLUSTAL alignment format as comment lines beginning !SS_ or GM_ e.g.!SS_HBA_HUMA ..aaaAAAAAAAAAAaaa.aaaAAAAAAAAAAaaaaaaAaaa.........aaaAAAAAA!GM_HBA_HUMA 112224444444444222122244444444442222224222111111111222444444HBA_HUMA VLSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHFDLSHGSAQVKGHGKNote that the mask itself is a set of numbers between 1 and 9 each of which is assigned to the residue(s) in the same column below. In GDE flat file format, the masks are specified as text and the names must begin with SS_ or GM_. Either a structure or penalty mask or both may be used. If both are included in an alignment, the user will be asked which is to be used.nosecstr1Do not use secondary structure-gap penalty mask for profile 1 (-nosecstr1)Boolean0($value) ? " -nosecstr1" : ""( "" , " -nosecstr1")[ value ]2This option controls whether the input secondary structure information or gap penalty masks will be used.nosecstr2Do not use secondary structure-gap penalty mask for profile 2 (-nosecstr2)Boolean0($value) ? " -nosecstr2" : ""( "" , " -nosecstr2")[ value ]This option controls whether the input secondary structure information or gap penalty masks will be used.helixgapHelix gap penalty (-helixgap)Integer4(defined $value and $value != $vdef) ? " -helixgap=$value" : ""( "" , " -helixgap=" + str( value ) )[ value is not None and value != vdef ]This option provides the value for raising the gap penalty at core Alpha Helical (A) residues. In CLUSTAL format, capital residues denote the A and B core structure notation. The basic gap penalties are multiplied by the amount specified.strandgapStrand gap penalty (-strandgap)Integer4(defined $value and $value != $vdef) ? " -strandgap=$value" : ""( "" , " -strandgap=" + str( value ) )[ value is not None and value != vdef ]This option provides the value for raising the gap penalty at Beta Strand (B) residues. In CLUSTAL format, capital residues denote the A and B core structure notation. The basic gap penalties are multiplied by the amount specified.loopgapLoop gap penalty (-loopgap)Integer1(defined $value and $value != $vdef) ? " -loopgap=$value" : ""( "" , " -loopgap=" + str( value ) )[ value is not None and value != vdef ]This option provides the value for the gap penalty in Loops. By default this penalty is not raised. In CLUSTAL format, loops are specified by . in the secondary structure notation.terminalgapSecondary structure terminal penalty (-terminalgap)Integer2(defined $value and $value != $vdef) ? " -terminalgap=$value" : ""( "" , " -terminalgap=" + str( value ) )[ value is not None and value != vdef ]This option provides the value for setting the gap penalty at the ends of secondary structures. Ends of secondary structures are observed to grow and-or shrink in related structures. Therefore by default these are given intermediate values, lower than the core penalties. All secondary structure read in as lower case in CLUSTAL format gets the reduced terminal penalty.helixendinHelix terminal positions: number of residues inside helix to be treated as terminal (-helixendin)Integer3(defined $value and $value != $vdef) ? " -helixendin=$value" : ""( "" , " -helixendin=" + str( value ) )[ value is not None and value != vdef ]This option (together with the -helixendin) specify the range of structure termini for the intermediate penalties. In the alignment output, these are indicated as lower case. For Alpha Helices, by default, the range spans the end helical turn.helixendoutHelix terminal positions: number of residues outside helix to be treated as terminal (-helixendout)Integer0(defined $value and $value != $vdef) ? " -helixendout=$value" : ""( "" , " -helixendout=" + str( value ) )[ value is not None and value != vdef ]This option (together with the -helixendin) specify the range of structure termini for the intermediate penalties. In the alignment output, these are indicated as lower case. For Alpha Helices, by default, the range spans the end helical turn.strandendinStrand terminal positions: number of residues inside strand to be treated as terminal (-strandendin)Integer1(defined $value and $value != $vdef) ? " -strandendin=$value" : ""( "" , " -strandendin=" + str( value ) )[ value is not None and value != vdef ]This option (together with the -strandendout option) specify the range of structure termini for the intermediate penalties. In the alignment output, these are indicated as lower case. For Beta Strands, the default range spans the end residue and the adjacent loop residue, since sequence conservation often extends beyond the actual H-bonded Beta Strand.strandendoutStrand terminal positions: number of residues outside strand to be treated as terminal (-strandendout)Integer1(defined $value and $value != $vdef) ? " -strandendout=$value" : ""( "" , " -strandendout=" + str( value ) )[ value is not None and value != vdef ]This option (together with the -strandendin option) specify the range of structure termini for the intermediate penalties. In the alignment output, these are indicated as lower case. For Beta Strands, the default range spans the end residue and the adjacent loop residue, since sequence conservation often extends beyond the actual H-bonded Beta Strand.secstroutOutput in alignment (-secstrout)ChoiceSTRUCTURESTRUCTUREMASKBOTHNONE(defined $value and $value ne $vdef) ? " -secstrout=$value" : ""( "" , " -secstrout=" + str( value ) )[ value is not None and value != vdef ]This option lets you choose whether or not to include the masks in the CLUSTAL W output alignments. Showing both is useful for understanding how the masks work. The secondary structure information is itself very useful in judging the alignment quality and in seeing how residue conservation patterns vary with secondary structure.outputparamOutput parameters5outputformatOutput format (-output)ChoicenullnullGCGGDEPHYLIPINEXUSPIRFASTA(defined $value and $value ne $vdef) ? " -output=$value" : ""( "" , " -output=" + str( value) )[ value is not None and value != vdef ]seqnosOutput sequence numbers in the output file (for clustalw output only) (-seqnos)Booleannot defined $outputformatoutputformat is None0(defined $value and $value != $vdef) ? " -seqnos=on" : ""( "" , " -seqnos=on")[ value is not None and value != vdef]outorderResult order (-outorder)Choicealignedinputaligned(defined $value and $value ne $vdef) ? " -outorder=$value" : ""( "" , " -outorder=" + str(value))[ value is not None and value != vdef ]outfileSequence alignment file name(-outfile)Filename(defined $value) ? " -outfile=$value" : ""( "" , " -outfile=" + str( value))[ value is not None ]clustalaligfileAlignment fileAlignmentCLUSTALnot defined $outputformatoutputformat is None(defined $outfile)? "$outfile":"*.aln"("*.aln", str(outfile))[outfile is not None]In the conservation line output in the clustal format alignment file, three characters are used:'*' indicates positions which have a single, fully conserved residue.':' indicates that one of the following 'strong' groups is fully conserved (STA,NEQK,NHQK,NDEQ,QHRK,MILV,MILF,HY,FYW).'.' indicates that one of the following 'weaker' groups is fully conserved (CSA,ATV,SAG,STNK,STPA,SGND,SNDEQK,NDEQHK,NEQHRK,FVLIM,HFY).These are all the positively scoring groups that occur in the Gonnet Pam250
matrix. The strong and weak groups are defined as strong score >0.5 and weak
score =<0.5 respectively.aligfileAlignment fileAlignment$outputformat =~ /^(NEXUS|GCG|PHYLIPI|FASTA)$/outputformat in [ "NEXUS", "GCG", "PHYLIPI","FASTA"](defined $outfile)? "$outfile":"*.fasta *.nxs *.phy *.msf"{ "OUTFILE":outfile, "FASTA":"*.fasta", "NEXUS": "*.nxs", "PHYLIPI": "*.phy" , 'GCG': '*.msf' }[( "OUTFILE", outputformat)[outfile is None]]seqfileSequences fileSequenceNBRFGDE$outputformat =~ /^(GDE|PIR)$/outputformat in [ 'GDE', 'PIR' ](defined $outfile)? "$outfile":"*.gde *.pir"{ "OUTFILE":outfile, 'GDE':'*.gde', 'PIR':'*.pir}[( "OUTFILE", outputformat)[outfile is None]]dndfileTree fileTreeNEWICK"*.dnd""*.dnd"
Programs-5.1.2/btwisted.xml 0000644 0001750 0000156 00000011437 12072525233 014531 0 ustar bneron sis
btwistedEMBOSS6.3.1EMBOSSEuropean Molecular Biology Open Software SuiteRice,P. Longden,I. and Bleasby, A.EMBOSS: The European Molecular Biology Open Software Suite (2000) Rice,P. Longden,I. and Bleasby, A. Trends in Genetics 16, (6) pp276--277http://emboss.sourceforge.net/downloadhttp://emboss.sourceforge.netbtwistedCalculate the twisting in a B-DNA sequencehttp://bioweb2.pasteur.fr/docs/EMBOSS/btwisted.htmlhttp://emboss.sourceforge.net/docs/themessequence:nucleic:compositionbtwistede_inputInput sectione_sequencesequence optionDNASequenceEMBLFASTAGCGGENBANKNBRFCODATARAWSWISSPROTGFF1,1("", " -sequence=" + str(value))[value is not None]1e_angledataDna base pair twist angle data fileBasePairTwistAnglesAbstractText("", " -angledata=" + str(value))[value is not None ]2e_energydataDna base pair stacking energies data fileBasePairStackingEnergiesAbstractText("", " -energydata=" + str(value))[value is not None ]3e_outputOutput sectione_outfileName of the output file (e_outfile)Filenamebtwisted.e_outfile("" , " -outfile=" + str(value))[value is not None]4e_outfile_outoutfile_out optionBtwistedReportReporte_outfileautoTurn off any promptingString" -auto -stdout"5
Programs-5.1.2/trnascan.xml 0000644 0001750 0000156 00000105405 11767572177 014537 0 ustar bneron sis
trnascan1.23tRNAscan-SEDetection of transfer RNA genesT. Lowe, S. EddyFichant, G.A. and Burks, C. (1991) Identifying potential tRNA genes in genomic DNA sequences, J. Mol. Biol., 220, 659-671.Eddy, S.R. and Durbin, R. (1994) RNA sequence analysis using covariance models, Nucl. Acids Res., 22, 2079-2088.Pavesi, A., Conterio, F., Bolchi, A., Dieci, G., Ottonello, S. (1994) Identification of new eukaryotic tRNA genes in genomic DNA databases by a multistep weight matrix analysis of trnascriptional control regions, Nucl. Acids Res., 22, 1247-1256.Lowe, T.M. and Eddy, S.R. (1997) tRNAscan-SE: A program for improved detection of transfer RNA genes in genomic sequence, Nucl. Acids Res., 25, 955-964.tRNAscan-SE identifies transfer RNA genes in genomic DNA or RNA sequences. It combines the specificity of the Cove probabilistic RNA prediction package (Eddy & Durbin, 1994) with the speed and sensitivity of tRNAscan 1.3 (Fichant & Burks, 1991) plus an implementation of an algorithm described by Pavesi and colleagues
(1994) which searches for eukaryotic pol III tRNA promoters (our implementation referred to as EufindtRNA). tRNAscan and EufindtRNA are used as first-pass prefilters to identify "candidate" tRNA regions of the sequence. These subsequences are then passed to Cove for further analysis, and output if Cove confirms the initial tRNA prediction. http://selab.janelia.org/software.html#trnascanftp://selab.janelia.org/pub/software/tRNAscan-SE/sequence:nucleic:patterntRNAscan-SEsequenceSequence FileDNASequenceFASTA" $value"" "+str(value)2search_optionsSearch Mode options1prokaryoticImprove detection of prokaryotic tRNAs (-P)Boolean0 ($value) ? " -P":""("" , " -P")[ value ]This parameter loosens the search parameters for EufindtRNA to improve detection of prokaryotic tRNAs. Use this option when scanning prokaryotic sequences or both eukaryotic and prokaryotic sequences in the same sequence file. This option also disables pseudogene checking automatically since criteria for pseudogene checking were developed for eukaryotic pseudogenes.Use of this mode with prokaryotic sequences will also improve bounds prediction of the 3' end (the terminal CCA triplet).archealSelect archeal-specific covariance model (-A)Boolean0($value) ? " -A" : ""( "" , " -A" )[ value ]This option selects an archaeal-specific covariance model for tRNA analysis, as well as slightly loosening the EufindtRNA search cutoffs.organellarBypasses the fast first-pass scanners that are poor at detecting organellar tRNAs (-O)Boolean0 ($value) ? " -O":""("" , " -O")[ value ]This parameter bypasses the fast first-pass scanners that are poor at detecting organellar tRNAs and runs Cove analysis only. Since true organellar tRNAs have been found to have Cove scores between 15 and 20 bits, the search cutoff is lowered from 20 to 15 bits. Also, pseudogene checking is disabled since it is only applicable to eukaryotic cytoplasmic tRNA pseudogenes. Since Cove-only mode is used, searches will be very slow (see -C option below) relative to the default mode.generalGeneral covariance model trained on all three phylogenetic domains (-G)Boolean0($value) ? " -G" : ""( "" , " -G" )[ value ]This option selects the general tRNA covariance model that was trained on tRNAs from all three phylogenetic domains (archaea, bacteria, & eukarya). This mode can be used when analyzing a mixed collection of sequences from more than one phylogenetic domain, with only slight loss of sensitivity and selectivity.The original publication describing this program and tRNAscan-SE version 1.0 used this general tRNA model exclusively. If you wish to compare scores to those found in the paper or scans using v1.0, use this option. Use of this option is compatible with all other search mode options described in this section.cove_onlyAnalyze sequences using Cove only (-C)Boolean0 ($value) ? " -C":""("" , " -C")[ value ]Directs tRNAscan-SE to analyze sequences using Cove analysis only. This option allows a slightly more sensitive search than the default tRNAscan + EufindtRNA -> Cove mode, but is much slower (by approx. 250 to 3,000 fold). Output format and other program defaults are otherwise identical to the normal analysis.breakdownShow both primary and secondary structure components to covariance model bit score (-H)Boolean0($value) ? " -H" : ""( "" , " -H" )[ value ]Since tRNA pseudogenes often have one very low component (good secondary structure but poor primary sequence similarity to the tRNA model, or vice versa), this information may be useful in deciding whether a low-scoring tRNA is likely to be a pseudogene. The heuristic pseudogene detection filter uses this information to flag possible pseudogenes -- use this option to see why a hit is marked as a possible pseudogene. The user may wish to examine score breakdowns from known tRNAs in the organism of interest to get a frame of reference.disable_checkingDisable pseudogene checking (-D)Boolean0 ($value) ? " -D":""("" , " -D")[ value ]This will slightly speed the program and may be necessary for non-eukaryotic sequences that are flagged as possible pseudogenes but are known to be functional tRNAs.special_optionsSpecial options1trnascan_onlyUse tRNAscan only to analyze sequences (-T)Boolean0($value) ? " -T":""("" , " -T")[ value ]Directs tRNAscan-SE to use only tRNAscan to analyze sequences. This mode will default to using 'strict' parameters with tRNAscan analysis (similar to tRNAscan version 1.3 operation). This mode of operation is faster (3-5 times faster than default mode analysis), but will result in approximately 0.2 to 0.6 false positive tRNAs per Mbp, decreased sensitivity, and less reliable prediction of anticodons, tRNA isotype, and introns.eufindtrna_onlyUse EufindtRNA only to search for tRNAs (-E)Boolean0($value) ? " -E":""("" , " -E")[ value ]Since Cove is not being used as a secondary filter to remove false positives, this run mode defaults to 'Normal' parameters which more closely approximates the sensitivity and selectivity of the original algorithm describe by Pavesi and colleagues (see the option -e for a description of the various run modes).trnascan_modeStrict or relaxed tRNAscan mode (-t)ChoiceSSR(defined $value and $value ne $vdef) ? " -t $value":""("" , " -t "+ str(value))[ value is not None and value != vdef]Relaxed parameters may give very slightly increased search sensitivity, but increase search time by 20-40 fold.eufindtrna_modeEufindtRNA mode (-e)ChoiceSSRN(defined $value) ? " -e $value":""("" , " -e "+ str(value))[ value is not None ]Explicitly set EufindtRNA params, where <mode>= R, N, or S (relaxed, normal, or strict). The 'relaxed' mode is used for EufindtRNA when using tRNAscan-SE in default mode. With relaxed parameters, tRNAs that lack pol III poly-T terminators are not penalized, increasing search sensitivity, but decreasing selectivity. When Cove analysis is being used as a secondary filter for false positives (as in tRNAscan-SE's default mode), overall selectivity is not decreased.Using 'normal' parameters with EufindtRNA does incorporate a log odds score for the distance between the B box and the first poly-T terminator, but does not disqualify tRNAs that do not have a terminator signal within 60 nucleotides. This mode is used by default when Cove analysis is not being used as a secondary false positive filter.Using 'strict' parameters with EufindtRNA also incorporates a log odds score for the distance between the B box and the first poly-T terminator, but _rejects_ tRNAs that do not have such a signal within 60 nucleotides of the end of the B box. This mode most closely approximates the originally published search algorithm (3); sensitivity is reduced relative to using 'relaxed' and 'normal' modes, but selectivity is increased which is important if no secondary filter, such as Cove analysis, is being used to remove false positives. This mode will miss most prokaryotic tRNAs since the poly-T terminator signal is a feature specific to eukaryotic tRNAs genes (always use 'relaxed' mode for scanning prokaryotic sequences for tRNAs).save_first_passSave first pass results (-r)Boolean0 ($value) ? " -r\\#":""("" , " -r#")[ value ]Save tabular, formatted output results from tRNAscan and/or EufindtRNA first pass scans. The format is similar to the final tabular output format, except no Cove score is available at this point in the search (if EufindtRNA has detected the tRNA, the negative log likelihood score is given). Also, the sequence ID number and source sequence length appear in the columns where intron bounds are shown in final output. This option may be useful for examining false positive tRNAs predicted by first-pass scans that have been filtered out by Cove analysis.previous_first_pass_resultUse a previous first pass result tabular file (-u)TrnaScanFirstPassResultAbstractText$matching or $startmatching and start (defined $value) ? " -u $value":""("" , " -u "+ str(value))[ value is not None ]This option allows the user to re-generate results from regions identified to have tRNAs by a previous tRNAscan-SE run. Either a regular tabular result file, or output saved with the -r option may be used as the specified <file>. This option is particularly useful for generating either secondary structure output (-f option) or ACeDB output (-a option) without having to re-scan entire sequences. Alternatively, if the -r option is used to generate the previous results file, tRNAscan-SE will pick up at the stage of Cove-confirmation of tRNAs and output final tRNA predictons as with a normal run.false_positivesSave false positives (-F)Boolean0 ($value) ? " -F\\#":""("" , " -F#")[ value ]Save first-pass candidate tRNAs that were then found to be false positives by Cove analysis. This option saves candidate tRNAs found by either tRNAscan and/or EufindtRNA that were then rejected by Cove analysis as being false positives. tRNAs are saved in the FASTA sequence format.specify_optionsSpecify Alternate Cutoffs / Data Files options1cutoffCove cutoff score for reporting tRNAs (-X)Integer20(defined $value and $value != $vdef) ? " -X $value":""("" , " -X "+ str(value))[ value is not None and value != vdef]This option allows the user to specify a different Cove score threshold for reporting tRNAs. It is not recommended that novice users change this cutoff, as a lower cutoff score will increase the number of pseudogenes and other false positives found by tRNAscan-SE (especially when used with the 'Cove only' scan mode). Conversely, a higher cutoff than 20.0 bits will likely cause true tRNAs to be missed by tRNAscan (numerous 'real' tRNAs have been found just above the 20.0 cutoff). Knowledgable users may wish to experiment with this parameter to find very unusual tRNAs or pseudogenes beyond the normal range of detection with the preceding caveats in mind.LengthMax length of tRNA intron+variable region (-L)Integer116(defined $value and $value !=$vdef) ? " -L $value":""("" , " -L "+ str(value))[ value is not None and value !=vdef]Set max length of tRNA intron+variable region (default=116bp). The default maximum tRNA length for tRNAscan-SE is 192 bp, but this limit can be increased with this option to allow searches with no practical limit on tRNA length. In the first phase of tRNAscan-SE, EufindtRNA searches for A and B boxes of <length> maximum distance apart, and passes only the 5' and 3' tRNA ends to covariance model analysis for confirmation (removing the bulk of long intervening sequences). tRNAs containing group I and II introns have been detected by setting this parameter to over 800 bp. Caution: group I or II introns in tRNAs tend to occur in positions other than the canonical position of protein-spliced introns, so tRNAscan-SE mispredicts the intron bounds and anticodon sequence for these cases. tRNA bound predictions, however, have been found to be reliable in these same tRNAs.add_to_both_endsNumber of nucleotids to add to both ends during first-pass (-z)Integer7(defined $value and $value != $vdef) ? " -z $value" : ""( "" , " -z "+ str(value) )[ value is not None and value != vdef]By default, tRNAscan-SE adds 7 nucleotides to both ends of tRNA predictions when first-pass tRNA predictions are passed to covariance model (CM) analysis. CM analysis generally trims these bounds back down, but on occasion, allows prediction of an otherwise truncated first-pass tRNA prediction.geneticGenetic code (-g)Choicenot $trnascan_only and not $eufindtrna_onlynot trnascan_only and not eufindtrna_only StandardStandardgcode.cilnucgcode.echdmitogcode.invmitogcode.othmitogcode.vertmitogcode.ystmito(defined $value and $value ne $vdef) ? " -g $value":""("" , " -g "+ str(value))[ value is not None and value != vdef]This option does not have any effect when using the -T or -E options -- you must be running in default or Cove only analysis mode.covarianteSpecify an alternate covariance model (-c)Text (defined $value) ? " -c $value":""("" , " -c " + str(value))[ value is not None]misc_optionsMisc options1matchingSearch only sequences with names matching this string (-n)String (defined $value) ? " -n $value":""("" , " -n "+ str(value))[ value is not None] Search only sequences with names matching this string. Only those sequences with names (first non-white space word after '>' symbol on FASTA name/description line) matching this string are analyzed for tRNAs.startStart search at first sequence with name matching this string (-s)String (defined $value) ? " -s $value":""("" , " -s "+ str(value))[ value is not None ]Start search at first sequence with name matching <EXPR> string and continue to end of input sequence file(s). This may be useful for re-starting crashed/aborted runs at the point where the previous run stopped. (If same names for output file(s) are used, program will ask if files should be over-written or appended to -- choose append and run will successfully be restarted where it left off).output_optionsOutput options1secondary_structureSave secondary structure results file (-f)Boolean0 ($value) ? " -f\\#":""("" , " -f#")[ value ]Save final results and Cove tRNA secondary structure predictions. This output format makes visual inspection of individual tRNA predictions easier since the tRNA sequence is displayed along with the predicted tRNA base pairings.acedbOutput final results in ACeDB format instead of the default tabular format (-a)Boolean0 ($value) ? " -a":""("" , " -a")[ value ]statisticsSave statistics summary for run (-m)Boolean0 ($value) ? " -m\\#":""("" , " -m#")[ value ]This option directs tRNAscan-SE to write a brief summary to a file which contains the run options selected as well as statistics on the number of tRNAs detected at each phase of the search, search speed, and other bits of information. See Manual documentation for explanation of each statistic.progressDisplay program progress (-d)Boolean0 ($value) ? " -d":""("" , " -d")[ value ]Messages indicating which phase of the tRNA search are printed to standard output. If final results are also being sent to standard output, some of these messages will be suppressed so as to not interrupt display of the results.logSave log of program progress (-l)Boolean0 ($value) ? " -l\\#":""("" , " -l#")[ value ]quietQuiet mode (-q)Boolean0 ($value) ? " -q":""("" , " -q")[ value ]The credits & run option selections normally printed to standard error at the beginning of each run are suppressed.briefUse brief output format (-b)Boolean0 ($value) ? " -b":""("" , " -b")[ value ]This eliminates column headers that appear by default when writing results in tabular output format. Useful if results are to be parsed or piped to another program.trna_codonOutput a tRNA's corresponding codon in place of its anticodon (-N)Boolean0 ($value) ? " -N":""("" , " -N")[ value ]labelUse prefix for all default output file names (-p)Filename (defined $value) ? " -p $value":""("" , " -p "+str(value))[ value is not None ]scannersDisplays which of the first-pass scanners detected the tRNA being output (-y)Boolean0($value) ? " -y":""("" , " -y")[ value ]1'Ts', 'Eu', or 'Bo' will appear in the last column of Tabular output, indicating that either tRNAscan 1.4, EufindtRNA, or both scanners detected the tRNA, respectively.resultsResults filesText"*.stats""*.log""*.ss""*.fpos""*.stats""*.log""*.ss""*.fpos"first_pass_scan_resultsFirst pass scan resultTrnaScanFirstPassResultAbstractText"*.fpass.out""*.fpass.out"
Programs-5.1.2/digest.xml 0000644 0001750 0000156 00000033047 12072525233 014164 0 ustar bneron sis
digestEMBOSS6.3.1EMBOSSEuropean Molecular Biology Open Software SuiteRice,P. Longden,I. and Bleasby, A.EMBOSS: The European Molecular Biology Open Software Suite (2000) Rice,P. Longden,I. and Bleasby, A. Trends in Genetics 16, (6) pp276--277http://emboss.sourceforge.net/downloadhttp://emboss.sourceforge.netdigestReports on protein proteolytic enzyme or reagent cleavage siteshttp://bioweb2.pasteur.fr/docs/EMBOSS/digest.htmlhttp://emboss.sourceforge.net/docs/themessequence:protein:motifsdigeste_inputInput sectione_seqallseqall optionProteinSequenceEMBLFASTAGCGGENBANKNBRFCODATARAWSWISSPROTGFF1,n("", " -seqall=" + str(value))[value is not None]1e_mwdataMolecular weights data fileMolecularWeightsAbstractText("", " -mwdata=" + str(value))[value is not None ]2Molecular weight data for amino acidse_requiredRequired sectione_menuEnzymes and reagentsChoice112345678("", " -menu=" + str(value))[value is not None and value!=vdef]3e_monoUse monoisotopic weightsBoolean0("", " -mono")[ bool(value) ]4e_advancedAdvanced sectione_unfavouredAllow unfavoured cutsBoolean0("", " -unfavoured")[ bool(value) ]5Trypsin will not normally cut after 'KR' if they are followed by any of 'KRIFLP'. Lys-C will not normally cut after 'K' if it is followed by 'P'. Arg-C will not normally cut after 'R' if it is followed by 'P'. V8-bicarb will not normally cut after 'E' if it is followed by any of 'KREP'. V8-phosph will not normally cut after 'DE' if they are followed by 'P'. Chymotrypsin will not normally cut after 'FYWLM' if they are followed by 'P'. Specifying unfavoured shows these unfavoured cuts as well as the favoured ones.e_raggingAllow raggingBoolean0("", " -ragging")[ bool(value) ]6Allows semi-specific and non-specific digestion. This option is particularly useful for generating lists of peptide sequences for protein identification using mass-spectrometry.e_terminiRagging value (value from 1 to 4)Choice11234("", " -termini=" + str(value))[value is not None and value!=vdef]7e_outputOutput sectione_overlapShow overlapping partialsBoolean0("", " -overlap")[ bool(value) ]8Used for partial digestion. Shows all cuts from favoured cut sites plus 1..3, 2..4, 3..5 etc but not (e.g.) 2..5. Overlaps are therefore fragments with exactly one potential cut site within it.e_allpartialsShow all partialsBoolean0("", " -allpartials")[ bool(value) ]9As for overlap but fragments containing more than one potential cut site are included.e_outfileName of the report fileFilenamedigest.report("" , " -outfile=" + str(value))[value is not None]10e_rformat_outfileChoose the report output formatChoiceSEQTABLEDASGFFDBMOTIFDIFFSEQEMBLEXCELFEATTABLEGENBANKGFFLISTFILEMOTIFNAMETABLECODATAREGIONSSEQTABLESIMPLESRSSWISSTABLETAGSEQ("", " -rformat=" + str(value))[value is not None and value!=vdef]11e_outfile_outoutfile_out optionTexte_rformat_outfile in ['DASGFF', 'DBMOTIF', 'DIFFSEQ', 'EMBL', 'EXCEL', 'FEATTABLE', 'GENBANK', 'GFF', 'LISTFILE', 'MOTIF', 'NAMETABLE', 'CODATA', 'REGIONS', 'SEQTABLE', 'SIMPLE', 'SRS', 'SWISS', 'TABLE', 'TAGSEQ']e_outfileautoTurn off any promptingString" -auto -stdout"12
Programs-5.1.2/seqret.xml 0000644 0001750 0000156 00000014600 12072525233 014202 0 ustar bneron sis
seqretEMBOSS6.3.1EMBOSSEuropean Molecular Biology Open Software SuiteRice,P. Longden,I. and Bleasby, A.EMBOSS: The European Molecular Biology Open Software Suite (2000) Rice,P. Longden,I. and Bleasby, A. Trends in Genetics 16, (6) pp276--277http://emboss.sourceforge.net/downloadhttp://emboss.sourceforge.netseqretReads and writes (returns) sequenceshttp://bioweb2.pasteur.fr/docs/EMBOSS/seqret.htmlhttp://emboss.sourceforge.net/docs/themessequence:editseqrete_inputInput sectione_featureUse feature informationBoolean0("", " -feature")[ bool(value) ]1e_sequencesequence optionSequenceEMBLFASTAGCGGENBANKNBRFCODATARAWSWISSPROTGFF1,n("", " -sequence=" + str(value))[value is not None]2e_advancedAdvanced sectione_firstonlyRead one sequence and stopBoolean0("", " -firstonly")[ bool(value) ]3e_outputOutput sectione_outseqName of the output sequence file (e_outseq)Filenameseqret.e_outseq("" , " -outseq=" + str(value))[value is not None]4e_osformat_outseqChoose the sequence output formatChoiceFASTAEMBLFASTAGCGGENBANKNBRFCODATARAWSWISSPROTGFF("", " -osformat=" + str(value))[value is not None and value!=vdef]5e_outseq_outoutseq_out optionSequencee_outseqautoTurn off any promptingString" -auto -stdout"6
Programs-5.1.2/dca.xml 0000644 0001750 0000156 00000026217 11767572177 013460 0 ustar bneron sis
dca1.1DCADivide-and-Conquer Multiple Sequence AlignmentJ. StoyeA.W.M. Dress, G. Fullen, S.W. Perrey, A Divide and Conquer Approach to Multiple Alignment, Proceedings of the Third International Conference on Intelligent Systems for Molecular Biology (ISMB 95), AAAI Press, Menlo Park, CA, USA, 107-113, 1995.J. Stoye, Multiple Sequence Alignment with the Divide-and-Conquer Method, Gene 211(2), GC45-GC56, 1998. (Gene-COMBIS)Divide-and-Conquer Multiple Sequence Alignment (DCA) is a program for producing fast, high quality simultaneous multiple sequence alignments of amino acid, RNA, or DNA sequences. The program is based on the DCA algorithm, a heuristic approach to sum-of-pairs (SP) optimal alignment that has been developed at the FSPM over the years 1995-97.http://bibiserv.techfak.uni-bielefeld.de/dca/http://bibiserv.techfak.uni-bielefeld.de/download/tools/dca.htmlalignment:multipledcaseqSequences FileSequenceFASTA" $value"" "+str(value)100controlControl parameterscostCost matrix (-c)Choicenullnullblosum30blosum45blosum62pam160pam250unitcostdnarnadnarna(defined $value and $value ne $vdef)? " -c $value" : "" ( "" , " -c " + str(value) )[ value is not None and value != vdef]1gapsPenalize end gaps as internal gaps (-g)Boolean0 ($value)? " -g ":""("" , " -g ")[ value ]1Default: free shiftapproximateUse approximate cut positions (-a)Boolean0 ($value)? " -a":""("" , " -a")[ value ]1On: FastDCA (use approximate cut positions); Off: slower, more accurate algorithm (search for exact cut positions)intensityWeight intensity (-b)Float0.0(defined $value and $value != $vdef)? " -b $value" : "" ( "" , " -b " + str(value) )[ value is not None and value != vdef]Weight intensity must be >= 0.0 and <= 1.0$intensity >= 0.0 and $intensity <= 1.0intensity >= 0.0 and intensity <= 1.01recursionRecursion stop size (-l)Integer30 (defined $value and $value != $vdef)? " -l $value" : "" ( "" , " -l " + str(value) )[ value is not None and value != vdef]15 ... 100 recommended; small: faster algorithm, maybe worse.windowWindow size (-w)Integer0(defined $value and $value != $vdef)? " -w $value" : "" ( "" , " -w " + str(value) )[ value is not None and value != vdef]1To correct the alignment in the proximity of division sites, the sequences can be re-aligned inside a window of size w >= 0 placed across each slicing site.outputOutput parametersquietString" -q"" -q"1output_formatOutput format (-f)Choice21234(defined $value and $value ne $vdef)? " -f $value" : "" ( "" , " -f " + str(value) )[ value is not None and value != vdef]1suppress_outputSuppress output about progress of the program (-o)String" -o"" -o"1fasta_outfileAlignment fileAlignmentFASTANEXUS$output_format eq "2" or $output_format eq "3"output_format == "2" or output_format == "3""dca.out""dca.out"aln_outfileDCA alignment fileDcalignmentAbstractTextCLUSTALDCA$output_format eq "1" or $output_format eq "4"output_format == "1" or output_format == "4""dca.out""dca.out"
Programs-5.1.2/phiblast.xml 0000644 0001750 0000156 00000076030 11767572177 014535 0 ustar bneron sis
phiblastPHI-BlastPattern-Hit Initiated BLASTR. Baeza-Yates and G. Gonnet, Communications of the ACM 35(1992), pp. 74-82.S. Wu and U. Manber, Communications of the ACM 35(1992), pp. 83-91.database:search:homologydatabase:search:patternphiblastProgram (-p)Choiceblastpgpblastpgppatseedpseedp"blastpgp -p $value""blastpgp -p " + str(value)PHI-BLAST (Pattern-Hit Initiated BLAST) is a search program that combines matching of regular expressions with local alignments surrounding the match. The calculation of local alignments is done using a method very similar to (and much of the same code as) gapped BLAST.Program modes:. patseedp: normal phiblast mode. seedp: Restrict the search for local alignments to a subset of the pattern occurrences in the query. This program option requires the user to specify the location(s) of the interesting pattern occurrence(s) in the pattern file (for the syntax see below). When there are multiple pattern occurrences in the query it may be important to decide how many are of interest because the E-value for matches is effectively multiplied by the number of interesting pattern occurrences.querySequence File (-i)SequenceFASTA" -i $query" " -i " + str(value)3start_regionStart of required region in query (-S)Integer1(defined $value and $value != $vdef)? " -S $value" : ""( "" , " -S " + str(value) )[ value is not None and value != vdef]5end_regionEnd of required region in query (-H)Integer-1(defined $value and $value != $vdef)? " -H $value" : ""( "" , " -H " + str(value) )[ value is not None and value != vdef]5Location on query sequence. -1 indicates end of querypatternPattern file- Prosite syntax (-k)PrositePatternAbstractText" -k $value" " -k " + str(value)3Given a protein sequence S and a regular expression pattern P occurring in S, PHI-BLAST helps answer the question: What other protein sequences both contain an occurrence of P and are homologous to S in the vicinity of the pattern occurrences?Rules for pattern syntax:The syntax for patterns in PHI-BLAST follows the conventions of PROSITE. When using the stand-alone program, it is permissible to have multiple patterns in a file separated by a blank line between patterns. Valid protein characters for PHI-BLAST patterns:ABCDEFGHIKLMNPQRSTVWXYZUOther useful delimiters:[ ] means any one of the characters enclosed in the brackets e.g., [LFYT] means one occurrence of L or F or Y or T - means nothing (this is a spacer character used by PROSITE) x with nothing following means any residuex(5) means 5 positions in which any residue is allowed (and similarly for any other single number in parentheses after x)x(2,4) means 2 to 4 positions where any residue is allowed, and similarly for any other two numbers separated by a comma; the first number should be < the second number.
IP PATTERN
PA [LIVM]-x-D-x(2)-[GA]-[NQS]-K-G-T-G-x-W
protein_dbProtein database (-d)Choicenullnull " -d $value" " -d " + str(value) 2scoringScoring options4open_a_gapCost to open a gap (-G)Integer11(defined $value and $value != $vdef) ? " -G $value" : ""( "" , " -G " + str(value) )[ value is not None and value != vdef]extend_a_gapCost to extend a gap (-E)Integer1(defined $value and $value != $vdef) ? " -E $value" : ""( "" , " -E " + str(value) )[ value is not None and value != vdef]Limited values for gap existence and extension are supported for these three programs. Some supported and suggested values are:Existence Extension10 -- 110 -- 211 -- 18 -- 29 -- 2(source: NCBI Blast page)matrixSimilarity matrix (-M)ChoiceBLOSUM62BLOSUM45BLOSUM80BLOSUM62PAM30PAM70(defined $value and $value ne $vdef)? " -M $value" : ""( "" , " -M " + str(value) )[ value is not None and value != vdef]filter_optFiltering and masking options5This options also takes a string as an argument. One may use such a string to change the specific parameters of seg or invoke other filters. Please see the 'Filtering Strings' section (below) for details.filterFilter query sequence with SEG (-F)Boolean0($value) ? " -F T" : ""( "" , " -F T" )[ value ]lower_caseUse lower case filtering (-U)Boolean0($value) ? " -U T" : ""("", " -U T")[value]This option specifies that any lower-case letters in the input FASTA file should be masked.selectivity_optSelectivity options5ExpectExpected value (-e)Float10(defined $value and $value != $vdef)? " -e $value":""("" , " -e " + str(value))[ value is not None and value != vdef]The statistical significance threshold for reporting matches against database sequences; the default value is 10, such that 10 matches are expected to be found merely by chance, according to the stochastic model of Karlin and Altschul (1990). If the statistical significance ascribed to a match is greater than the EXPECT threshold, the match will not be reported. Lower EXPECT thresholds are more stringent, leading to fewer chance matches being reported. Fractional values are acceptable. word_sizeWord Size (-W)Integer(defined $value) ? " -W $value" : ""("" , " -W "+str(value))[value is not None]Valid wordsize range is 2 to 3$value >= 2 and $value <=3value >= 2 and value <=3Use words of size N.Zero invokes default behaviorDefault value: 3windowMultiple hits window size (-A)Integer40(defined $value and $value != $vdef)? " -A $value" : ""( "" , " -A " + str(value) )[ value is not None and value != vdef]When multiple hits method is used, this parameter defines the distance from last hit on the same diagonal to the new one.Zero means single hit algorithm.extend_hitThreshold for extending hits (-f)Integer11(defined $value and $value != $vdef)? " -f $value" : ""( "" , " -f " + str(value) )[ value is not None and value !=vdef]Blast seeks first short word pairs whose aligned score reaches at least this valuedropoffX dropoff value for gapped alignment (-X)Integer(defined $value)? " -X $value":""("" , " -X " + str(value))[ value is not None ]This is the value that control the path graph region explored by Blast during a gapped extension (Xg in the NAR paper).dropoff_zX dropoff value for final gapped alignment (-Z)Integer25(defined $value and $value != $vdef)? " -Z $value" : ""( "" , " -Z " + str(value) )[ value is not None and value != vdef]This parameter controls the dropoff for the final reported alignment. See also the -X parameter.dropoff_yDropoff for blast ungapped extensions in bits (-y)Float7.0(defined $value and $value != $vdef) ? " -y $value" : ""( "" , " -y " + str(value) )[ value is not None and value != vdef]This parameter controls the dropoff at ungapped extension stage. See also the -X parameter.eff_lenEffective length of the search space (-Y)Integer0(defined $value and $value != $vdef) ? " -Y $value" : ""("" , " -Y "+str(value))[value is not None and value !=vdef]Use zero for the real sizekeep_hitsNumber of best hits from a region to keep (-K)Integer(defined $value) ? " -K $value" : ""("" , " -K "+str(value))[value is not None]If this option is used, a value of 100 is recommended.modeSingle-hit or multiple-hit mode (-P)Choice001($value eq "0") ? " -P $value" : ""("" , " -P "+str(value))[value != "0"]nb_bitsNumber of bits to trigger gapping (-N)Integer22(defined $value and $value != $vdef) ? " -N $value" : ""("" , " -N "+str(value))[value is not None and value !=vdef]phi_spec_optPHI-Blast specific selectivity options5multipassMaximum number of passes to use in multipass version (-j)Integer1(defined $value and $value != $vdef) ? " -j $value" : ""("" , " -j "+str(value))[value is not None and value !=vdef]pseudocountsConstant in pseudocounts for multipass version (-c)Integer9(defined $value and $value != $vdef) ? " -c $value" : ""("" , " -c "+str(value))[value is not None and value !=vdef]e_thresholde-value threshold for inclusion in multipass model (-h)Float0.002(defined $value and $value != $vdef) ? " -h $value" : ""("" , " -h "+str(value))[value is not None and value !=vdef]affichageReport options5DescriptionsNumber of one-line descriptions to show? (-v)Integer500(defined $value and $value != $vdef) ? " -v $value" : ""( "" , " -v " + str(value) )[ value is not None and value != vdef]Maximum number of database sequences for which one-line descriptions will be reported.AlignmentsNumber of database sequences to show alignments? (-b)Integer250(defined $value and $value != $vdef) ? " -b $value" : ""( "" , " -b " + str(value) )[ value is not None and value != vdef]Maximum number of database sequences for which high-scoring segment pairs will be reported (-b).view_alignmentsAlignment view options (-m)Choice0012345678(defined $value and $value ne $vdef)? " -m $value" : "" ( "" , " -m " + str(value) )[ value is not None and value != vdef]txtoutputText outputString$view_alignments ne "7"view_alignments != "7"" -o phiblast.txt"" -o phiblast.txt"10xmloutputXml outputString$view_alignments eq "7"view_alignments == "7"" -o phiblast.xml"" -o phiblast.xml"10htmloutputHtml outputBoolean$view_alignments !~ /^[78]$/view_alignments not in [ "7" , "8" ]1($value) ? " && html4blast -g -o phiblast.html phiblast.txt" : ""("" , " && html4blast -g -o phiblast.html phiblast.txt")[value]11believeBelieve the query defline (-J)Boolean0($value)? " -J":""("" , " -J")[ value ]seqalign_fileSeqAlign file (-J option must be true) (-O)Filename$believebelieve(defined $value)? " -O $value" : ""( "" , " -O " + str(value) )[ value is not None ]SeqAlign is in ASN.1 format, so that it can be read with NCBI tools (such as sequin). This allows one to view the results in different formats.txtfileBlast text reportBlastTextReportReport$view_alignments ne "7"view_alignments != "7""phiblast.txt""phiblast.txt"xmlfileBlast xml reportBlastXmlReportReport$view_alignments eq "7"view_alignments == "7""phiblast.xml""phiblast.xml"htmlfileBlast html reportBlastHtmlReportReport$view_alignments !~ /^[78]$/view_alignments not in ["7", "8"]"phiblast.html""phiblast.html"imgfilePictureBinary$view_alignments !~ /^[78]$/view_alignments not in ["7", "8"]"*.png""*.gif""*.png""*.gif"
Programs-5.1.2/distmat.xml 0000644 0001750 0000156 00000024361 11672346320 014354 0 ustar bneron sis
distmatEMBOSS6.3.1EMBOSSEuropean Molecular Biology Open Software SuiteRice,P. Longden,I. and Bleasby, A.EMBOSS: The European Molecular Biology Open Software Suite (2000) Rice,P. Longden,I. and Bleasby, A. Trends in Genetics 16, (6) pp276--277http://emboss.sourceforge.net/downloadhttp://emboss.sourceforge.netdistmatCreate a distance matrix from a multiple sequence alignmenthttp://bioweb2.pasteur.fr/docs/EMBOSS/distmat.htmlhttp://emboss.sourceforge.net/docs/themesphylogeny:distancedistmate_inputInput sectione_sequencesequence optionAlignmentFASTAMSFPAIRMARKX0MARKX1MARKX2MARKX3MARKX10SRSSRSPAIRSCOREUNKNOWNMULTIPLESIMPLEMATCH1,n("", " -sequence=" + str(value))[value is not None]1File containing a sequence alignment.e_requiredRequired sectione_nucmethodMultiple substitution correction methods for nucleotidesChoice0012345("", " -nucmethod=" + str(value))[value is not None and value!=vdef]2Multiple substitution correction methods for nucleotides.e_protmethodMultiple substitution correction methods for proteinsChoice0012("", " -protmethod=" + str(value))[value is not None and value!=vdef]3Multiple substitution correction methods for proteins.e_additionalAdditional sectione_ambiguousUse the ambiguous codes in the calculation.Boolean0("", " -ambiguous")[ bool(value) ]4Option to use the ambiguous codes in the calculation of the Jukes-Cantor method or if the sequences are proteins.e_gapweightWeight given to gapsFloat0.("", " -gapweight=" + str(value))[value is not None and value!=vdef]5Option to weight gaps in the uncorrected (nucleotide) and Jukes-Cantor distance methods.e_positionBase position to analyseInteger123("", " -position=" + str(value))[value is not None and value!=vdef]6Choose base positions to analyse in each codon i.e. 123 (all bases), 12 (the first two bases), 1, 2, or 3 individual bases.e_calculateaCalculate the nucleotide jin-nei parameter 'a'Boolean0("", " -calculatea")[ bool(value) ]7This will force the calculation of parameter 'a' in the Jin-Nei Gamma distance calculation, otherwise the default is 1.0 (see -parametera option).e_parameteraNucleotide jin-nei parameter 'a'Float1.0("", " -parametera=" + str(value))[value is not None and value!=vdef]8User defined parameter 'a' to be use in the Jin-Nei Gamma distance calculation. The suggested value to be used is 1.0 (Jin et al.) and this is the default.e_outputOutput sectione_outfileName of the output file (e_outfile)Filenamedistmat.e_outfile("" , " -outfile=" + str(value))[value is not None]9e_outfile_outoutfile_out optionEmbossDistanceMatrixAbstractTexte_outfileautoTurn off any promptingString" -auto -stdout"10
Programs-5.1.2/seqretsetall.xml 0000644 0001750 0000156 00000013473 12072525233 015416 0 ustar bneron sis
seqretsetallEMBOSS6.3.1EMBOSSEuropean Molecular Biology Open Software SuiteRice,P. Longden,I. and Bleasby, A.EMBOSS: The European Molecular Biology Open Software Suite (2000) Rice,P. Longden,I. and Bleasby, A. Trends in Genetics 16, (6) pp276--277http://emboss.sourceforge.net/downloadhttp://emboss.sourceforge.netseqretsetallReads and writes (returns) many sets of sequenceshttp://bioweb2.pasteur.fr/docs/EMBOSS/seqretsetall.htmlhttp://emboss.sourceforge.net/docs/themessequence:editseqretsetalle_inputInput sectione_featureUse feature informationBoolean0("", " -feature")[ bool(value) ]1e_sequencesequence optionSequenceEMBLFASTAGCGGENBANKNBRFCODATARAWSWISSPROTGFF2,n("", " -sequence=" + str(value))[value is not None]2e_outputOutput sectione_outseqName of the output sequence file (e_outseq)Filenameseqretsetall.e_outseq("" , " -outseq=" + str(value))[value is not None]3e_osformat_outseqChoose the sequence output formatChoiceFASTAEMBLFASTAGCGGENBANKNBRFCODATARAWSWISSPROTGFF("", " -osformat=" + str(value))[value is not None and value!=vdef]4e_outseq_outoutseq_out optionSequencee_outseqautoTurn off any promptingString" -auto -stdout"5
Programs-5.1.2/garnier.xml 0000644 0001750 0000156 00000020756 12072525233 014337 0 ustar bneron sis
garnierEMBOSS6.3.1EMBOSSEuropean Molecular Biology Open Software SuiteRice,P. Longden,I. and Bleasby, A.EMBOSS: The European Molecular Biology Open Software Suite (2000) Rice,P. Longden,I. and Bleasby, A. Trends in Genetics 16, (6) pp276--277http://emboss.sourceforge.net/downloadhttp://emboss.sourceforge.netgarnierPredicts protein secondary structure using GOR methodhttp://bioweb2.pasteur.fr/docs/EMBOSS/garnier.htmlhttp://emboss.sourceforge.net/docs/themessequence:protein:2D_structurestructure:2D_structuregarniere_inputInput sectione_sequencesequence optionProteinSequenceEMBLFASTAGCGGENBANKNBRFCODATARAWSWISSPROTGFF1,n("", " -sequence=" + str(value))[value is not None]1e_advancedAdvanced sectione_idcIndex decision constants parameter (value from 0 to 6)Integer0("", " -idc=" + str(value))[value is not None and value!=vdef]Value greater than or equal to 0 is requiredvalue >= 0Value less than or equal to 6 is requiredvalue <= 62In their paper, GOR mention that if you know something about the secondary structure content of the protein you are analyzing, you can do better in prediction. 'idc' is an index into a set of arrays, dharr[] and dsarr[], which provide 'decision constants' (dch, dcs), which are offsets that are applied to the weights for the helix and sheet (extend) terms. So, idc=0 says don't use the decision constant offsets, and idc=1 to 6 indicates that various combinations of dch,dcs offsets should be used.e_outputOutput sectione_outfileName of the report fileFilenamegarnier.report("" , " -outfile=" + str(value))[value is not None]3e_rformat_outfileChoose the report output formatChoiceTAGSEQDASGFFDBMOTIFDIFFSEQEMBLEXCELFEATTABLEGENBANKGFFLISTFILEMOTIFNAMETABLECODATAREGIONSSEQTABLESIMPLESRSSWISSTABLETAGSEQ("", " -rformat=" + str(value))[value is not None and value!=vdef]4e_outfile_outoutfile_out optionTexte_rformat_outfile in ['DASGFF', 'DBMOTIF', 'DIFFSEQ', 'EMBL', 'EXCEL', 'FEATTABLE', 'GENBANK', 'GFF', 'LISTFILE', 'MOTIF', 'NAMETABLE', 'CODATA', 'REGIONS', 'SEQTABLE', 'SIMPLE', 'SRS', 'SWISS', 'TABLE', 'TAGSEQ']e_outfileautoTurn off any promptingString" -auto -stdout"5
Programs-5.1.2/fetchSequences.xml 0000644 0001750 0000156 00000005421 11767572177 015670 0 ustar bneron sis
fetchSequences1.0fetch sequencesRetrieve sequences in databases from list of identifier (USA list)EMBOSShttp://emboss.sourceforge.net/http://bioweb2.pasteur.fr/docs/EMBOSS/seqret.htmlhttp://emboss.sourceforge.net/docs/themesdatabase:search:sequenceseqretinputInput sectionUSAListlist of sequences identifier in USA formatGenesIdAbstractTextUSAList1" @" + str(value)2
list of identifiers in USA format:
databank:Acc
( one item per line )
sp:Q74K65
sp:Q2W4W1
sp:P63394
sp:P63393
sp:P18767
sp:Q042G7
outputOutput sectionsequence_outthe sequencesSequence(1,n)"fetchSequences.out"autoTurn off any promptingString" -auto -stdout"10
Programs-5.1.2/fitch.xml 0000644 0001750 0000156 00000060224 11724156742 014007 0 ustar bneron sis
fitchfitchFitch-Margoliash and Least-Squares Distance Methodshttp://bioweb2.pasteur.fr/docs/phylip/doc/fitch.htmlThis program carries out Fitch-Margoliash,
Least Squares, and a number of similar methodsphylogeny:distancefitchString"fitch <fitch.params""fitch <fitch.params"0infileDistances matrix FilePhylipDistanceMatrixAbstractText$infile ne "infile"infile != "infile""ln -s $infile infile && ""ln -s " + str( value ) + " infile && "-5Give a file containing a distance matrix
obtained by distance matrix programs like protdist or
dnadist
5
Alpha 0.000000 0.330447 0.625670 1.032032 1.354086
Beta 0.330447 0.000000 0.375578 1.096290 0.677616
Gamma 0.625670 0.375578 0.000000 0.975798 0.861634
Delta 1.032032 1.096290 0.975798 0.000000 0.226703
Epsilon 1.354086 0.677616 0.861634 0.226703 0.000000
MethodMethod (D)ChoiceFMFM""""ME"D\\n""D\n"fitch.paramsfitch_optionsFitch optionsnegative_branchNegative branch lengths allowed (-)Boolean0($value) ? "-\\n" : ""( "" , "-\n" )[ value ]1fitch.paramspowerPower (P)Float2.0(defined $value and $value != $vdef) ? "P\\n$value\\n" : ""( "" , "P\n" + str(value ) + "\n" )[ value is not None and value != vdef ]1For the Fitch-Margoliash method, which is the default method with this program, P is 2.0. For the Cavalli-Sforza and Edwards least squares method it should be set to 0 (so that the denominator is always 1). An intermediate method is also available in which P is 1.0, and any other value of P, such as 4.0 or -2.3, can also be used. This generates a whole family of methods.Please read the documentation (man distance).fitch.paramsjumble_optionsRandomize optionsjumbleRandomize (jumble) input order (J)Booleannot $user_treenot user_tree0($value) ? "J\\n$jumble_seed\\n$jumble_number\\n" : ""( "" , "J\n"+str( jumble_seed ) +"\n" + str( jumble_number ) +"\n" )[ value ]20fitch.paramsjumble_seedRandom number seed (must be odd)Integer$jumblejumble""""Random number seed must be odd$value > 0 and ($value % 2) != 0value > 0 and (value % 2) != 019jumble_numberNumber of times to jumbleInteger$jumblejumble1""""19bootstrapBootstrap optionsmultipleAnalyze multiple data sets (M)Boolean0($value) ? "M\\n$multiple_number\\n$multiple_seed\\n" : ""( "", "M\n"+str(multiple_number)+"\n"+str(multiple_seed)+"\n")[value]10fitch.paramsmultiple_numberHow many data setsInteger$multiplemultiple""""There must be no more than 1000 datasets for this server$value <= 1000value <= 10009multiple_seedRandom number seed (must be odd)Integer$multiplemultiple""""Random number seed must be odd$value > 0 and ($value % 2) != 0value > 0 and (value % 2) != 019consenseCompute a consensus treeBoolean$multiple and $print_treefilemultiple and print_treefile0($value) ? " && cp infile fitch.infile && cp fitch.outtree intree && consense <consense.params && mv outtree consense.outtree && mv outfile consense.outfile" : ""( "" , " && cp infile fitch.infile && cp fitch.outtree intree && consense <consense.params && mv outtree consense.outtree && mv outfile consense.outfile" )[ value ]10consense_confirmString$consenseconsense"Y\\n""Y\n"1000consense.paramsconsense_terminal_typeString$consenseconsense"T\\n""T\n"-2consense.paramsconsense_outfileConsense output fileText$consenseconsense"consense.outfile""consense.outfile"consense_treefileConsense tree fileTreeNEWICK$consenseconsense"consense.outtree""consense.outtree"user_tree_optUser tree optionsuser_treeUse User tree (default: no, search for best tree) (U)Booleandefined $tree_filetree_file is not None0($value) ? "U\\n" : ""( "" , "U\n")[ value ]You cannot randomize (jumble) your dataset and give a user tree at the same timenot ( $user_tree and $jumble )not ( user_tree and jumble )1To give your tree to the program, you must normally put it in the alignment file, after the sequences, preceded by a line indicating how many trees you give.Here, this will be automatically appended: just give a treefile and the number of trees in it.fitch.paramstree_fileUser Tree fileTreeNEWICK$user_treeuser_tree(defined $value) ? "cat $tree_file >> intree; " : ""("" , "cat "+str( tree_file ) + " >> intree; " )[ value is not None ]-1use_lengthsUse lengths from user trees (N)Boolean$user_treeuser_tree0($value) ? "N\\n" : ""( "" , "N\n" )[ value ]2fitch.paramsoutputOutput optionsprint_treePrint out tree (3)Boolean1($value) ? "" : "3\\n"( "3\n" , "" )[ value ]1Tells the program to print a semi-graphical picture of the tree in the outfile.fitch.paramsprint_treefileWrite out trees onto tree file (4)Boolean1($value) ? "" : "4\\n"( "4\n" , "" )[ value ]1Tells the program to save the tree in a treefile (a standard representation of trees where the tree is specified by a nested pairs of parentheses, enclosing names and separated by commas).fitch.paramsprintdataPrint out the data at start of run (1)Boolean0($value) ? "1\\n" : ""( "" , "1\n" )[ value ]1fitch.paramsother_optionsOther optionsoutgroupOutgroup species root (O)Integer1(defined $value and $value != $vdef) ? "O\\n$value\\n" : ""( "" , "O\n" +str( value )+ "\n" )[ value is not None and value != vdef]Please enter a value greater than 0$value > 0value > 01fitch.paramstriangularMatrix formatChoicesquaresquare""""lower"L\\n""L\n"upper"R\\n""R\n"1fitch.paramssubreplicatesSubreplicates (S)Boolean0($value) ? "S\\n" : ""( "" , "S\n" )[ value ]1If the S (subreplication) option is in effect, the above degrees of freedom must be modified by noting that N is not n(n-1)/2 but is the sum of the numbers of replicates of all cells in the distance matrix read in, which may be either square or triangular. A further explanation of the statistical test of the clock is given in a paper of mine (Felsenstein, 1986).fitch.paramsglobalGlobal rearrangements (G)Boolean0($value) ? "G\\n" : ""( "", "G\n")[ value ]1is the Global search option. This causes, after the last species is added to the tree, each possible group to be removed and re-added. This improves the result, since the position of every species is reconsidered. It approximately triples the run-time of the program. It is not an option in KITSCH because it is the default and is always in force there.fitch.paramsoutfileFitch output fileText" && mv outfile fitch.outfile"" && mv outfile fitch.outfile""fitch.outfile""fitch.outfile"treefileFitch tree fileTreeNEWICK$print_treefileprint_treefile" && mv outtree fitch.outtree"" && mv outtree fitch.outtree""fitch.outtree""fitch.outtree"confirmString"Y\\n""Y\n"1000fitch.paramsterminal_typeString"0\\n""0\n"-1fitch.params
Programs-5.1.2/golden.xml 0000644 0001750 0000156 00000013777 12127560631 014167 0 ustar bneron sis
golden1.1aGOLDENFetch a database entryN. Jolyftp://ftp.pasteur.fr/pub/gensoft/projects/golden/database:search:sequencegoldendbDatabaseChoicenull" $db:"" " + db + ":"2queryQuery (Entry name or Accession number)String"$value"str(value)3acSearch with Accession number only (-a)Boolean0($value) ? " -a" : ""( "" , " -a" )[ value ]1IdSearch with entry name only (-i)Boolean0($value) ? " -i" : ""( "" , " -i" )[ value ]1nucleic_sequence_outSequence$db =~ /^(embl|genbank|imgt|rdpii)$/db in ( 'embl' , 'genbank' , 'imgt' ,'rdpii' )DNASequenceEMBLGENBANKEMBLGENBANKGENBANK"golden.out""golden.out"protein_sequence_outSequence$db =~ /^(genpept|uniprot)$/db in ( 'genpept' , 'uniprot' )ProteinSequenceGENBANKSWISSPROT"golden.out""golden.out"refseq_outSequence$db eq 'refseq'db == 'refseq' DNAProteinSequenceGENBANK"golden.out""golden.out"motif_outMotif$db eq 'prosite'db == 'prosite'ProteinAbstractTextMotifPROSITE"golden.out""golden.out"enzyme_outEnzyme$db eq 'enzyme'db == 'enzyme'ProteinAbstractTextEnzymeENZYME"golden.out""golden.out"
Programs-5.1.2/cai.xml 0000644 0001750 0000156 00000103101 12072525233 013426 0 ustar bneron sis
caiEMBOSS6.3.1EMBOSSEuropean Molecular Biology Open Software SuiteRice,P. Longden,I. and Bleasby, A.EMBOSS: The European Molecular Biology Open Software Suite (2000) Rice,P. Longden,I. and Bleasby, A. Trends in Genetics 16, (6) pp276--277http://emboss.sourceforge.net/downloadhttp://emboss.sourceforge.netcaiCalculate codon adaptation indexhttp://bioweb2.pasteur.fr/docs/EMBOSS/cai.htmlhttp://emboss.sourceforge.net/docs/themessequence:nucleic:codon_usagecaie_inputInput sectione_seqallseqall optionDNASequenceEMBLFASTAGCGGENBANKNBRFCODATARAWSWISSPROTGFF1,n("", " -seqall=" + str(value))[value is not None]1e_cfilecfile optionChoiceEyeast_cai.cutEacc.cutEacica.cutEadenovirus5.cutEadenovirus7.cutEagrtu.cutEaidlav.cutEanasp.cutEani.cutEani_h.cutEanidmit.cutEarath.cutEasn.cutEath.cutEatu.cutEavi.cutEazovi.cutEbacme.cutEbacst.cutEbacsu.cutEbacsu_high.cutEbja.cutEbly.cutEbme.cutEbmo.cutEbna.cutEbommo.cutEbov.cutEbovin.cutEbovsp.cutEbpphx.cutEbraja.cutEbrana.cutEbrare.cutEbst.cutEbsu.cutEbsu_h.cutEcac.cutEcaeel.cutEcal.cutEcanal.cutEcanfa.cutEcaucr.cutEccr.cutEcel.cutEchi.cutEchick.cutEchicken.cutEchisp.cutEchk.cutEchlre.cutEchltr.cutEchmp.cutEchnt.cutEchos.cutEchzm.cutEchzmrubp.cutEcloab.cutEcpx.cutEcre.cutEcrigr.cutEcrisp.cutEctr.cutEcyapa.cutEdayhoff.cutEddi.cutEddi_h.cutEdicdi.cutEdicdi_high.cutEdog.cutEdro.cutEdro_h.cutEdrome.cutEdrome_high.cutEdrosophila.cutEeca.cutEeco.cutEeco_h.cutEecoli.cutEecoli_high.cutEemeni.cutEemeni_high.cutEemeni_mit.cutEerwct.cutEf1.cutEfish.cutEfmdvpolyp.cutEhaein.cutEhalma.cutEhalsa.cutEham.cutEhha.cutEhin.cutEhma.cutEhorvu.cutEhum.cutEhuman.cutEkla.cutEklepn.cutEklula.cutEkpn.cutElacdl.cutElla.cutElyces.cutEmac.cutEmacfa.cutEmaize.cutEmaize_chl.cutEmam_h.cutEmammal_high.cutEmanse.cutEmarpo_chl.cutEmedsa.cutEmetth.cutEmixlg.cutEmouse.cutEmsa.cutEmse.cutEmta.cutEmtu.cutEmus.cutEmussp.cutEmva.cutEmyctu.cutEmze.cutEmzecp.cutEncr.cutEneigo.cutEneu.cutEneucr.cutEngo.cutEoncmy.cutEoncsp.cutEorysa.cutEorysa_chl.cutEpae.cutEpea.cutEpet.cutEpethy.cutEpfa.cutEphavu.cutEphix174.cutEphv.cutEphy.cutEpig.cutEplafa.cutEpolyomaa2.cutEpombe.cutEpombecai.cutEpot.cutEppu.cutEprovu.cutEpse.cutEpseae.cutEpsepu.cutEpsesm.cutEpsy.cutEpvu.cutErab.cutErabbit.cutErabit.cutErabsp.cutErat.cutEratsp.cutErca.cutErhile.cutErhime.cutErhm.cutErhoca.cutErhosh.cutEric.cutErle.cutErme.cutErsp.cutEsalsa.cutEsalsp.cutEsalty.cutEsau.cutEschma.cutEschpo.cutEschpo_cai.cutEschpo_high.cutEsco.cutEserma.cutEsgi.cutEsheep.cutEshp.cutEshpsp.cutEsli.cutEslm.cutEsma.cutEsmi.cutEsmu.cutEsoltu.cutEsoy.cutEsoybn.cutEspi.cutEspiol.cutEspn.cutEspo.cutEspo_h.cutEspu.cutEsta.cutEstaau.cutEstrco.cutEstrmu.cutEstrpn.cutEstrpu.cutEsty.cutEsus.cutEsv40.cutEsyhsp.cutEsynco.cutEsyncy.cutEsynsp.cutEtbr.cutEtcr.cutEter.cutEtetsp.cutEtetth.cutEtheth.cutEtob.cutEtobac.cutEtobac_chl.cutEtobcp.cutEtom.cutEtrb.cutEtrybr.cutEtrycr.cutEvco.cutEvibch.cutEwheat.cutEwht.cutExel.cutExenla.cutExenopus.cutEyeast.cutEyeast_cai.cutEyeast_high.cutEyeast_mit.cutEyeastcai.cutEyen.cutEyeren.cutEyerpe.cutEysc.cutEysc_h.cutEyscmt.cutEysp.cutEzebrafish.cutEzma.cut("", " -cfile=" + str(value))[value is not None and value!=vdef]2e_outputOutput sectione_outfileName of the output file (e_outfile)Filenamecai.e_outfile("" , " -outfile=" + str(value))[value is not None]3e_outfile_outoutfile_out optionCaiReportReporte_outfileautoTurn off any promptingString" -auto -stdout"4
Programs-5.1.2/html4blast.xml 0000644 0001750 0000156 00000013172 12265214442 014761 0 ustar bneron sis
html4blast1.7ahtml4blastHTML blast results formatterNicolas Jolyftp://ftp.pasteur.fr/pub/gensoft/projects/html4blast/database:search:displayhtml4blastString"html4blast""html4blast"1inputBlast input fileBlastTextReportReport" $value"" %s" % str(value)10linksDatabase linksChoicegoldennone" -n"" -n"golden""""srs" -s"" -s"extern" -e"" -e"2graphGraphical alignment summary (-g)Boolean1($value) ? " -g" : ""("" , " -g")[ value ]3hsplineDraw one HSP per graphic line (-l)Boolean$graphgraph0($value) ? " -l" : ""("" , " -l")[ value ]4queryimagenameGenerate query based images names (-q)Boolean$graphgraph0($value) ? " -q" : ""("" , " -q")[ value ]5outfileOutfile name (-o)Filenameblast.html(defined $value and $value ne $vdef) ? " -o $value" : ""("", " -o " + str(value))[value is not None and value != vdef]6outputOutput fileBlastHtmlReportReport$outfileoutfileimagePictureBinary$graphgraph"*.png""*.gif""*.png""*.gif"
Programs-5.1.2/clustalw-multialign.xml 0000644 0001750 0000156 00000127776 12073003734 016721 0 ustar bneron sis
clustalw-multialignClustalw: Multiple alignmentDo full multiple alignmentalignment:multipleclustalw -align inputData Inputsequences_inputSequences File ( a file containing several sequences ) (-infile)not $alignment_input or ($sequences_input and $alignment_input)not alignment_input or (sequences_input and alignment_input)SequenceFASTANBRFEMBLGCGGDESWISSPROT2,n" -infile=$value"" -infile=" + str( value )Can not handle both Sequence and Alignment at the same timenot $alignment_inputnot alignment_input1alignment_inputAligned sequencesnot $sequences_input or ($sequences_input and $alignment_input)not sequences_input or (sequences_input and alignment_input)ProteinDNAAlignmentCLUSTALFASTA1" -infile=$value"" -infile=" + str( value )Can not handle both Sequence and Alignment at the same timenot $sequences_inputnot sequences_input When the sequences are aligned (all sequences
have the same length and at least one sequence has at least one
gap)generalGeneral settings2quicktreeToggle Slow/Fast pairwise alignments (-quicktree)Choiceslowslowfast($value eq "fast") ? " -quicktree" : ""( "" , " -quicktree")[ value == "fast"]slow: by dynamic programming (slow but accurate)fast: method of Wilbur and Lipman (extremely fast but approximate)typeseqProtein or DNA (-type)Choiceautoautoproteindna(defined $value) ? " -type=$value" : ""("", " -type="+str(value))[value is not None]multalignMultiple Alignments parameters3Multiple alignments are carried out in 3 stages :1) all sequences are compared to each other (pairwise alignments);2) a dendrogram (like a phylogenetic tree) is constructed, describing the approximate groupings of the sequences by similarity (stored in a file).3) the final multiple alignment is carried out, using the dendrogram as a guide.Pairwise alignment parameters control the speed/sensitivity of the initial alignments.Multiple alignment parameters control the gaps in the final multiple alignments.gapopenGap opening penalty (-gapopen)Float10.00(defined $value and $value != $vdef) ? " -gapopen=$value" : ""( "" , " -gapopen=" + str( value ))[ value is not None and value != vdef ]gapextGap extension penalty (-gapext)Float0.20(defined $value and $value != $vdef) ? " -gapext=$value" : ""( "" , " -gapext=" + str( value ))[ value is not None and value != vdef ]endgapsNo end gap separation penalty (-endgaps)Boolean0($value) ? " -endgaps" : ""( "" ," -endgaps" )[ value ]End gap separation treats end gaps just
like internal gaps for the purposes of avoiding gaps that
are too close (set by GAP SEPARATION DISTANCE above). If
you turn this off, end gaps will be ignored for this
purpose. This is useful when you wish to align fragments
where the end gaps are not biologically meaningful.gapdistGap separation penalty range (-gapdist)Integer8(defined $value and $value != $vdef) ? " -gapdist=$value" : ""( "" , " -gapdist=" + str( value ))[ value is not None and value != vdef]Gap separation distance tries to decrease
the chances of gaps being too close to each other. Gaps
that are less than this distance apart are penalised more
than other gaps. This does not prevent close gaps; it makes
them less frequent, promoting a block-like appearance of
the alignment.maxdivDelay divergent sequences : % ident. for delay (-maxdiv)Integer30(defined $value and $value != $vdef) ? " -maxdiv=$value" : ""( "" , " -maxdiv=" + str( value ))[ value is not None and value != vdef ]Delays the alignment of the most distantly
related sequences until after the most closely related
sequences have been aligned. The setting shows the percent
identity level required to delay the addition of a
sequence; sequences that are less identical than this level
to any other sequences will be aligned later.newtreeFile for new guide tree (-newtree)Filename(defined $value) ? " -newtree=$value" : ""( "" , " -newtree=" + str( value ))[value is not None]newtreefileOutput treeTreeNEWICKdefined $newtreenewtree is not None$newtreenewtreeusetreeFile for old guide tree (-usetree)TreeNEWICK(defined $value) ? " -usetree=$value" : ""( "" ," -usetree=" + str( value ))[value is not None]You can give a previously computed tree (.dnd file) - on the same datamultalign_protProtein parameters$typeseq eq "protein"typeseq == "protein"matrixProtein weight matrix (-matrix)Choicegonnetgonnetblosumpamid(defined $value and $value ne $vdef) ? " -matrix=$value" : ""("", " -matrix="+str(value))[value is not None and value!=vdef]There are three 'in-built' series of weight
matrices offered. Each consists of several matrices which
work differently at different evolutionary distances. To
see the exact details, read the documentation. Crudely, we
store several matrices in memory, spanning the full range
of amino acid distance (from almost identical sequences to
highly divergent ones). For very similar sequences, it is
best to use a strict weight matrix which only gives a high
score to identities and the most favoured conservative
substitutions. For more divergent sequences, it is
appropriate to use 'softer' matrices which give a high
score to many other frequent substitutions.BLOSUM (Henikoff). These matrices appear to
be the best available for carrying out data base similarity
(homology searches). The matrices used are: Blosum80, 62,
40 and 30.The Gonnet Pam 250 matrix has been reported
as the best single matrix for alignment, if you only choose
one matrix. Our experience with profile database searches
is that the Gonnet series is unambiguously superior to the
Blosum series at high divergence. However, we did not get
the series to perform systematically better than the Blosum
series in Clustal W (communication of the authors).PAM (Dayhoff). These have been extremely
widely used since the late '70s. We use the PAM 120, 160,
250 and 350 matrices.negativeNegative values in matrix ? (-negative)Boolean0($value) ? " -negative" : ""( "" , " -negative" )[ value ]pgapResidue specific gaps off (-nopgap)Boolean1($value) ? " -nopgap" : ""( "" , " -nopgap" )[ value ]Residue specific penalties are amino acid
specific gap penalties that reduce or increase the gap
opening penalties at each position in the alignment or
sequence. As an example, positions that are rich in glycine
are more likely to have an adjacent gap than positions that
are rich in valine.Table of residue specific gap modification
factors:A 1.13 M 1.29C 1.13 N 0.63D 0.96 P 0.74E 1.31 Q 1.07F 1.20 R 0.72G 0.61 S 0.76H 1.00 T 0.89I 1.32 V 1.25K 0.96 Y 1.00L 1.21 W 1.23The values are normalised around a mean value of 1.0 for H. The lower the value, the greater the chance of having an adjacent gap. These are derived from the original table of relative frequencies of gaps adjacent to each residue (12) by subtraction from 2.0.hgapHydrophilic gaps off (-nohgap)Boolean1($value) ? " -nohgap" : ""( "" , " -nohgap" )[ value ]Hydrophilic gap penalties are used to
increase the chances of a gap within a run (5 or more
residues) of hydrophilic amino acids; these are likely to
be loop or random coil regions where gaps are more
common. The residues that are 'considered' to be
hydrophilic are set by menu item 3.hgapresiduesHydrophilic residues list (-hgapresidues)MultipleChoiceRNDQEGKPSARNDCQEGHILKMFPSTWYV($value and $value ne $vdef) ? " -hgapresidues=\\"$value\\"" : ""( '' , ' -hgapresidues="%s"' % str(value) )[ value and value != vdef ]multalign_dnaDNA parameters$typeseq eq "dna"typeseq == "dna"dnamatrixDNA weight matrix (-dnamatrix)Choiceiubiubclustalw(defined $value and $value ne $vdef) ? " -dnamatrix=$value" : ""("", " -dnamatrix=" + str(value))[value is not None and value!=vdef]1) IUB. This is the default scoring matrix
used by BESTFIT for the comparison of nucleic acid
sequences. X's and N's are treated as matches to any IUB
ambiguity symbol. All matches score 1.9; all mismatches for
IUB symbols score 0.2) CLUSTALW(1.6). The previous system used
by ClustalW, in which matches score 1.0 and mismatches
score 0. All matches for IUB symbols also score 0.transweightTransitions weight (between 0 and 1) (-transweight)Float0.5(defined $value and $value != $vdef) ? " -transweight=$value" : ""( "" , " -transweight=" + str( value ) )[ value is not None and value != vdef ]A weight of
zero means that the transitions are scored as mismatches; a
weight of 1 gives transitions the full match score. For
distantly related DNA sequences, the weight should be near
to zero; for closely related sequences it can be useful to
assign a higher score.fastpwFast Pairwise Alignments parameters$quicktree eq "fast"quicktree == "fast"4These similarity scores are calculated from fast,
approximate, global alignments, which are controlled by 4
parameters. 2 techniques are used to make these alignments very
fast: 1) only exactly matching fragments (k-tuples) are
considered; 2) only the 'best' diagonals (the ones with most
k-tuple matches) are used.ktupleWord size (-ktuple)Integer1(defined $value and $value != $vdef) ? " -ktuple=$value" : ""( "" , " -ktuple=" + str( value ) )[value is not None and value != vdef ]2K-TUPLE SIZE: This is the size of exactly matching fragment that is used. INCREASE for speed (max= 2 for proteins; 4 for DNA), DECREASE for sensitivity. For longer sequences (e.g. >1000 residues) you may need to increase the default.topdiagsNumber of best diagonals (-topdiags)Integer5(defined $value and $value != $vdef) ? " -topdiags=$value" : ""( "" , " -topdiags=" + str( value ))[value is not None and value != vdef ]The number of k-tuple matches on each
diagonal (in an imaginary dot-matrix plot) is
calculated. Only the best ones (with most matches) are used
in the alignment. This parameter specifies how
many. Decrease for speed; increase for sensitivity.windowWindow around best diags (-window)Integer5(defined $value and $value != $vdef) ? " -window=$value" : ""( "" , " -window=" + str( value ) )[ value is not None and value != vdef ]WINDOW SIZE: This is the number of
diagonals around each of the 'best' diagonals that will be
used. Decrease for speed; increase for sensitivitypairgapGap penalty (-pairgap)Float3(defined $value and $value != $vdef) ? " -pairgap=$value" : ""( "" , " -pairgap=" + str( value ))[ value is not None and value != vdef ]This is a penalty for each gap in the fast
alignments. It has little affect on the speed or
sensitivity except for extreme values.scorePercent or absolute score ? (-score)Choicepercentpercentabsolute(defined $value and $value ne $vdef) ? " -score=$value" : ""( "" , " -score=" +str( value ) )[value is not None or value != vdef]slowpwSlow Pairwise Alignments parameters$quicktree eq "slow"quicktree == "slow"4These parameters do not have any affect on the
speed of the alignments. They are used to give initial alignments
which are then rescored to give percent identity scores. These %
scores are the ones which are displayed on the screen. The scores
are converted to distances for the trees.pwgapopenGap opening penalty (-pwgapopen)Float10.00(defined $value and $value != $vdef) ? " -pwgapopen=$value" : ""( "" , " -pwgapopen=" + str( value ) )[ value is not None and value != vdef ]pwgapextGap extension penalty (-pwgapext)Float0.10(defined $value and $value != $vdef) ? " -pwgapext=$value" : ""( "" , " -pwgapext=" + str( value ) )[ value is not None and value != vdef ]slowpw_protProtein parameters$typeseq eq "protein"typeseq == "protein"pwmatrixProtein weight matrix (-pwmatrix)Choicegonnetblosumgonnetpamid(defined $value and $value ne $vdef) ? " -pwmatrix=$value" : ""( "" , " -pwmatrix=" + str(value) )[value is not None and value != vdef ]The scoring table which describes the
similarity of each amino acid to each other. For DNA, an
identity matrix is used.BLOSUM (Henikoff). These matrices appear to
be the best available for carrying out data base similarity
(homology searches). The matrices used are: Blosum80, 62,
40 and 30.The Gonnet Pam 250 matrix has been reported
as the best single matrix for alignment, if you only choose
one matrix. Our experience with profile database searches
is that the Gonnet series is unambiguously superior to the
Blosum series at high divergence. However, we did not get
the series to perform systematically better than the Blosum
series in Clustal W (communication of the authors).PAM (Dayhoff). These have been extremely
widely used since the late '70s. We use the PAM 120, 160,
250 and 350 matrices.slowpw_dnaDNA parameters$typeseq eq "dna"typeseq == "dna"pwdnamatrixDNA weight matrix (-pwdnamatrix)Choiceiubiubclustalw(defined $value and $value ne $vdef) ? " -pwdnamatrix=$value" : ""( "" , " -pwdnamatrix=" + str(value) )[ value is not None and value != vdef ]For DNA, a single matrix (not a series) is
used. Two hard-coded matrices are available:1) IUB. This is the default scoring matrix
used by BESTFIT for the comparison of nucleic acid
sequences. X's and N's are treated as matches to any IUB
ambiguity symbol. All matches score 1.9; all mismatches for
IUB symbols score 0.2) CLUSTALW(1.6). The previous system used
by ClustalW, in which matches score 1.0 and mismatches
score 0. All matches for IUB symbols also score 0.outputparamOutput parameters5outputformatOutput format (-output)ChoicenullnullFASTAGCGGDEPHYLIPIPIRNEXUS(defined $value ) ? " -output=$value" : ""( "" , " -output=" + str( value) )[ value is not None ]seqnosOutput sequence numbers in the output file (for clustalw output only) (-seqnos)Booleannot defined $outputformatoutputformat is None0(defined $value and $value != $vdef) ? " -seqnos=on" : ""( "" , " -seqnos=on")[ value is not None and value != vdef]outorderResult order (-outorder)Choicealignedinputaligned(defined $value and $value ne $vdef) ? " -outorder=$value" : ""( "" , " -outorder=" + str(value))[ value is not None and value != vdef ]outfileSequence alignment file name (-outfile)Filename(defined $value) ? " -outfile=$value" : ""( "" , " -outfile=" + str( value))[ value is not None ]clustalaligfileAlignment fileAlignmentCLUSTALnot defined $outputformatoutputformat is None(defined $outfile)? "$outfile":"*.aln"("*.aln", str(outfile))[outfile is not None]In the conservation line output in the clustal format alignment file, three characters are used:'*' indicates positions which have a single, fully conserved residue.':' indicates that one of the following 'strong' groups is fully conserved (STA,NEQK,NHQK,NDEQ,QHRK,MILV,MILF,HY,FYW).'.' indicates that one of the following 'weaker' groups is fully conserved (CSA,ATV,SAG,STNK,STPA,SGND,SNDEQK,NDEQHK,NEQHRK,FVLIM,HFY).These are all the positively scoring groups that occur in the Gonnet Pam250
matrix. The strong and weak groups are defined as strong score >0.5 and weak
score =<0.5 respectively.aligfileAlignment fileAlignment$outputformat =~ /^(NEXUS|GCG|PHYLIPI|FASTA)$/outputformat in [ "NEXUS", "GCG", "PHYLIPI","FASTA"](defined $outfile)? "$outfile":"*.fasta *.nxs *.phy *.msf"{ "OUTFILE":outfile, "FASTA":"*.fasta", "NEXUS": "*.nxs", "PHYLIPI": "*.phy" , 'GCG': '*.msf' }[( "OUTFILE", outputformat)[outfile is None]]seqfileSequences fileSequenceNBRFGDE$outputformat =~ /^(GDE|PIR)$/outputformat in [ 'GDE', 'PIR' ](defined $outfile)? "$outfile":"*.gde *.pir"{ "OUTFILE":outfile, 'GDE':'*.gde', 'PIR':'*.pir}[( "OUTFILE", outputformat)[outfile is None]]dndfileTree fileTreeNEWICKnot defined $newtreenewtree is None"*.dnd""*.dnd"gde_lowerUpper case (for GDE output only) (-case)Boolean$outputformat eq "GDE"outputformat == "GDE"0($value) ? " -case=upper" : ""( "" , " -case=upper" )[ value ]2
Programs-5.1.2/pepcoil.xml 0000644 0001750 0000156 00000023011 12072525233 014326 0 ustar bneron sis
pepcoilEMBOSS6.3.1EMBOSSEuropean Molecular Biology Open Software SuiteRice,P. Longden,I. and Bleasby, A.EMBOSS: The European Molecular Biology Open Software Suite (2000) Rice,P. Longden,I. and Bleasby, A. Trends in Genetics 16, (6) pp276--277http://emboss.sourceforge.net/downloadhttp://emboss.sourceforge.netpepcoilPredicts coiled coil regions in protein sequenceshttp://bioweb2.pasteur.fr/docs/EMBOSS/pepcoil.htmlhttp://emboss.sourceforge.net/docs/themessequence:protein:2D_structuresequence:protein:motifsstructure:2D_structurepepcoile_inputInput sectione_sequencesequence optionProteinSequenceEMBLFASTAGCGGENBANKNBRFCODATARAWSWISSPROTGFF1,n("", " -sequence=" + str(value))[value is not None]1e_requiredRequired sectione_windowWindow size (value from 7 to 28)Integer28("", " -window=" + str(value))[value is not None and value!=vdef]Value greater than or equal to 7 is requiredvalue >= 7Value less than or equal to 28 is requiredvalue <= 282e_outputOutput sectione_coilReport coiled coil regionsBoolean1(" -nocoil", "")[ bool(value) ]3e_frameShow coil frameshiftsBooleane_coil0("", " -frame")[ bool(value) ]4Yes if -coil is truee_otherReport non coiled coil regionsBoolean0("", " -other")[ bool(value) ]5e_outfileName of the report fileFilenamepepcoil.report("" , " -outfile=" + str(value))[value is not None]6e_rformat_outfileChoose the report output formatChoiceMOTIFDASGFFDBMOTIFDIFFSEQEMBLEXCELFEATTABLEGENBANKGFFLISTFILEMOTIFNAMETABLECODATAREGIONSSEQTABLESIMPLESRSSWISSTABLETAGSEQ("", " -rformat=" + str(value))[value is not None and value!=vdef]7e_outfile_outoutfile_out optionTexte_rformat_outfile in ['DASGFF', 'DBMOTIF', 'DIFFSEQ', 'EMBL', 'EXCEL', 'FEATTABLE', 'GENBANK', 'GFF', 'LISTFILE', 'MOTIF', 'NAMETABLE', 'CODATA', 'REGIONS', 'SEQTABLE', 'SIMPLE', 'SRS', 'SWISS', 'TABLE', 'TAGSEQ']e_outfileautoTurn off any promptingString" -auto -stdout"8
Programs-5.1.2/prose.xml 0000644 0001750 0000156 00000006677 12265214442 014047 0 ustar bneron sis
prose0.02aPROSEProsite Pattern searchK. Schuererftp://ftp.pasteur.fr/pub/gensoft/projects/prose/sequence:protein:patternproseseqfileProtein Sequence FileSequenceFASTA" $value"" " + str(value)2abundantInclude abundant patterns (-s)Boolean0($value) ? " -s" : ""("", " -s")[value]reportReport occurrences (-m)Choiceshortshortlongall($value ne $vdef) ? " -m $value" : ""("", " -m " + str(value))[value != vdef]casePerform case sensitive search (-c)Boolean0($value) ? " -c" : ""("", " -c")[value]plistPattern list file (-l)ProsePatternAbstractText(defined $value) ? " -l $value" : ""("", " -l " + str(value))[value is not None]This file require exactly one pattern per
line, in the following format:NAME followed by PATTERN.
Programs-5.1.2/geecee.xml 0000644 0001750 0000156 00000007461 12072525233 014123 0 ustar bneron sis
geeceeEMBOSS6.3.1EMBOSSEuropean Molecular Biology Open Software SuiteRice,P. Longden,I. and Bleasby, A.EMBOSS: The European Molecular Biology Open Software Suite (2000) Rice,P. Longden,I. and Bleasby, A. Trends in Genetics 16, (6) pp276--277http://emboss.sourceforge.net/downloadhttp://emboss.sourceforge.netgeeceeCalculate fractional GC content of nucleic acid sequenceshttp://bioweb2.pasteur.fr/docs/EMBOSS/geecee.htmlhttp://emboss.sourceforge.net/docs/themessequence:nucleic:cpg_islandsgeeceee_inputInput sectione_sequencesequence optionDNASequenceEMBLFASTAGCGGENBANKNBRFCODATARAWSWISSPROTGFF1,n("", " -sequence=" + str(value))[value is not None]1e_outputOutput sectione_outfileName of the output file (e_outfile)Filenamegeecee.e_outfile("" , " -outfile=" + str(value))[value is not None]2e_outfile_outoutfile_out optionGeeceeReportReporte_outfileautoTurn off any promptingString" -auto -stdout"3
Programs-5.1.2/listor.xml 0000644 0001750 0000156 00000015054 12072525233 014217 0 ustar bneron sis
listorEMBOSS6.3.1EMBOSSEuropean Molecular Biology Open Software SuiteRice,P. Longden,I. and Bleasby, A.EMBOSS: The European Molecular Biology Open Software Suite (2000) Rice,P. Longden,I. and Bleasby, A. Trends in Genetics 16, (6) pp276--277http://emboss.sourceforge.net/downloadhttp://emboss.sourceforge.netlistorWrite a list file of the logical OR of two sets of sequenceshttp://bioweb2.pasteur.fr/docs/EMBOSS/listor.htmlhttp://emboss.sourceforge.net/docs/themessequence:editlistore_inputInput sectione_firstsequencesfirstsequences optionSequenceEMBLFASTAGCGGENBANKNBRFCODATARAWSWISSPROTGFF2,n("", " -firstsequences=" + str(value))[value is not None]1e_secondsequencessecondsequences optionSequenceEMBLFASTAGCGGENBANKNBRFCODATARAWSWISSPROTGFF2,n("", " -secondsequences=" + str(value))[value is not None]2e_additionalAdditional sectione_operatorLogical operator to combine sequence listsChoicemobyle_nullmobyle_nullOAXN("", " -operator=" + str(value))[value is not None and value!=vdef]3The following logical operators combine the sequences in the following ways:
OR - gives all that occur in one set or the other
AND - gives only those which occur in both sets
XOR - gives those which only occur in one set or the other, but not in both
NOT - gives those which occur in the first set except for those that also occur in the seconde_outputOutput sectione_outfileName of the output file (e_outfile)Filenameoutfile.list("" , " -outfile=" + str(value))[value is not None]4The list of sequence names will be written to this list filee_outfile_outoutfile_out optionUsaListAbstractTexte_outfileautoTurn off any promptingString" -auto -stdout"5
Programs-5.1.2/pepwheel.xml 0000644 0001750 0000156 00000032126 12072525233 014513 0 ustar bneron sis
pepwheelEMBOSS6.3.1EMBOSSEuropean Molecular Biology Open Software SuiteRice,P. Longden,I. and Bleasby, A.EMBOSS: The European Molecular Biology Open Software Suite (2000) Rice,P. Longden,I. and Bleasby, A. Trends in Genetics 16, (6) pp276--277http://emboss.sourceforge.net/downloadhttp://emboss.sourceforge.netpepwheelDraw a helical wheel diagram for a protein sequencehttp://bioweb2.pasteur.fr/docs/EMBOSS/pepwheel.htmlhttp://emboss.sourceforge.net/docs/themesdisplay:protein:2D_structurestructure:2D_structurepepwheele_inputInput sectione_sequencesequence optionProteinSequenceEMBLFASTAGCGGENBANKNBRFCODATARAWSWISSPROTGFF1,1("", " -sequence=" + str(value))[value is not None]1e_outputOutput sectione_wheelPlot the wheelBoolean1(" -nowheel", "")[ bool(value) ]2e_stepsNumber of steps (value from 2 to 100)Integer18("", " -steps=" + str(value))[value is not None and value!=vdef]Value greater than or equal to 2 is requiredvalue >= 2Value less than or equal to 100 is requiredvalue <= 1003The number of residues plotted per turn is this value divided by the 'turns' value.e_turnsNumber of turns (value from 1 to 100)Integer5("", " -turns=" + str(value))[value is not None and value!=vdef]Value greater than or equal to 1 is requiredvalue >= 1Value less than or equal to 100 is requiredvalue <= 1004The number of residues plotted per turn is the 'steps' value divided by this value.e_graphChoose the e_graph output formatChoicepngpnggifcpspsmetadata(" -graph=" + str(vdef), " -graph=" + str(value))[value is not None and value!=vdef]5e_goutfileName of the output graphFilenamepepwheel_graph("" , " -goutfile=" + str(value))[value is not None]6outgraph_pngGraph filePictureBinarye_graph == "png""*.png"outgraph_gifGraph filePictureBinarye_graph == "gif""*.gif"outgraph_psGraph filePostScriptBinarye_graph == "ps" or e_graph == "cps""*.ps"outgraph_metaGraph filePictureBinarye_graph == "meta""*.meta"outgraph_dataGraph fileTexte_graph == "data""*.dat"e_markupsectionMarkup sectione_amphipathicPrompt for amphipathic residue markingBoolean0("", " -amphipathic")[ bool(value) ]7If this is true then the residues ACFGILMVWY are marked as squares and all other residues are unmarked. This overrides any other markup that you may have specified using the qualifiers '-squares', '-diamonds' and '-octags'.e_squaresMark as squaresStringnot e_amphipathicILVM("", " -squares=" + str(value))[value is not None and value!=vdef]8By default the aliphatic residues ILVM are marked with squares.e_diamondsMark as diamondsStringnot e_amphipathicDENQST("", " -diamonds=" + str(value))[value is not None and value!=vdef]9By default the residues DENQST are marked with diamonds.e_octagsMark as octagonsStringnot e_amphipathicHKR("", " -octags=" + str(value))[value is not None and value!=vdef]10By default the positively charged residues HKR are marked with octagons.autoTurn off any promptingString" -auto -stdout"11
Programs-5.1.2/syco.xml 0000644 0001750 0000156 00000121401 12072525233 013652 0 ustar bneron sis
sycoEMBOSS6.3.1EMBOSSEuropean Molecular Biology Open Software SuiteRice,P. Longden,I. and Bleasby, A.EMBOSS: The European Molecular Biology Open Software Suite (2000) Rice,P. Longden,I. and Bleasby, A. Trends in Genetics 16, (6) pp276--277http://emboss.sourceforge.net/downloadhttp://emboss.sourceforge.netsycoDraw synonymous codon usage statistic plot for a nucleotide sequencehttp://bioweb2.pasteur.fr/docs/EMBOSS/syco.htmlhttp://emboss.sourceforge.net/docs/themessequence:nucleic:codon_usagesequence:nucleic:gene_findingsycoe_inputInput sectione_sequencesequence optionDNASequenceEMBLFASTAGCGGENBANKNBRFCODATARAWSWISSPROTGFF1,1("", " -sequence=" + str(value))[value is not None]1e_cfilecfile optionChoicemobyle_nullmobyle_nullEacc.cutEacica.cutEadenovirus5.cutEadenovirus7.cutEagrtu.cutEaidlav.cutEanasp.cutEani.cutEani_h.cutEanidmit.cutEarath.cutEasn.cutEath.cutEatu.cutEavi.cutEazovi.cutEbacme.cutEbacst.cutEbacsu.cutEbacsu_high.cutEbja.cutEbly.cutEbme.cutEbmo.cutEbna.cutEbommo.cutEbov.cutEbovin.cutEbovsp.cutEbpphx.cutEbraja.cutEbrana.cutEbrare.cutEbst.cutEbsu.cutEbsu_h.cutEcac.cutEcaeel.cutEcal.cutEcanal.cutEcanfa.cutEcaucr.cutEccr.cutEcel.cutEchi.cutEchick.cutEchicken.cutEchisp.cutEchk.cutEchlre.cutEchltr.cutEchmp.cutEchnt.cutEchos.cutEchzm.cutEchzmrubp.cutEcloab.cutEcpx.cutEcre.cutEcrigr.cutEcrisp.cutEctr.cutEcyapa.cutEdayhoff.cutEddi.cutEddi_h.cutEdicdi.cutEdicdi_high.cutEdog.cutEdro.cutEdro_h.cutEdrome.cutEdrome_high.cutEdrosophila.cutEeca.cutEeco.cutEeco_h.cutEecoli.cutEecoli_high.cutEemeni.cutEemeni_high.cutEemeni_mit.cutEerwct.cutEf1.cutEfish.cutEfmdvpolyp.cutEhaein.cutEhalma.cutEhalsa.cutEham.cutEhha.cutEhin.cutEhma.cutEhorvu.cutEhum.cutEhuman.cutEkla.cutEklepn.cutEklula.cutEkpn.cutElacdl.cutElla.cutElyces.cutEmac.cutEmacfa.cutEmaize.cutEmaize_chl.cutEmam_h.cutEmammal_high.cutEmanse.cutEmarpo_chl.cutEmedsa.cutEmetth.cutEmixlg.cutEmouse.cutEmsa.cutEmse.cutEmta.cutEmtu.cutEmus.cutEmussp.cutEmva.cutEmyctu.cutEmze.cutEmzecp.cutEncr.cutEneigo.cutEneu.cutEneucr.cutEngo.cutEoncmy.cutEoncsp.cutEorysa.cutEorysa_chl.cutEpae.cutEpea.cutEpet.cutEpethy.cutEpfa.cutEphavu.cutEphix174.cutEphv.cutEphy.cutEpig.cutEplafa.cutEpolyomaa2.cutEpombe.cutEpombecai.cutEpot.cutEppu.cutEprovu.cutEpse.cutEpseae.cutEpsepu.cutEpsesm.cutEpsy.cutEpvu.cutErab.cutErabbit.cutErabit.cutErabsp.cutErat.cutEratsp.cutErca.cutErhile.cutErhime.cutErhm.cutErhoca.cutErhosh.cutEric.cutErle.cutErme.cutErsp.cutEsalsa.cutEsalsp.cutEsalty.cutEsau.cutEschma.cutEschpo.cutEschpo_cai.cutEschpo_high.cutEsco.cutEserma.cutEsgi.cutEsheep.cutEshp.cutEshpsp.cutEsli.cutEslm.cutEsma.cutEsmi.cutEsmu.cutEsoltu.cutEsoy.cutEsoybn.cutEspi.cutEspiol.cutEspn.cutEspo.cutEspo_h.cutEspu.cutEsta.cutEstaau.cutEstrco.cutEstrmu.cutEstrpn.cutEstrpu.cutEsty.cutEsus.cutEsv40.cutEsyhsp.cutEsynco.cutEsyncy.cutEsynsp.cutEtbr.cutEtcr.cutEter.cutEtetsp.cutEtetth.cutEtheth.cutEtob.cutEtobac.cutEtobac_chl.cutEtobcp.cutEtom.cutEtrb.cutEtrybr.cutEtrycr.cutEvco.cutEvibch.cutEwheat.cutEwht.cutExel.cutExenla.cutExenopus.cutEyeast.cutEyeast_cai.cutEyeast_high.cutEyeast_mit.cutEyeastcai.cutEyen.cutEyeren.cutEyerpe.cutEysc.cutEysc_h.cutEyscmt.cutEysp.cutEzebrafish.cutEzma.cut("", " -cfile=" + str(value))[value is not None and value!=vdef]2Codon usage filee_advancedAdvanced sectione_windowAveraging windowInteger30("", " -window=" + str(value))[value is not None and value!=vdef]3e_uncommonShow common codon usageBoolean0("", " -uncommon")[ bool(value) ]4e_minimumMinimum value for a common codon (value from 0.0 to .99)Float.15("", " -minimum=" + str(value))[value is not None and value!=vdef]Value greater than or equal to 0.0 is requiredvalue >= 0.0Value less than or equal to .99 is requiredvalue <= .995e_outputOutput sectione_plotProduce plotBoolean1(" -noplot", "")[ bool(value) ]6e_graphChoose the e_graph output formatChoicee_plotpngpnggifcpspsmetadata(" -graph=" + str(vdef), " -graph=" + str(value))[value is not None and value!=vdef]7xy_goutfileName of the output graphFilenamee_plotsyco_xygraph("" , " -goutfile=" + str(value))[value is not None]8xy_outgraph_pngGraph filePictureBinarye_plot and e_graph == "png""*.png"xy_outgraph_gifGraph filePictureBinarye_plot and e_graph == "gif""*.gif"xy_outgraph_psGraph filePostScriptBinarye_plot and e_graph == "ps" or e_graph == "cps""*.ps"xy_outgraph_metaGraph filePictureBinarye_plot and e_graph == "meta""*.meta"xy_outgraph_dataGraph fileTexte_plot and e_graph == "data""*.dat"e_outfileName of the output file (e_outfile)Filenamenot e_plotsyco.e_outfile("" , " -outfile=" + str(value))[value is not None]9e_outfile_outoutfile_out optionSycoReportReporte_outfileautoTurn off any promptingString" -auto -stdout"10
Programs-5.1.2/plotcon.xml 0000644 0001750 0000156 00000042053 11672346320 014363 0 ustar bneron sis
plotconEMBOSS6.3.1EMBOSSEuropean Molecular Biology Open Software SuiteRice,P. Longden,I. and Bleasby, A.EMBOSS: The European Molecular Biology Open Software Suite (2000) Rice,P. Longden,I. and Bleasby, A. Trends in Genetics 16, (6) pp276--277http://emboss.sourceforge.net/downloadhttp://emboss.sourceforge.netplotconPlot conservation of a sequence alignmenthttp://bioweb2.pasteur.fr/docs/EMBOSS/plotcon.htmlhttp://emboss.sourceforge.net/docs/themesalignment:multipleplotcone_inputInput sectione_sequencessequences optionAlignmentFASTAMSFPAIRMARKX0MARKX1MARKX2MARKX3MARKX10SRSSRSPAIRSCOREUNKNOWNMULTIPLESIMPLEMATCH1,n("", " -sequences=" + str(value))[value is not None]1File containing a sequence alignmente_scorefileComparison matrix fileChoicemobyle_nullmobyle_nullEBLOSUM30EBLOSUM35EBLOSUM40EBLOSUM45EBLOSUM50EBLOSUM55EBLOSUM60EBLOSUM62EBLOSUM62-12EBLOSUM65EBLOSUM70EBLOSUM75EBLOSUM80EBLOSUM85EBLOSUM90EBLOSUMNEDNAFULLEDNAMATEDNASIMPLEEPAM10EPAM100EPAM110EPAM120EPAM130EPAM140EPAM150EPAM160EPAM170EPAM180EPAM190EPAM20EPAM200EPAM210EPAM220EPAM230EPAM240EPAM250EPAM260EPAM270EPAM280EPAM290EPAM30EPAM300EPAM310EPAM320EPAM330EPAM340EPAM350EPAM360EPAM370EPAM380EPAM390EPAM40EPAM400EPAM410EPAM420EPAM430EPAM440EPAM450EPAM460EPAM470EPAM480EPAM490EPAM50EPAM500EPAM60EPAM70EPAM80EPAM90SSSUB("", " -scorefile=" + str(value))[value is not None and value!=vdef]2This is the scoring matrix file used when comparing sequences. By default it is the file 'EBLOSUM62' (for proteins) or the file 'EDNAFULL' (for nucleic sequences). These files are found in the 'data' directory of the EMBOSS installation.e_requiredRequired sectione_winsizeWindow sizeInteger4("", " -winsize=" + str(value))[value is not None and value!=vdef]3Number of columns to average alignment quality over. The larger this value is, the smoother the plot will be.e_outputOutput sectione_graphChoose the e_graph output formatChoicepngpnggifcpspsmetadata(" -graph=" + str(vdef), " -graph=" + str(value))[value is not None and value!=vdef]4xy_goutfileName of the output graphFilenameplotcon_xygraph("" , " -goutfile=" + str(value))[value is not None]5xy_outgraph_pngGraph filePictureBinarye_graph == "png""*.png"xy_outgraph_gifGraph filePictureBinarye_graph == "gif""*.gif"xy_outgraph_psGraph filePostScriptBinarye_graph == "ps" or e_graph == "cps""*.ps"xy_outgraph_metaGraph filePictureBinarye_graph == "meta""*.meta"xy_outgraph_dataGraph fileTexte_graph == "data""*.dat"autoTurn off any promptingString" -auto -stdout"6
Programs-5.1.2/notseq.xml 0000644 0001750 0000156 00000022470 12072525233 014214 0 ustar bneron sis
notseqEMBOSS6.3.1EMBOSSEuropean Molecular Biology Open Software SuiteRice,P. Longden,I. and Bleasby, A.EMBOSS: The European Molecular Biology Open Software Suite (2000) Rice,P. Longden,I. and Bleasby, A. Trends in Genetics 16, (6) pp276--277http://emboss.sourceforge.net/downloadhttp://emboss.sourceforge.netnotseqWrite to file a subset of an input stream of sequenceshttp://bioweb2.pasteur.fr/docs/EMBOSS/notseq.htmlhttp://emboss.sourceforge.net/docs/themessequence:editnotseqe_inputInput sectione_sequencesequence optionSequenceEMBLFASTAGCGGENBANKNBRFCODATARAWSWISSPROTGFF1,n("", " -sequence=" + str(value))[value is not None]1e_requiredRequired sectione_excludeSequence names to excludeString("", " -exclude=" + str(value))[value is not None]2Enter a list of sequence names or accession numbers to exclude from the sequences read in. The excluded sequences will be written to the file specified in the 'junkout' parameter. The remainder will be written out to the file specified in the 'outseq' parameter.
The list of sequence names can be separated by either spaces or commas.
The sequence names can be wildcarded.
The sequence names are case independent.
An example of a list of sequences to be excluded is:
myseq, hs*, one two three
a file containing a list of sequence names can be specified by giving the file name preceeded by a '@', eg: '@names.dat'e_outputOutput sectione_outseqName of the output sequence file (e_outseq)Filenamenotseq.e_outseq("" , " -outseq=" + str(value))[value is not None]3e_osformat_outseqChoose the sequence output formatChoiceFASTAEMBLFASTAGCGGENBANKNBRFCODATARAWSWISSPROTGFF("", " -osformat=" + str(value))[value is not None and value!=vdef]4e_outseq_outoutseq_out optionSequencee_outseqe_junkoutseqName of the output sequence file (e_junkoutseq)Filenamenotseq.e_junkoutseq("" , " -junkoutseq=" + str(value))[value is not None]5This file collects the sequences which you have excluded from the main output file of sequences.e_osformat_junkoutseqChoose the sequence output formatChoiceFASTAEMBLFASTAGCGGENBANKNBRFCODATARAWSWISSPROTGFF("", " -osformat=" + str(value))[value is not None and value!=vdef]6e_junkoutseq_outjunkoutseq_out optionSequencee_junkoutseqautoTurn off any promptingString" -auto -stdout"7
Programs-5.1.2/Env/ 0000755 0001750 0000156 00000000000 12306362647 012714 5 ustar bneron sis Programs-5.1.2/Env/penncnv_data.xml 0000644 0001750 0000156 00000000216 11672707410 016071 0 ustar bneron sis /path/to/penncnv/executables/dir//path/to/penncnv/shared/data/dir/